# data analysis of US public use census data using stata

This is a data analysis project and the coder has to use the program STATA.

The data that is going to be analyzed is from the Census data years 1980,1990 and 2000. This data is publicly accesible from <[url removed, login to view]>

The project requires the person to be able to manipulate large datasets and compute basic statistics (means, standard deviations,etc), tabulations, regressions (OLS, fixed effects, logit, probit, instrumental variables). All these data analysis should be done using the software STATA (as I would like to check the actual code used to get all statistics and results).

It is a project about analyzing fertility patterns for married women. In particular, I am interested in knowing what is the probability of having another child for women with certain sex composition of their family. For example, for women that have at least one child and he is a boy, how many of them have more than one? And which fraction of them have more than one if the oldest sibling is a girl? And then, construct these same type of analysis when the two oldest siblings are boy-boy, boy-girl, girl-boy, girl-girl. Then, the same kind of analysis for the 8 combinations generated when the woman has at least 3 children (boy-boy-boy, boy-boy-girl, ...). And final this same analysis for women that have at least 4 children. This analysis will be done with census data for years 1970, 1980 and 1990.

The final output consists of:

1) tables in excel with results

2) Clear and detailed explanations about how the samples were constructed (using which variables and which conditions)

3) The actual code used

4) Explanations about the actual dataset (for example, sthg was done in certain way because the dataset only contains this type of variable for certain year).

I will give very clear and more detailed explanations about how to construct the samples, etc while we start working in the project.

## Deliverables

What is required is the final output of the project which (as mentioned before) consists of:

1) tables in excel with results

2) Clear and detailed explanations about how the samples were constructed (using which variables and which conditions)

3) The actual code used

4) Explanations about the actual dataset (for example, sthg was done in certain way because the dataset only contains this type of variable for certain year).

## Platform

The analysis should be done in STATA.

Project ID: #3629584

## 1 freelancer is bidding on average \$85 for this job

sursudevelop

See private message.

\$85 USD in 10 days
(58 Reviews)
4.8