Professional Documents
Culture Documents
Session 4 (21-22 October Tasks) For STATA Section 1 Data Management
Session 4 (21-22 October Tasks) For STATA Section 1 Data Management
Section 2
1. summarize ln_wage
2. Create a histogram of lnwage and comment on distribution.
3. Try following commands
a. table union, contents(mean wage sd wage)
b. table collgrad, contents(mean wage sd wage)
4. Get crosstab of union and College graduate, What percentage of college graduates are union
member? (Answer in the do file)
1. Obtain the association through a scatter plot for wage and age; wage and tenure
2. Obtain the association through a scatter with regression line for wage and tenure
3. Obtain Correlations between variable wage and age; wage and tenure using correlation
matrix
Section 4 Graphs
Section 5
15. Create a new variable wage_cat; where wage <10 is “low”, Wage 10.01-30
is medium, >30 is “High”
4. Try Regression models with wage as dependent variable and independent variables
being union collgrad tenure race. Note race is a string variable, it needs to be converted
as numeric variable with the help of encode command.