Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 36

Phn 3

Quick review
Th mc hin ti
. pwd
D:\Stata12_WinX86_x64\Stata12_WinX86_x64

Thay i th mc lm vic
ng dn c du cch, thm
. cd "E:\TAI LIEU\Stata"
E:\TAI LIEU\Stata

ng dn khng c du cch, khng cn


. thm
cd E:\
E:\
log file
Lu tt c kt qu trn mn hnh
log using filename [, append replace [text|smcl]
Ch lu cc cu lnh trn mn hnh
cmdlog using filename [, append replace]
. cd "E:\TAI LIEU\Stata\Buoi 3"
E:\TAI LIEU\Stata\Buoi 3
. log using test.doc, text

ng (m) log (cmdlog) file tm thi:


log (cmdlog) on (off)
ng log (cmdlog) file:
log (cmdlog) close
do file
File d liu gc: zipped li hoc ch
Read-only
Dofile nn tch ring phn phn tch v
phn lm sch s liu. V d: ADB-
cleandata.do; ADB-analysis.do
do file
Mi dofile c 1 log file ring, nn tn
ging nhau
Hoc mt log file chung
. clear
. log using ADB.doc, text
. do ADB-clean
. do ADB-merge
. do ADB-regressions
. log close
To dofile
Copy cc lnh vo dofile editor
Dng cmdlog
If/in
Lnh if ng cui cu lnh, trc du
,

Lnh in ng cui cu lnh, trc du


,
in 4/5: quan st t 4 n 5
in f/4: 4 quan st u tin
in -4/l: 4 quan st cui cng
STATA color-coded

var1: bin numeric c dn nhn (mu


xanh)
var2: bin string (mu )
var3: bin numeric (mu en)
var4: bin string (mu )
To bin mi
. gen [newvar]= [expression]
To bin mi
. gen newvar = (var1==1 & var2==1)
. gen newvar = (var1==1 & var2<26)
To bin mi
. tab var1, gen (var2)
recode
recode
. recode rep77 rep78 (1 2 = 1 "Below average") ///
(3 = 2 Average) ///
(4 5 = 3 "Above average") ///
(else=9 (nonmissing= 9 No) ///
No) (missing=99 Missing), ///
pre(new) label(newrep)

Nhn gi tr To bin mi: newrep77,


mi newrep78. Tng ng vi
gen (newrep77 newrep78)
replace
. replace oldvar =exp [if] [in]
Nhn data, ch thch

label data Text


notes: text
notes tenbien: text
i tn bin, nhn bin

i tn bin
rename old new, [options]
rename (old1 old2 ...) (new1 new2 ...), [options]
rename old1 old2 ..., {UPPER|lower|Proper}[options]
options: renumber, renumber (#)
addnumber, addnumber (#)
i tn bin

jana1 jana2 jana3 rename jan* *1 jana11 jana21 jana31


jana1 jana2 jana3 rename jan* * a1 a2 a3
jana1 jana2 jana3 rename * *jan jana1jan jana2jan
jana3jan
Nhn bin

label variable [varname] Text


Nhn gi tr ca bin
Bc 1: To nhn cho cc gi tr
label define label1 1 Dong y ///
2 Khong dong y ///
3 Khong biet
Bc 2: Dn nhn gi tr cho cc bin
label values var1 label1
label values var2 label1
hoc
label values var1 var2 label1
Bc 3: Thay i nhn gi tr
label define label1 4 Tu choi tra loi, add
label define label1 4 Khong tra loi, modify
Bin h thng
_N: tng s quan st
_n: s th t ca quan st
To bin h thng theo nhm
by major: gen idmajor=_n
To bin lagged, forward
gen lag1_year=year[_n-1]
gen for1_year=year[_n+1]
gen lag2_year=year[_n-2]
gen for2_year=year[_n+2]
Gi tr missing
C 27 gi tr missing c th c trong
Stata, mc nh gi tr missing l du
chm(.)
Cn li l 26 gi tr missing trng vi 26
k t trong bng ch ci nhng c du
chm ng trc(.a, .b, .c,..).
Cc gi tr missing trong Stata c coi
nh cc s v cng ln.
Gi tr missing
V d: chng ta mun tnh tnh
summarize (m t thng k) vi d liu
auto, ta tnh mean ca bin price, theo
rep78.
sum price if rep78>3 Kt qu bng 1
sum price if rep78>3 & rep78 <. Kt qu bng 2

Variable Obs Mean Std. Dev. Min Max


price 34 6073 2315.435 3748 12990
Variable Obs Mean Std. Dev. Min Max
price 29 6011.38 2055.312 3748 11995

sum price if rep78>3 & !missing(rep78)


sum price if rep78>3 & missing(rep78)==0
Gi tr missing
Chuyn missing sang dng s v ngc
li
. mvencode varlist [if] [in], mv(#|mvc=# [\
mvc=#...] [\ else=#])
. mvdecode varlist [if] [in], mv(numlist |
numlist=mvc [\ numlist=mvc...])

valid_numbers< . < .a < ... < .z.


Mt s p kiu d liu
Hm int(), float(), string(). chng ta mun
p t kiu d liu ny sang kiu d liu
khc Vy n khc g vi cc hm chuyn
kiu nh destring() v tostring()?
V d: display int(3.45) kt qu s l 3
Khong bin: var1-var5 (var1 var2 var3
var4 var5).
Bn cng c th dng cc k t ?, * thay
th cho cc k t; v d: var* - s tm tt c
cc bin bt u bng var.
Hm nh dng - format
C php: format varlist %fmt
Vi %fmt:
%w.df: w l chiu di ca s , d l s ch s sau phn
thp phn
v d: 1.5235 nu nh dng %8.2f 1.52
%w.0g: w chiu di ca s
int %8.0g
byte %8.0g
long %12.0g
float %9.0g
double %10.0g
str# %#s
M t thng k

Cc lnh: summarize, tabulate,


tabstat,tab1, tab2.
M t thng k vi cc bin lin tc.
Bng tn sut v bng tng quan quan
2 chiu(cross-tabulation).
Export d liu
Frequencies
tabulate varname [if] [in] [,options]
options:
missing: tnh missing nh cc gi tr khc
nofreq: khng hin th tn s
nolabel: khng hin th nhn gi tr
sort: sp xp bng theo tn s gim dn
Frequencies and descriptive
statistics
table rowvar [colvar [supercolvar]] [if] [in] ,
options]
options:
contents (freq mean sd min max median)
format (help table)
Frequencies and
crosstabulations
tab var1 var2, sum (var3)
Crosstabs
tabulate varname1 varname2 [if] [in] [, options]
tab2 varlist [if] [in] [weight] [, options]
Options:
col
row
cell
nofreq
missing
nolabel
Descriptive statistics
tabstat varlist [if] [in] [, options]
Options:
by (varname)
stat (mean min max median sd)
col (var): bin cc ct (default)
col (stat): thng k cc ct
nototal
missing
Descriptive statistics
tabstat varlist [if] [in] [, options]
Three way crosstabs
bysort var3: tab var1 var2, col row
Three way crosstabs
bysort var3: tab var1 var2, sum (var4)
Collapse
collapse (stat1) var1 (stat2) var2 (stat3)
newvar1=var1 newvar2=var2, by(varlist)
Quick review
Command
summarize y1 y2 y3
summarize y1 y2 y3, detail
summarize y1if x1> 3 & !
missing(x2)
tabstat y1, stats(mean sd n)
tabstat y1, stats(min p50 max)
by(x1)
tabulate x1
tabulate x1, sort miss
tab1 x1 x2 x3 x4
tabulate x1 x2
Quick review

Command
tabulate x1 x2, column
tabulate x1 x2, missing row all
tab2 x1 x2 x3 x4
tabulate x1, sum(y)
tabulate x1 x2, sum(y) means
by x3, sort: tabulate x1 x2
table y x2 x3, by(x4 x5)
contents(freq)
table x1 x2, contents(mean y1
median y2)

You might also like