SAS Points
H1: U >= 19200 is one-sided because it uses greater than / less than. If it were equal vs. not equal,
the test would be two-sided.
H0: U>=50
H1: U<50
52. The diagonal is the actual value and the points are the values obtained after standardizing. If
the points are distributed along the diagonal, the distribution is normal.
53. P is compared against alpha: the smaller p is relative to alpha, the stronger the evidence against H0.
54. Whenever we talk about y=f(x), the MODEL statement is used, not VAR.
55. When RUN and QUIT are written together, it is called run quit group processing.
56. Whenever a MODEL statement is written, RUN and QUIT should be written together. But in most PROC
steps they are not written.
57. For Bulbwt, H0: U1=U2=U3=U4 and H1: at least one is different.
58. Mean square model / mean square error = F value
59. When p is less than alpha, the variable is contributing, and vice versa.
60. Adding a variable adds explained variability, thereby increasing R^2 (adding variables reduces the
unexplained variance, so the chance of error goes down).
61. Removing a variable removes explained variability, thereby decreasing R^2.
62. LSMEANS = least-squares means; unlike MEANS, it takes missing values into account.
63. Diffogram = shows the pairwise differences between the LS-means and whether they are significant.
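Point 58 (F value = mean square model / mean square error) can be checked by hand. The following is a minimal sketch in Python rather than SAS, using four made-up groups to mirror the U1..U4 hypothesis in point 57; the data and the function name one_way_anova_f are illustrative assumptions, not taken from the Bulbwt data.

```python
def one_way_anova_f(groups):
    """Return (ms_model, ms_error, f_value) for a list of numeric groups."""
    all_values = [x for g in groups for x in g]
    n = len(all_values)          # total observations
    k = len(groups)              # number of groups
    grand_mean = sum(all_values) / n

    # Between-group (model) sum of squares
    ss_model = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within-group (error) sum of squares
    ss_error = sum((x - sum(g) / len(g)) ** 2 for g in groups for x in g)

    ms_model = ss_model / (k - 1)   # model degrees of freedom = k - 1
    ms_error = ss_error / (n - k)   # error degrees of freedom = n - k
    return ms_model, ms_error, ms_model / ms_error


# Made-up data: four groups of three observations each
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7], [4, 5, 6]]
ms_model, ms_error, f_value = one_way_anova_f(groups)
print(ms_model, ms_error, f_value)
```

A large F value means the group means differ by much more than the within-group noise, which is what leads to a small p and rejecting H0.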
proc univariate data=sashelp.class;
var height;
output out=mona pctlpts=67 pctlpre=P; /* pctlpre (percentile prefix) is required when pctlpts is used */
run;

proc univariate data=sashelp.class;
var weight;
output out=mona pctlpts=0 to 100 by 5 pctlpre=P; /* BY steps through the range like a loop; this
now gives P0 P5 P10 P15 etc.; the default step is 1 */
run;
With PROC UNIVARIATE, use the OUTPUT statement, and specify OUT= as well; otherwise it will create an
output file under its own default name.
16-Nov-19
ETL – Extraction, Transformation, Load ------ done by the data transfer team - Hashing aspect, so that
the original data is not visible.
Check for information loss - it should be avoided. (SME – Subject Matter Expert) – Identify the total
number of observations and variables and mail them to the SME to check that the data matches.
SKU – Stock Keeping Unit – Property of the merchant and not the credit card company.
Sanity check
Static file – based on some demographics --- compil client --- a transaction is not a unique record.
Transaction – Frequency
Spend – Monetary
While running the percentile procedure --- if the percentiles and the values are both changing, the
variable is quantitative; otherwise it is qualitative.
Values above Q3 + 1.5*IQR or below Q1 - 1.5*IQR are outliers in the data. IQR = interquartile
range.
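The IQR rule above can be sketched as follows. This is a Python illustration rather than SAS, with made-up data; the quartile method ("inclusive") is an assumption, and SAS may compute quartiles with a different definition, so the fences can differ slightly.

```python
import statistics


def iqr_fences(values):
    """Return (lower_fence, upper_fence) using the 1.5 * IQR rule."""
    # quantiles(..., n=4) returns the three cut points Q1, median, Q3
    q1, _median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr


# Made-up data with one obviously extreme value
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 50]
lower, upper = iqr_fences(data)

# Anything outside the fences is flagged as an outlier
outliers = [v for v in data if v < lower or v > upper]
print(lower, upper, outliers)
```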
Outliers are treated with a threshold value (capping). Example: if the maximum salary in a group is
20 lakh and someone comes in from outside with 20 crore, we bring that value down to 20 lakh.
We treat the outliers first; however, we make sure that missing values are not touched or replaced
while treating the outliers.
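The capping step, with missing values left alone, can be sketched like this. It is a Python illustration with made-up salaries in lakh (20 crore = 2000 lakh); the function name cap_outliers and the use of None for a missing value are assumptions made for the example.

```python
def cap_outliers(values, lower, upper):
    """Clamp each value into [lower, upper]; missing values (None) pass through untouched."""
    return [v if v is None else min(max(v, lower), upper) for v in values]


# Salaries in lakh; None marks a missing value, 2000 lakh is the 20 crore outlier
salaries_lakh = [5, 12, None, 20, 2000]
capped = cap_outliers(salaries_lakh, lower=0, upper=20)
print(capped)
```

The missing entry stays missing, so it can be handled separately after outlier treatment.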
IV – Information Value: tells the association between a variable and the target, i.e. how much
potential the variable has to separate your goods from bads.
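The notes do not give a formula for IV. A common definition, assumed here, sums (share of goods - share of bads) * ln(share of goods / share of bads) over the buckets of a variable. The sketch below is Python with made-up counts; the function name information_value is illustrative, and it assumes no bucket has zero goods or zero bads.

```python
import math


def information_value(buckets):
    """buckets: list of (goods, bads) counts per bucket; no zero counts allowed."""
    total_good = sum(g for g, _ in buckets)
    total_bad = sum(b for _, b in buckets)
    iv = 0.0
    for goods, bads in buckets:
        dist_good = goods / total_good      # share of all goods in this bucket
        dist_bad = bads / total_bad         # share of all bads in this bucket
        woe = math.log(dist_good / dist_bad)  # weight of evidence for the bucket
        iv += (dist_good - dist_bad) * woe
    return iv


# Made-up counts: the first variable separates goods from bads, the second does not
iv_strong = information_value([(80, 20), (20, 80)])
iv_none = information_value([(10, 5), (20, 10)])
print(iv_strong, iv_none)
```

A variable whose buckets have the same mix of goods and bads gets an IV of zero; the more the mix differs across buckets, the higher the IV.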
In variable reduction, if p or alpha has not come up, do not use the word "contributing".
PROC CORR tells you the association, but it cannot tell you variable importance.
yymon-------- format
Data cleansing --- Data prep --- Variable reduction --- Divide data into training and validation ---
detect multicollinearity via logistic and check VIF --- remove any variable whose VIF is greater than 3 ---
variable reduction based on threshold IV --- area under the curve should be the same --- sin c value
should be changed.
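The VIF check in the flow above can be illustrated for the simplest case of two predictors, where VIF = 1 / (1 - r^2) and r is their correlation. This is a Python sketch with made-up data, not the SAS procedure output; with more than two predictors, r^2 is replaced by the R^2 from regressing one predictor on all the others. The cutoff of 3 is the one stated in the notes.

```python
def pearson_r(x, y):
    """Pearson correlation between two equal-length numeric lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def vif_two_predictors(x1, x2):
    """VIF for the two-predictor case: 1 / (1 - r^2)."""
    r = pearson_r(x1, x2)
    return 1.0 / (1.0 - r * r)


VIF_CUTOFF = 3  # threshold taken from the notes

x1 = [1, 2, 3, 4, 5]
vif_low = vif_two_predictors(x1, [2, 1, 4, 3, 5])   # r = 0.8
vif_high = vif_two_predictors(x1, [1, 2, 3, 5, 4])  # r = 0.9
print(vif_low, vif_low > VIF_CUTOFF)
print(vif_high, vif_high > VIF_CUTOFF)
```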
Local maxima = the value where the difference is maximum; check the bucket. It will lie in 3.
Product costing = 10 rs
Campaign costing = 1 rs