Professional Documents
Culture Documents
Ilovepdf Merged 3
Ilovepdf Merged 3
Resource Person L
Emailld : sumathiSS_cs@yahoo.co.in
Qualification : M.Sc.,M.Phit
Experience : 12 years
Cheyyar -6044W.
{r
*
PrinciPal'
'^H#;ltan''"'*'
*
Data science syllabus
Module 1: Python
Flhon is the most important and irecessary topic that every data scientist should have knowledge
about. In this section, our instructors will take you through the basics of Python and areas where
it can be used. You will learn how to use some of the current tools such as Numpy, Pandas, and
Matplotlib. Therefore, module I includes -
. Environment set-up
. Jupyter overview
. Plthon Numpy
. Python Pandas
. Plthon Matplotlib
Module 2: R
Used for statistical and data analysis, R programming language is one of the advanced statistical
languages used in data science. This module teaches you how to explore data sets using R. Here
you will learn -
. An introduction to R
. Data structures in R
. Data visualization with R
. Data analysis with R
Module 3: Statistics
When working with data, the knowledge of statistics is necessary and an important skill set that
'
you must have. In this module, you will learn - '
a Normal distribution
6 -fi;
. Test hypotheses
. Central limit theorem
. Confidence interval
. T-test
. Type I and II errors
. Student's T distribution
This lesson will help you understand how to establish a relationship between two or more
objects. ANOVA oi analysis of variance is used to analyze the differences among sample sets.
Here you will learn -
. Regression
. 'ANOVA
. R square
. Correlation and causation
This is a comprehensive module to help you understand how to make machines or computers
interpret human language. You will learn -
Module 8: Tableau
Tableau is a sophisticated business intelligence tool usqd for data visualization. In this lesson,
you will learn -
. Working with Tableau
. Deep diving with data and
. Creating charts
Ptr; e??fl06
f"dt,ba
'.ndo-America
I n Collegle'
* CheYYar'6Q4 4%,
'. Mappingdata in Tableau
Dashboards and stories
Data science skills that you will master from this course
E t 5
(9
() PrinciPal'
e<( f'l -/
g c-{
(t{
o.
'43ffii:affit*Y";'
oNl i
l
Department of Computer Science
Academic Y ear :2021-2022
Course Schedule - CSDS2l
30 Hours Training Schedule
Module 1: Python -
Environment set-up
Jupyter overview
Python Numpy
Module 2:
07.09.2021 3.45 pm to 4.45 pm R - An introduction to R
Data structures in R
Module 3:
08.09.2021 3.45 pm to 4.45 pm Statistics - Important statistical concepts used in data science
Difference between population and sample
Types of variables
3.45 pm to 4.45 pm
Measures of central tendency
T-test
13.09.2021 3.45 pm to 4.45 pm Type I and II erors
Student's T distribution
Module 5:
Regression and Anova
14.09.2021 3.45 pm to 4.45 pm Regression
ANOVA
Module 6:
Exploratory data analysis
Data visualization
Missing value analysis
+
\f
6,
(\'' P
+ 1"$s;l' 4N
' 17.09.2021 3.45 prn to 4.45 pm Python Scikit tool
Neural networks
Support Vector machine
18.09.2021 10.00 am to 3.45 pm
T.ogistic and linear regression
Decision tree classifier
Module 8:
20.09.2021 3.45 pmto 4.45 pm Tableau
Working with Tableau
2t.09.2021 3.45 pm to 4.45 pm Deep diving with data and connection
t ,
At--t h*t-t""-*:> {.
Coordinator
LLE
* "udfi,#b**
l
DEPARTMENT OF COMPUTER SCIENCB
ACADEMIC YEAR: 2021-2022
PROGRAM NAME:. DATA SCIENCE CSDS21
1. which of the following is the most important language for Data Science?
A Java B Ruby CR D None of the mentioned
2. Which of the following approach should be used to ask Data Analysis question?
A Find only one solution for particular problem
B Find out the question which is to be answered
C Find out answer from dataset without aiking question
D None of the mentioned
3. Which of the following is one of the key data science skills?
A Statistics B Machine Learning C Data Visualization D AII of the mentioned
4. Which of the following is characteristic of Processed Data?
A Data is not ready for analysis B All steps should be noted
C Hard io rr. for data analjrsis D None of the mentioned
5. The plot method on Series and DataFrame is just a simple wrapper around
A gplt.plot0 B plt.plotO C plt.plotgraph0 D none of the mentionbd
6. Which of the following value is provided by kind keyword for barplot?
A bar B kde c hexbin D none of the mentioned
7. Which of the following is the probability calculus of beliefs, given that beliefs follow certain
rules?
A Bayesian probability B Frequency probability
C Frequency inference . D Bayesian inference
8. Which of the following random variable that take on only a countable number of
possibilities?
A Discrete B Non Discrete C Continuous D All of the mentioned
9. Which of the following is also referred to as random variable?
A stochast B aleatory C eliette D all of the mentioned
'10. Which of the following function is aSsociated with a continuous random variable?
A pdf B pmv C pmf D all of the mentioned
I l. Which of the following value is the most common measure of "statistical significance"?
AP BA CL D All of the mentioned
12. What is the purpose of multiple testing in statistical inference?
A Minimize effors B Minimize false positives
C Minimize false negatives D Alt of the mentioned
13. Which of the following tool is used for constructing confidence intervals and calculating
standard errors for difficult statistics?
A baggyer B bootstrap C jackknife D none of the mentioned
c or
+
E
<i
q,*[d,
n Colle$en
erica
"'n ri r"r-Arfl at
I
I s,,Ef,t',^-,
-d Prin-'ciPal, -'
j
o
I
rYAF ''tt}ff;I,:Alf"oYrT,
DEPARTMENT OF COMPUTER SCIENCE
ACADEMIC YEAR: 2021-2022
PROGRAM NAME: DATA SCIENCE_ CSDS21
REG. Zag t{ u t g aa +
No: NAME: AE'TM T
%
YEAR/SEM, .TT /b
-_7-
DArE: Xg-"og , Lezi *
1.Which of the'following is the'most imp oft ant I an guagq. fo r D ata S cience?
AJava B Rubv @nJ D None of the menrioned
2. Which of the following approach should be used to ask Data Analysis question?
A Find only one solution forparticular problem ,i
*
@ina out the question which is to be unr*".edl
C Find out answer from.dataset without asking question
. D None of the mentioned
3. Which of the following is one of the key data science skills?
A Statistics B Machine Learning C Data Visualization@tt of the mentiong!, ,4
S * BA CL DAllofthementioned
11. What is the purpose of multiple testing in statistical inference?
A Minimize errors B Minimize false positives
C Minimize false negatives
{)+tt of the mentioned .,,{
12. Which of the following focusbs on the disqbvery of (previously) unknown properties on the
data?
@Data
mining B Big Data
"/1 CDatawrangling D Machine Learning
co
T
+ f, \,^
o PrlneiPat'
0
+
n-13.o,ffiJ:Hf3t?:.
13. Which of the following uses relatively small amount of data to estimate about bigger
population?
@inferent ial */ B Exploratory . C Causal D None of the
mentioned
14. Which of the following analysis is usually modeled by set of equations?
A Predictive B Causal of the mentioned
.15. Which of the following is the top most thing in data science?
A answer @uestion C data D none of the mentioned
16. Which of the following approach should be used if you can't fix the variable?
c8*
I ndo-American College,
Cheyyar - OA4 4OT
DEPARTMENT OF COMPUTER SCIENCE
" ACADEMICYEAR:2021-2022 \%. %
PROGRAM NAME: DATA SCIENCE- CSDS2l ,/F
REG. NO:2oFttuBeA? NAME: ItAd"*fi T"i L\
'A{f-
I BA cL Sa.il of the mentioned I
11. What is the purpose of multiple testing in statistical inference?
A Minimize effors B Minimize false positives
C Minimize false negatives \P-All of the mentioned I
12. Which of the following focuses on the discovery of (previously) unknown properties on the
data?
L
+ 7 Pri NC ipal,
E
d.
I ndo-Ameri ca n Col le$e,
Cheyyar - 604 4Q7e/
\\_
*
13. Which of the following uses relatively small amount of data to estimate about bigger
population?
rk[nferentialz' B Exploratory C Causal D None of the
mentioned
14. Which of the following analysis is usually modeled by determini.stic set of equations?
A Predictive B Causal .' G'lvlechanistic f Xtt of the mentioned
15. Which of the foilowing is the top most important thing in data science?
A answer E{-uestionr- C data D none of the mentioned
16. Which of the following approach should be used if you can't fix the variable?
\ rflandomi ze it B non stratify it/ C generalize it D none of the mentioned
17. Which of the following is a good way of performing experiments in data science?
A Measure variability B Generalize to the problem C Have Replication Wll of the ,/
mentioned
18. Which of the following data mining technique is used to uncover patterns in data?
A Data bagging B Data booting C Data merging WDataDred,ging .r/
19. Which of the following operations are supported on Time Frames?
%ritlxmax B ixmax C ixmin D none of the mentioned
20. Numeric reduction operation for timedelta64[ns] will return objects
A Timeseries B Timeplus ffiimedelta DNone of the .:,(
mentioned
I CA N
ri
<-
*
Pit: ?22008 l?t
*
5&*
-PrinciPal'
'"Hlr:;:'3sfl[ff
)
\ {
DEPARTMENT OF' COMPUTER
ACADEMIC YE AR: 2021-2022
PR.OGRANI NAMF]: DATA SCIENCE_ CSDS21
REG.No: ?-05 p?tS o oy NAME: Ke sA FA\'? p's
i-
YEAR/sEM, E lg- DArE: 76-91nL! -
1. Which of the following is the most important lagg1age for Data Science?
A Java B Ruby 4p.,.2 D None of the mentioned
2. Which of the following approach should be used to ask Data Analysis question?
A Ei{ld only one solution for particular problem
;dr,"a out the question which is to be answdred
C Find out answer from dataset without asking question
D None of the mentioned
3 Which of the following is one of the key data science skills?
A Statistics B Machine Learning CDataVisualization dirrne mentioned
4. Which of the following is characteristic of Processed Data?
A Data is not ready for analysis
OWfrtt steps should be noted
C Hard to use for data analysisD None of the mentioned
5. The plot metho don Series and DataFrame is just a simple wrapper around
A gplt.plot0 ufn.ptotg " . C plt.plotgraph0 D none of the mentioned
6. Which of owing value is provided by kind keyword for barplot?
*{ar B kde C hexbin D none of the mentioned
7. Which of the variable that take on only a countable number of
possibilities?
g.discrete B Non Discrete C Continuous D All of the mentioned
8. Which of the following is also referred t-o-as random variable?
A stochast ,ffl"utrry ...,'r C eliette D all ofthe mentioned
9" Which of the following function is associated with a continuous random variable?
A pdf tUfp--, y- C pmf D all of the mentioned
10. Which ofthe value is the most common measure of "statistical signiflcance"?
Yf BA CL D All of the mentioned
I l. What is the purpose of multiple testing in statistical inference?
A Minimize errors B Minimize false positives
C Minimize false negatives Vdtt of the mentioned
12. Which of the following focuses on the discovery of (previously) unknown properties on the
data?
fu{atamining B Big Data C Data wrangling D Machine Leaming
E t
(g
(}
t e ;
\
L)
(Y
oa
5
o,l -u'" PrinciPal"
o-
"-ndpAnr *(ican Coile$e'
I
J
13. Which of the following uses relatively small amount of data to estimate about bigger
population?
A inferential dxploratory * C Causal D None of the
mentioned
14. Which of the following analysis is usually modeled by deterministic set of equations?
A Predictive B Causal edfrechanistic D All of the menfioned
15. Which of the following is the top thing in data science?
A answer C data D none of the mentioned
16. Which of the fol bpproach should be used if you can't fix the variable?
non stratify it C generulizeit D none of the mentioned
17. Which of the following is a good way of performing experiments in data
A Measure variability B Generalize to the problem C Have Replication of the
. mentioned
18. Which of the following data mining technique is used to uncover patterns in data?
A Data bagging B Data booting CDatamerging Vdata
19. Which of operations are supported on Time Frames?
B ixmax C ixmin D none of the mentioned
20. Numeric reduction operation for timedelta64[ns] will return objects.
A Timeseries B Timeplus vfiry*t*) D None of the
mentioned
LLE
*
1ffi*
PrinciPal'
indo-American C9[13er
* "'tn*war - 604 4A7:
.)
f.AA&* t$*i$tr "6",$r*rdx
fte,${3gn,sE* {*sre* r S**:ry$mrl I $ft & t 3 t*s? f rlG.*,efa
FEEDBACK FORM
COURSE NAME WITH CODE "*,tenp- nSJDg2_)
DEPARMENT Coq,put €t{ =gL\en,(l
DURATION IN HOURS _3ol.frr4
DATE OF COMPLETION \o, 0ts.-zt z-
STUDENT NAME & REG NO s{rqunp, t{ r-rlSz-c' P) 9-t'e
SIGNATURE WITH DATE
1.H ow interesting was the courie is?
a. Strongly Agree WAgree c. Moderate d. Disagree e. Strongly Disagree
2. How this course usefut for you?
Frin
6 !
+ n Col teE€'
lndo- Ameriea
eh eyyar - s a4,407
*rt#cp*mme ri ccx ffi ffi mfrfiffi#*
3{ <*.. 1 *-lt t ;*,,r+,,<3**a,* {' *r**,1}* g,*.
\€e r€-
**e.rs$i**d by' S-,'$,*\efi W'{}{tsde
tx:l*}r*
#*e.*.ngrr*ww$ L$nd*,r Se'cat{3fl A qry si , 7 {ry} erf [j{3*:J{*L
FEEDBACK FORM
COURSE NAME WITH CODE Potn SLrsrtcE c-g Dszl
CDro ?u'rER
glBuct
DEPARMENT
DURATION IN HOURS 3o
DATE OF COMPLETION uc, oS ' l-oti
YuvEeIS- k
STUDENT NAME & REG NO z.o-siaritot6
.t-
SIGNATURE WITH DATE c)
1. How interesting was the course is?
\ +
\)
r\
b f,)rAr4**,
P"rinciPal'
n Collefe'
+
"';;;rar
I ndo-America
- 6o4 4o7
-
}\ Lli "t-c;{ .r:)(}lFter (d:};i*$:(:
$}*ri1?ffi{T *t.lt$ l&.iG*w.tmd t* Tl$$Ftt-}"a.",1{*'E-r4-r'"',r-&${ vJSi*i$\rXffiSi'{-Y" r"d*lierre '
-S.-
l{,**r,*drtxd irp S*}\, S !,tditt} {3r,t*dx
,*teg,r:{trslw*d, {jrrdet Ss*,{irpl'l 3 tf.} "B -* X {&,} sf .*{SC. Anf-
FEEDBACK FORM
COURSE NAME WITH CODE .|,-:Sa, 9c rc-nto a g &\ J-rr
€)
'\-,/ Strongly Agree b. Agree c. Moderate d. Disagree e. Strongly Disagree
AME 6,Fefi"-
i r;;+Urp6l,
+ ,i icjo-Arrerican CollegG,
Cheyyar - 6O4,.'- '
_
{F
DEPARTMENT OF COMPUTER SCIENCE
INDO - AMERICAN COLLEGE,
CHEYYAR