
YouTube channel - Yashvi Patel

How to get started with data science


PHASE - 1 (Fundamentals)

1. Learn Python
   1. Google Python crash course
   2. CodeWithHarry
   3. GeeksforGeeks
   4. Official documentation
2. Maths
   1. 3Blue1Brown
3. Statistics and Probability
   1. Khan Academy - Stats & Prob
4. Python libraries - NumPy, Pandas, Matplotlib, Seaborn, Plotly

PHASE - 2 (Exploratory data analysis)

1. Kaggle data visualisation course
2. https://www.kaggle.com/pmarcelino/comprehensive-data-exploration-with-python
3. https://www.kaggle.com/parulpandey/10-advanced-plots-in-matplotlib
4. https://www.kaggle.com/parulpandey/breathe-india-covid-19-effect-on-pollution
5. https://www.kaggle.com/parulpandey/10-simple-hacks-to-speed-up-your-data-analysis
6. https://www.kaggle.com/headsortails/back-to-predict-the-future-interactive-m5-eda

PHASE - 3 (Data Preprocessing)

● Handling missing values
● Handling outliers
● Feature scaling
● Encoding categorical features
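
The four steps above can be sketched in a few lines of plain Python. This is only an illustration, not the method used in the linked notebooks: the toy `ages` and `cities` columns are made up, and the specific choices (median imputation, clipping at the 1.5 * IQR fences, min-max scaling, one-hot encoding) are assumptions picked because they are common defaults. In practice you would more likely use Pandas and scikit-learn for the same steps.

```python
from statistics import median, quantiles

# Hypothetical toy column with one missing value (None) and one extreme value.
ages = [22, 25, None, 24, 23, 95]

# 1. Handling missing values: fill gaps with the median of the observed values.
observed = [a for a in ages if a is not None]
fill_value = median(observed)
imputed = [fill_value if a is None else a for a in ages]

# 2. Handling outliers: clip values that fall outside the 1.5 * IQR fences.
q1, _, q3 = quantiles(imputed, n=4)
iqr = q3 - q1
lower_fence, upper_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
clipped = [min(max(a, lower_fence), upper_fence) for a in imputed]

# 3. Feature scaling: min-max scaling to the range [0, 1].
lo, hi = min(clipped), max(clipped)
scaled = [(a - lo) / (hi - lo) for a in clipped]

# 4. Encoding categorical features: one-hot encoding (hypothetical city column).
cities = ["Pune", "Delhi", "Pune", "Mumbai"]
categories = sorted(set(cities))
one_hot = [[int(city == cat) for cat in categories] for city in cities]

print(imputed)
print(clipped)
print(scaled)
print(categories, one_hot)
```

The order matters here: the missing value is filled before the outlier fences are computed, and scaling is done last so that the extreme value does not squash the rest of the column into a narrow range.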

Resources -
1. https://www.kaggle.com/parulpandey/a-guide-to-handling-missing-values-in-python
2. https://www.kaggle.com/nareshbhat/outlier-the-silent-killer
3. https://www.analyticsvidhya.com/blog/2020/07/types-of-feature-transformation-and-scaling/
4. https://www.analyticsvidhya.com/blog/2020/08/types-of-categorical-data-encoding/

PHASE - 4 (Feature selection and Feature engineering)

1. https://www.kaggle.com/kanncaa1/feature-selection-and-data-visualization
2. https://www.kaggle.com/kashnitsky/topic-6-feature-engineering-and-feature-selection
3. https://www.kaggle.com/davidcairuz/feature-engineering-lightgbm

PHASE - 5 (Machine learning)

1. Machine Learning by Andrew Ng
2. Applied Machine Learning - University of Michigan
3. Kaggle micro-courses
4. Scikit-learn official guide
5. https://www.kaggle.com/shivamb/data-science-glossary-on-kaggle

PHASE - 6 (Evaluation Metrics)

1. https://www.kaggle.com/getting-started/221303
2. https://www.kaggle.com/getting-started/228810
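
As a rough illustration of what this phase covers, the sketch below computes four common binary-classification metrics (accuracy, precision, recall and F1) from scratch. The labels in `y_true` and `y_pred` are made-up example data and the function name is hypothetical; the linked discussions may cover other metrics, and in practice `sklearn.metrics` provides ready-made versions.

```python
def classification_metrics(y_true, y_pred):
    """Return accuracy, precision, recall and F1 for binary labels (0 or 1)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical ground-truth labels and model predictions.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Accuracy alone can mislead on imbalanced data, which is why precision and recall are usually reported alongside it.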

PHASE - 7 (Hyperparameter tuning)

1. https://www.kaggle.com/prashant111/a-guide-on-xgboost-hyperparameters-tuning?scriptVersionId=38814952
2. https://www.kaggle.com/pavansanagapati/automated-hyperparameter-tuning
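
To make the idea concrete, here is a minimal sketch of the simplest tuning strategy, an exhaustive grid search scored on a hold-out validation set. It deliberately uses a tiny hand-written k-nearest-neighbours classifier rather than XGBoost so that it runs with the standard library alone; the data points, the `knn_predict` helper and the parameter grid are all made up for illustration and are not taken from the linked notebooks.

```python
from itertools import product

# Hypothetical 1-D training data as (feature, label); (2.3, 1) is a noisy point.
train = [(1.0, 0), (1.5, 0), (2.0, 0), (2.3, 1), (6.0, 1), (6.5, 1), (7.0, 1)]
# Hypothetical hold-out validation set used only to score each configuration.
valid = [(1.2, 0), (2.2, 0), (6.2, 1), (6.8, 1)]


def knn_predict(x, k, weighted):
    """Predict a label for x by voting among the k nearest training points."""
    neighbours = sorted(train, key=lambda row: abs(row[0] - x))[:k]
    votes = {0: 0.0, 1: 0.0}
    for xi, yi in neighbours:
        # Distance-weighted votes favour closer neighbours; otherwise one vote each.
        votes[yi] += 1.0 / (abs(xi - x) + 1e-9) if weighted else 1.0
    return max(votes, key=votes.get)


def validation_accuracy(k, weighted):
    """Fraction of validation points classified correctly for one configuration."""
    hits = sum(knn_predict(x, k, weighted) == y for x, y in valid)
    return hits / len(valid)


# The grid: every combination of these hyperparameter values is evaluated.
grid = {"k": [1, 3, 5], "weighted": [False, True]}
results = {
    (k, weighted): validation_accuracy(k, weighted)
    for k, weighted in product(grid["k"], grid["weighted"])
}

# Pick the configuration with the highest validation accuracy
# (on ties, the first one in grid order wins).
best_params = max(results, key=results.get)
print(results)
print("best (k, weighted):", best_params)
```

Grid search becomes expensive as the grid grows, which is the motivation for the random and automated search methods covered in the second link.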

PHASE - 8 (End-to-End Project)

● Django
● Flask
● Streamlit

Tips -

1. Use Kaggle as much as possible: practice on a variety of datasets and competitions, and study the top notebooks and the competition discussion forums.
2. Don’t skip the maths behind an algorithm; understand why you are using that particular algorithm.
3. Practice past interview questions (Krish Naik mock interview series).
4. Listen to podcasts and read blogs.
5. Try to read some research papers and implement them (Papers with Code).

Data science blogs -
● https://towardsdatascience.com/
● https://www.analyticsvidhya.com/
● https://www.kdnuggets.com/
● https://ai.googleblog.com/
● https://ai.facebook.com/blog/?page=1
Podcasts -
● https://towardsdatascience.com/top-20-podcasts-for-data-science-83dc9e07448e
