PYTHON Introduction to Data Science Tuple Immutable concept len() || count() || index() Introduction to Data Science Forward indexing Discussion on Course Curriculum Backward Indexing Introduction to Programming Dictionary and Dictionary comprehension Python – Basics create a dictionary using variable Introduction to Python: Installation and keys:values concept Running len() || keys() || values() || items() (Jupyter Notebook, .py file from terminal, get() || pop() || update() Google Colab) comparision of datastructure Data types and type conversion Introduce to range() Variables pass range() in the list Operators range() arguments Flow Control : If, Elif, Else For loop introduction using range() Loops Python Identifier Functions Building Funtions (print, type, id, sys, len) Inbuilt vs User Defined Python - Data Types & Utilities User Defined Function Function Argument List, List of Lists and List Comprehension Types of Function Arguments List creation Actual Argument Create a list with variable Global variable vs Local variable List mutable concept Anonymous Function | LAMBDA len() || append() || pop() insert() || remove() || sort() || reverse() Packages Forward indexing Backward Indexing Map Reduce Forward slicing OOPs Backward slicing Step slicing Class & Object:
Set what is mean by inbuild class
how to creat user class SET creation with variable crate a class & object len() || add() || remove() || pop() __init__ method union() | intersection() || difference() python constructor Tuple constructor, self & comparing objects instane variable & class variable TUPLE Creation Create Tuple with variable Methods: what is instance method Data Frame Attributes what is class method Data Frame Methods what is static method Rename Column & Index Accessor & Mutator Inplace Parameter Handling missing or NaN values Python DECORATOR: iLoc and Loc how to use decorator Data Frame – Filtering inner class, outerclass Data Frame – Sorting Inheritence Data Frame – GroupBy Merging or Joining Polymorphism: Data Frame – Concat DataFrame - Adding, dropping columns & duck typing rows operator overloading DataFrame - Date and time method overloading DataFrame - Concatenate Multiple csv files method overridding Magic method Numpy Abstract class & Abstract method Iterator Introduction, Installation, pip command, Generators in python import numpy package, ModuleNotFoundError, Famous Alias name to Python - Production Level Numpy Fundamentals – Create Numpy Array, Array Error / Exception Handling Manipulation, Mathematical Operations, File Handling Indexing & Slicing Docstrings Numpy Attributes Modularization Important Methods- min(),max(), sum(), Pickling & Unpickling reshape(), count_nonzero(), sort(), flatten() etc., Pandas adding value to array of values Diagonal of a Matrix Introduction, Fundamentals, Importing Trace of a Matrix Pandas, Aliasing, DataFrame Parsing, Adding and Subtracting Matrices Series – Intro, Creating Series Object, Empty "Statistical Functions: numpy.mean() Series Object, Create series from numpy.median() List/Array/Column from DataFrame, Index in numpy.std() Series, Accessing values in Series numpy.sum() NaN Value numpy.min()" Series – Attributes (Values, index, dtypes, size) Filter in Numpy Series – Methods – head(), tail(), sum(), Matplotlib count(), nunique() etc., Date Frame Introduction Loading Different Files Pyplot Figure Class Axes Class lmplot() function Setting Limits and Tick Labels Seaborn Facetgrid() function Multiple Plots Multi-plot grids Legend Statistical Plots: Different Types of Plots: Color Palettes: Line Graph Faceting: Bar Chart Regression Plots: Histograms, Distribution Plots Scatter Plot Categorical Plots: Pie Chart Pair Plots 3D Plots Scipy Working with Images Customizing Plots Signal and Image Processing (scipy.signal, scipy.ndimage): Seaborn Linear Algebra (scipy.linalg): catplot() function Integration (scipy.integrate) stripplot() function Statistics (scipy.stats): boxplot() function Spatial Distance and Clustering (scipy.spatial): violinplot() function Statsmodels pointplot() function barplot() function Linear Regression (statsmodels.regression): Visualizing statistical relationship with Time Series Analysis (statsmodels.tsa): Seaborn relplot() function Statistical Tests (statsmodels.stats) scatterplot() function Anova (statsmodels.stats.anova): regplot() function Datasets (statsmodels.datasets): Mathematics Set Theory Binomial, Poisson, Normal Distribution, Standard Normal Distribution Data Representation & Database Operations Guassian Distribution, Uniform Distribution Combinatorics Z Score Skewness Feature Selection Kurtosis Permutations and Combinations for Sampling Geometric Distribution Hyperparameter Tuning Hyper Geometric Distribution Experiment Design Markov Chain Data Partitioning and Cross-Validation Linear Algebra Probability Linear Equations Basics Matrices(Matrix Algebra: Vector Matrix Theoretical Probability Vector matrix multiplication Matrix matrix Empirical Probability multiplication) Addition Rule Determinant Multiplication Rule Eigen Value and Eigen Vector Conditional Probability Total Probability Euclidean Distance & Manhattan Distance Probability Decision Tree Calculus Bayes Theorem Sensitivity & Specificity in Probability Differentiation Bernouli Naïve Bayes, Gausian Naïve Bayes, Partial Differentiation Multinomial Naïve Bayes Max & Min
Distributions Indices & Logarithms
STATISTICS Introduction Dependent Variable Independent Variable Population & Sample Control Moderating & Mediating Reference & Sampling technique Frequency Distribution Table Types of Data Nominal, Ordinal, Interval, Ratio Qualitative or Categorical – Nominal & Ordinal Types of Variables. Quantitative or Numerical – Discrete & Categorical Variables - Nomial variable & Continuous ordinal variables Cross Sectional Data & Time Series Data Numerical Variables: discreate & continuous Measures of Central Tendency Dependent Variable Independent Variable Mean, Mode & Median – Their frequency Control Moderating & Mediating distribution Frequency Distribution Table Descriptive statistic Measures of symmetry Relative Frequency, Cumulative Frequency skewness (positive skew, negative skew, zero Histogram skew) Scatter Plots kurtosis (Leptokurtic, Mesokurtic, Range Platrykurtic) Calculate Class Width: Measurement of Spread Create Intervals Count Frequencies Range, Variance, Standard Deviation Construct the Table
Measures of variability Correlation, Regression & Collinearity
Interquartile Range (IQR): Pearson & Spearman Correlation Methods
Mean Absolute Deviation (MAD) Regression Error Metrics Coefficient of variation Covariance Others
Levels of Data Measurement Percentiles, Quartiles, Inner Quartile Range
Different types of Plots for Continuous, Nominal, Ordinal, Interval, Ratio Categorical variable Box Plot, Outliers Variable Confidence Intervals Types of Variables. Central Limit Theorem Categorical Variables - Nomial variable & Degree of freedom ordinal variables Bias and Variance in ML Numerical Variables: discreate & continuous Entropy in ML Inferential Statistics
Information Gain Hypothesis Testing: One tail, two tail and p-
value Surprise in ML Formulation of Null & Alternate Hypothesis Loss Function & Cost Function Type-I error & Type-II error Statistical Tests: Mean Squared Error, Mean Absolute Error – Sample Test Loss Function ANOVA Test Huber Loss Function Chi-square Test Cross Entropy Loss Function Z-Test & T-Test SQL Introduction SQL Commands
DBMS vs RDBMS Create
Intro to SQL Insert SQL vs NoSQL Alter, Modify, Rename, Update MySQL Installation Delete, Truncate, Drop Grant, Revoke Keys Commit, Rollback Primary Key Select Foreign Key SQL Clause Constraints Where Unique Distinct Not NULL OrderBy Check GroupBy Default Having Auto Increment Limit
Data Definition Language (DDL) Aggregate Functions
Data Query Language SQL Joins Data Manipulation Language (DML) Data Control Language Inner Join & Outer Join Transaction Control Language Left Join & Right Join Self & Cross Join Natural Join EDA & ML EDA • Feature Engineering – Adding new features as per requirement, Modifying the data Univariate Analysis • Data Cleaning – Treating the missing values, Bivariate Analysis Outliers Multivariate Analysis • Data Wrangling – Encoding, Feature Transformations, Feature Scaling Data Visualisation • Feature Selection – Filter Methods, Wrapper Various Plots on different datatypes Methods, Embedded Methods Plots for Continuous Variables • Dimension Reduction – Principal Component Plots for Discrete Variables Analysis (Sparse PCA & Kernel PCA), Singular Plots for Time Series Variables Value Decomposition • Non Negative Matrix Factorization ML Introduction Regression What is Machine Learning? Types of Machine Learning Methods • Introduction to Regression • Supervised Learning • Mathematics involved in Regression • Unsupervised Learning • Regression Algorithms: • Reinforcement Learning) • Simple Linear Regression Classification problem in general • Multiple Linear Regression • Polynomial Regression Validation Techniques: CV,OOB • Lasso Regression Different types of metrics for Classification • Ridge Regression Curse of dimensionality • Elastic Net Regression Feature Transformations Feature Selection Evaluation Metrics for Regression: Imabalanced Dataset and its effect on Classification • Mean Absolute Error (MAE) Bias Variance Tradeoff • Mean Squared Error (MSE) • Root Mean Squared Error (RMSE) Important Element of Machine Learning • R² • Adjusted R² Multiclass Classification Classification • One-vs-All • Overfitting and Underfitting • Introduction • Error Measures • K-Nearest Neighbors • PCA learning • Logistic Regression: • Statistical learning approaches • Implementation and Optimizations • Introduce to SKLEARN FRAMEWORK • Stochastic gradient descent algorithms Data Processing • Finding the optimal HyperParameters • Creating training and test sets, Data scaling through Grid Search and Normalisation • Support Vector Machines (Linear SVM): Linear support vector machines • Accuracy & F1 Score • Scikit-learn implementation • Precision & Recall • Linear Classification • Sensitivity & Specificity • Kernel-based classification • True Positive Rate, False Positive Rate Radial Basis Function • ROC & ROC_AUC Polynomial Kernel Clustering Sigmoid Kernel Custom Kernels Introduction • Non-linear examples • 2 features forms straight line & 3 features K-Means Clustering: forms plane • Finding the optimal number of clusters • Hyperplane and Support vectors • Optimizing the inertia • Controlled support vector machines • Cluster instability • Support vector Regression • Elbow method • Kernel SVM (Non-Linear SVM) • Naives Bayes: Hierarchical Clustering • Bayes theorem • Naive Bayes Classifiers Agglomerative clustering • Naive Bayes in scikit learn ( Bernoulli DBSCAN Clustering Naive Bayes, Mulitnomial Naive Bayes, Guassian Naive Bayes)" Association Rules • Decision Trees: • Market Basket Analysis Binary Decision Trees • Apriori Algorithm Binary decisions CART Algorithm Recommendation Engines Impurity measures (Gini impurity index, Cross-entropy impurity index, • Collaborative Filtering: Misclassification impurity index) • User based collaborative filtering Feature importance • Item based collaborative filtering Decision tree classification with scikit- • Recommendation Engines learn Time Series & Forecasting • Random Forest / Bagging: • Random Forests and Features • What is Time series data importance in Random Forest • Different components of time series data • AdaBoost • Stationary of time series data • Gradient tree boosting • ACF, PACF • Voting classifier • Time Series Models: • Ensemble:Bagging • AR • Ensemble:Boosting" • ARMA • Ada Boost • ARIMA • Gradient Boost • SARIMAX • XG Boost • Evaluation Metrics for Classification: Model Selection & Evaluation • Confusion Matrix Over Fitting & Under Fitting • Biance-Variance Tradeoff Others o Cross Validation: • Dummy Variable, Onehotencoding o Stratified Cross validation o K-Fold Cross validation • gridsearchcv vs randomizedsearchcv • Hyper Parameter Tuning ML Pipeline • Joblib And Pickling ML Model Deployment in Flask PowerBI Introduction Hierarchies, Filters
• Power BI for Data scientist • Creating Hierarchies
• Types of reports • Drill Down options • Data source types • Expand and show • Installation • Visual filter,Page filter,Report filter • Drill Thru Reports Basic Report Design Power Query • Data sources and Visual types • Canvas and fields • Power Query transformation • Table and Tree map • Table and Column Transformations • Format button and Data Labels • Text and time transformations • Legend,Category and Grid • Power query functions • CSV and PDF Exports • Merge and append transformations
Visual Sync, Grouping DAX Functions
• Slicer visual • DAX Architecture,Entity Sets
• Orientation,selection process • DAX Data types,Syntax Rules • Slicer:Number,Text,slicer list • DAX measures and calculations • Bin count,Binning • Creating measures • Creating Columns Deep Learning Deep learning at Glance • Vanishing Gradient Descend • Fine-tuning neural network hyperparameter • Introduction to Neural Network • Number of hidden layers, Number of neurons • Biological and Artificial Neuron per hidden layer • Introduction to perceptron • Activation function • Perceptron and its learning rule and • INSTALLATION OF YOLO V8, KERAS, THEANO drawbacks • Multilayer Perceptron, loss function PY-TORCH Library • Neural Network Activation function RNN (Recurrent Neural Network) Training MLP: Backpropagation • Introduction to RNN Cost Function • Back Propagation through time • Input and output sequences Gradient Descent Backpropagation - Vanishing and • RNN vs ANN Exploding Gradient Problem • LSTM (Long Short-Term Memory) Introduce to Py-torch • Different types of RNN: LSTM, GRU • Biirectional RNN Regularization • Sequential-to-sequential architecture (Encoder Decoder) Optmizers • BERT Transformers Hyperparameters and tuning of the same • Text generation and classification using Deep Learning TENSORFLOW FRAMEWORK • Generative-AI (Chat-GPT)
• Introduction to TensorFlow Basics of Image Processing
• TensorFlow Basic Syntax • TensorFlow Graphs • Histogram of images • Variables and Placeholders • Basic filters applied on the images • TensorFlow Playground Convolutional Neural Networks (CNN) ANN (Artificial Neural Network) • ImageNet Dataset • ANN Architecture • Project: Image Classification • Forward & Backward Propagation, Epoch • Different types of CNN architectures • Introduction to TensorFlow, Keras • Recurrent Neural Network (RNN) • Using pre-trained model: Transfer Learning Natural Language Processing (NLP) Natural Language Processing (NLP) • TextBlob • Installing textblob library • Text Cleaning • Simple TextBlob Sentiment Analysis Example • Texts, Tokens • Using NLTK’s Twitter Corpus • Basic text classification based on Bag of Words Spacy Library
Document Vectorization • Introduction, What is a Token, Tokenization
• Stop words in spacy library • Bag of Words • Stemming • TF-IDF Vectorizer • Lemmatization, • n-gram: Unigram, Bigram • Lemmatization through NLTK • Word vectorizer basics, One Hot Encoding • Lemmatization using spacy • Count Vectorizer • Word Frequency Analysis • Word cloud and gensim • Counter • Word2Vec and Glove • Part of Speech, Part of Speech Tagging • Text classification using Word2Vec and Glove • Pos by using spacy and nltk • Parts of Speech Tagging (PoS Tagging or POST) • Dependency Parsing • Topic Modelling using LDA • Named Entity Recognition(NER) • Sentiment Analysis • NER with NLTK Twitter Sentiment Analysis Using Textblob • NER with spacy Computer Vision Human vision vs Computer vision OPEN AI
• CNN Architecture • Introduction to Open AI
• CONVOLUTION – MAX POOLING – FLATTEN • Generative AI LAYER – FULLY CONNECTED LAYER • Chat Gpt (3.5) • CNN Architecture • LLM (Large Language Model) • Striding and padding • Classification Tasks with Generative AI • Max pooling • Content Generation and Summarization with • Data Augmentation Generative AI • Introduction to OpenCV & YoloV3 Algorithm • Information Retrieval and Synthesis workflow with Gen AI Image Processing with OpenCV Time Series and Forecasting • Image basics with OpenCV • Opening Image Files with OpenCV • Time Series Forecasting using Deep Learning • Drawing on Images, Image files with OpenCV • Seasonal-Trend decomposition using LOESS • Face Detection with OpenCV (STL) models. • Bayesian time series analysis Video Processing with OpenCV MakerSuite Google • Introduction to Video Basics, Object Detection • Object Detection with OpenCV • PaLM API • MUM models Reinforcement Learning Azure ML • Introduction to Reinforcement Learning • Architecture of Reinforcement Learning • •Reinforcement Learning with Open AI • Policy Gradient Theory