Machine Learning Syllabus
NumPy, Pandas, Matplotlib, Plotly, Seaborn, SciPy, scikit-learn, Regex, File Operations in Python
One hot encoding
Normalization
Regularization, Generalization
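As an illustration of the one hot encoding and normalization topics above, here is a minimal NumPy sketch. The helper names (`one_hot`, `min_max_normalize`, `standardize`) and the sample data are hypothetical and chosen only for this example; libraries such as scikit-learn provide their own encoders and scalers, which are not shown here.

```python
import numpy as np

def one_hot(labels):
    """Return (sorted categories, one-hot matrix) for a 1-D sequence of labels."""
    categories, indices = np.unique(labels, return_inverse=True)
    # Row i of the identity matrix is the one-hot vector for category i.
    encoded = np.eye(len(categories), dtype=int)[indices]
    return categories, encoded

def min_max_normalize(x):
    """Rescale each column to the range [0, 1]; constant columns map to 0."""
    x = np.asarray(x, dtype=float)
    lo, hi = x.min(axis=0), x.max(axis=0)
    span = np.where(hi > lo, hi - lo, 1.0)  # avoid division by zero
    return (x - lo) / span

def standardize(x):
    """Shift each column to zero mean and scale it to unit variance."""
    x = np.asarray(x, dtype=float)
    std = x.std(axis=0)
    return (x - x.mean(axis=0)) / np.where(std > 0, std, 1.0)

# Illustrative data only.
colors = ["red", "green", "blue", "green"]
categories, encoded = one_hot(colors)

features = np.array([[1.0, 200.0], [2.0, 400.0], [3.0, 600.0]])
scaled = min_max_normalize(features)
```

Min-max scaling keeps values inside a fixed range, while standardization is often preferred when features have very different spreads; which one fits depends on the model being trained.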
Computer Vision Data Pre-Processing
● Introduction to Image Pre-Processing
● Data augmentation
● Transformation operation
● Pixel brightness transformations (PBT)
● Gamma Correction
● Histogram equalization
● Sigmoid stretching
● Geometric Transformations
● Image Filtering and Segmentation
● Image Segmentation
● Fourier transform
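Two of the pixel brightness transformations listed above, gamma correction and histogram equalization, can be sketched with NumPy alone. This is a simplified sketch that assumes 8-bit grayscale images; the function names and the tiny sample image are hypothetical. The gamma convention used here is `out = 255 * (in / 255) ** gamma`, so values of `gamma` below 1 brighten the image. Some references use the inverse exponent.

```python
import numpy as np

def gamma_correct(image, gamma):
    """Apply out = 255 * (in / 255) ** gamma to an 8-bit grayscale image."""
    normalized = image.astype(np.float64) / 255.0
    return np.round(255.0 * normalized ** gamma).astype(np.uint8)

def equalize_histogram(image):
    """Spread intensities using the cumulative distribution of pixel values."""
    hist = np.bincount(image.ravel(), minlength=256)
    cdf = hist.cumsum()
    cdf_min = cdf[cdf > 0][0]  # CDF value at the darkest intensity present
    total = image.size
    if total == cdf_min:
        # Every pixel has the same value, so there is nothing to spread.
        return image.copy()
    # Build a lookup table that maps each old intensity to a new one.
    lut = np.round((cdf - cdf_min) / (total - cdf_min) * 255.0)
    lut = np.clip(lut, 0, 255).astype(np.uint8)
    return lut[image]

# Illustrative 2x3 image only.
image = np.array([[50, 50, 100], [100, 150, 200]], dtype=np.uint8)
brighter = gamma_correct(image, 0.5)
equalized = equalize_histogram(image)
```

After equalization the darkest pixel present maps to 0 and the brightest to 255, which is what stretches the contrast of a low-contrast image.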
NLP Data Pre-Processing
● Tokenization
● Lemmatization and stemming
● Bag of words
● TF-IDF
● Stop Words Removal
● NGrams
● Regex Matching
● Text Matching
● Chunking
● Date Matcher
● Part-of-speech tagging
● Sentence Detector (DL models)
● Dependency parsing
● Sentiment Detection (ML models)
● Spell Checker (ML & DL models)
● Doc2Vec Embeddings (an extension of Word2Vec)
● Word2Vec Embeddings
● Word Embeddings (GloVe & Word2Vec)
● Sentiment Analysis
● Named Entity Recognition (NER)
● Summarization
● Topic Modeling
● Text Classification
● Keyword Extraction
● LDA
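Several of the pre-processing steps listed above (tokenization, stop word removal, n-grams, bag of words, TF-IDF) can be sketched using only the Python standard library. The function names, the tiny stop word set, and the two-sentence corpus are hypothetical and for illustration only. The TF-IDF variant shown uses `tf = count / document length` and `idf = log(N / document frequency)`, which is one of several common formulations; libraries such as scikit-learn apply different smoothing.

```python
import math
import re
from collections import Counter

# Tiny illustrative list, not a standard stop word set.
STOP_WORDS = {"the", "a", "is", "on", "and"}

def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def remove_stop_words(tokens):
    """Drop tokens that appear in STOP_WORDS."""
    return [t for t in tokens if t not in STOP_WORDS]

def ngrams(tokens, n):
    """Return all contiguous n-token tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def tf_idf(documents):
    """Return one {term: weight} dict per tokenized document."""
    n_docs = len(documents)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for doc in documents for term in set(doc))
    weights = []
    for doc in documents:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights

# Illustrative corpus only.
corpus = ["The cat sat on the mat", "The dog sat on the log"]
docs = [remove_stop_words(tokenize(text)) for text in corpus]
bag_of_words = [Counter(doc) for doc in docs]
bigrams = ngrams(docs[0], 2)
weights = tf_idf(docs)
```

A term that occurs in every document, such as "sat" here, gets a weight of zero under this formulation, while terms unique to one document get the highest weights.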
NLP:
Word Cloud
Knowledge graphs
BERT Embeddings
DistilBERT Embeddings
RoBERTa Embeddings
DeBERTa Embeddings
XLM-RoBERTa Embeddings
Longformer Embeddings
ALBERT Embeddings
XLNet Embeddings
RNN
LSTM
GRU
Transformer
ELMo Embeddings
Universal Sentence Encoder
Sentence Embeddings
Chunk Embeddings
Neural Machine Translation (MarianMT)
Text-To-Text Transfer Transformer (Google T5)
Generative Pre-trained Transformer 2 (OpenAI GPT-2)
Unsupervised keywords extraction
Language Detection & Identification (up to 375 languages)
Multi-class Text Classification (DL model)
Multi-label Text Classification (DL model)
Multi-class Sentiment Analysis (DL model)
BERT for Token & Sequence Classification
DistilBERT for Token & Sequence Classification
ALBERT for Token & Sequence Classification
RoBERTa for Token & Sequence Classification
XLM-RoBERTa for Token & Sequence Classification
XLNet for Token & Sequence Classification
Longformer for Token & Sequence Classification
Named entity recognition (DL model)
Easy TensorFlow integration
GPU Support
Full integration with Spark ML functions
Additional Learning:
1. MLOps
2. Git, GitHub
3. PySpark, Hadoop, etc.
4. Docker, FastAPI, REST API
5. NoSQL, MongoDB, MySQL
6. Data Structures and Algorithms
7. AWS, Microsoft Azure, Google Cloud Platform
8. Stackhome (e.g., where can we get datasets?)