IBM Certified Data Science Course Brochure - Learnbay - 2020

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 26

www.learnbay.

co

DATA SCIENCE
AND AI
CERTIFICATION
Master's
PROGRAM program

CO-DEVELOPED WITH

250+ hrs of live 1 year of unlimited 12 + real-time Guaranteed job


classroom by flexible classroom projects & referrals in top
industry expert subscription Capstone companies

Classroom Training in Bangalore | Live Online Training | 6 Months Certification Program


@learnvista pvt. ltd.
Program Highlights
Learnbay offers Data Science and Artificial Intelligence Certification Program which is
co-developed and Certified with IBM. Course features 12+ real world industry projects
and 2 capstone projects under the mentor-ship and guidance of Data Science and AI
experts.
Course is especially designed for working professionals having 1+ years of experience in
any domain. Our course is best suited for professionals looking to change their current
domain and start a new career in Data science and Artificial Intelligence.

Live Sessions By Expert Project Based Learning One Year Flexible Subscription
Classroom training in Bangalore 12+ Real World Industry Projects Flexibility to attend multiple
Live Faculty led Online Training 2 Capstone Projects batches from different trainers.
250+ hrs of Interactive Classes Mentorship & Guidance By Expert Life time access to Recordings

Special Support to Non Certification from IBM in Job Assistance Program


Programmers Data Science And AI For Working Professionals
Learn Python from scratch IBM certified Data science and Resume support from expert
Special classes for Non AI program Interview prep session and
programming background Industry Accredited Global Mock interview
students Certification Course Guaranteed job referrals for
Real time Use Cases from multiple Co-developed With IBM working professionals
domain

Top Rated Training


Institute in India For Data
Science And AI Certification

4.8 Top Rated 4.9


300+ user Review
Become IBM Certified
Data Sience & AI Expert

Click to read reveiws

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Program Details
Program Eligibility :
Work Experience :
Working Professionals With 1+ Years of experience in any domain (tech or non technical)
Academics :
BE/B.Tech (from any branch) , BBA/MBA, MCA/M.Tech, B.Com, Graduation in Mathematics,
Statistics, IT

Who Should Apply :


Software developers/Programmers, Project Managers, Manual And Automation Test
Engineer, Java and .net Developer, Informatica, Business Analyst.
Database Admin, System Admin, Professionals from Sales, Marketing, Operations.
SAP domain expert, Python , Embedded developer , Android/ios developer.
Professionals from BFSI, Supply chain, Retail, healthcare, Pharma.
Manufacturing, Mechanical, Electrical, Automobiles, Telecom domain.We have domain
specific project from these sectors.
Professionals planning for Masters or higher education in data science and AI
To check your eligibility, Apply for Profile Review and Counselling with expert:

Click here to apply for profile review

About Instructors:
Our instructors are working professionals graduated from premier institutes like BITS
Pilani, IIT Roorkee and working in companies as Data Scientist/Machine Learning
Engineer and Artificial Intelligence expert.

Instructors Working in

Course Prerequisite:
There is no Prerequisite for this course as we cover programming and statistics from basics.
We provide special classes & support for professionals from non-programming/
non-technical background.

Fees and Duration:

Weekday Batches : 4 Months Program Fee:


(Mon to Fri - 2 hrs everyday, Classroom (Bangalore) :Rs. 59,000/- + tax
8:00 am to 10:00 am IST) Live Online :Rs. 49,000/- + tax
To know more about applicable discount, Live
Weekend Batches : 6 Months chat on Whatsapp :
(Sat & Sun - 4 hrs 
morning & afternoon slot available) Click to Whatsapp

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Modules And Tools

PYTHON STATISTICS MACHINE


LEARNING
DEEP LEARNING TIME SERIES NATURAL
USING ANALYSIS AND
FORECASTING
LANGUAGE
TENSORFLOW PROCESSING

Deployment of Machine
Learning Model on Google
Cloud

Real world
Industry Banking Insurance
Finance Retail Supply chain
Project
from
multiple
domains

Healthcare Telecom Manufacturing E-commerce Automotive

Interview
Prep
&
Job
Assistance
Resume Prep Interview Prep Mock Interviews Job Referrals in
Program
Session Session By Expert data science

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Global Certification in Data
Science And AI
Become an industry expert with Data Scientist & AI Master’s Program in
collaboration with IBM. Upon completion of this Program, you will receive the
certificate from IBM which will help you to become industry ready.

Get Industry-renowned global certification in Data Science and Artificial


Intelligence. Our certification is recognized globally and industry wide in
companies like JP Morgan, Morgan Stanley, Wells Fargo, Antuit , Genpact,
Cognizant, Delloite, E&Y, Tredence Analytics, Mu-sigma and other top MNCs and
Banking & Finance companies.

Sample Certificate

Download Certificate

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Demo & Sample Class Recordings

Watch more demo session

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Job Assistance

3
1 2 4 5
Resume
Certificate Project Preparation Job Referral
Update

After completion Attend project After Start preparing Once you get
of your program sessions from certification and youself with eligible, you will
you have to pass industry experts project session mock interviews start getting
final exam to get to get a hands on update your and guided guarenteed
IBM Certificate. experience of real resume. interview Interview Calls
time projects. sessions.

Eligibility Criteria

Should have completed Term 1,2 and Term 3 of our program (Refer
Course brochure for details)
Should have more than 1 Years of work experience (in any Domain)
Should have scored passing marks in IBM final Certification exam
Should have completed 70% of Assignments and case studies
At-least completed 2 Projects (Mentored and guided by our expert)

To know more about Guaranteed Interview call, Job Referral &


Industrial Projects

Download Project & Job Referral Brochure

Whatsapp Now

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Placement And Success Stories

Manu Agrawal Dhruv Satyam Bikash Bhuyan


Working at Microsoft Working at Infosys Data Scientist at Shell
My journey with Learnbay has All the faculties/trainers are
Everything about this program is
excelled me, bringing out my superb.They know the concepts
credible
talent to its utmost. I would of their respective areas.. They
If you miss any class you can
personally like to thank Krishna, are well versed that what a new
watch recorded sessions
Pankaj, Utkarsh and all the people comer wants to know &
All practice and real time codes
whose efforts are involved with understand..Really a superb
are available in repository and institute & awesome
this institute. The trainers here are
the best part is you can shift trainers.Outstanding institute for
very friendly and professional at
batches as per your convenience Data Science for professionals.
the same time.

View LinkedIn Page View LinkedIn Page View LinkedIn Page

Rahul Anand Keerti Bafna Suman Karmakar


Data Scientist at Bridgei2i Technical specialist at IBM
Data Scientist at Antuit
Learnbay is one of the best It was a good and effective
I joined the Data Science batch of
institutes in Bangalore. The faculty course with dedicated
September 2018. The trainer was
members are experienced working faculties for modules.You
Amritansh. And since then i have
professionals and they help you to get flexibility to attend
build the concepts in order to evolved in Machine Learning
classes from multiple
achieve your goals. The whole drastically . The trainer is very
instructors.Very Supportive
course and practical sessions are educated and teaches passionately
environment for learning.
very helpful specially in the field of The staff is supporting and you can
data science. re-attend and switch classes
anytime

View LinkedIn Page View LinkedIn Page View LinkedIn Page

Srikanth Saurav Aswini Dindukurthy Shakti Suwan


Senior Data Scientist at EY Working at Deloitte Lead Analyst at Amex
Machine Learning concepts & I have taken Data Science course I Joined Learnbay as Fresher
Statistics are very well explained from Learnbay 3 years back, it is And Attended training in data
by Utkarsh. Best thing was Excellent training center. After my science And Artificial
completing the syllabus on-time as training I was equal to 3+ exp. I Intelligence.Course is job
they have promised. Trainers are had a very good trainer , Real- oriented, Practical and in-depth
clearing the doubts . Got multiple Time Project Oriented Classes, .To the point, well versed
joining offers from different MNCs but one thing I have to say to all trainers, well engineered
1 YEAR
for Data Science and AI developer that daily practice is very much course. Superb!!
needed.
SUBSCRIPTION

View LinkedIn Page View LinkedIn Page View LinkedIn Page

Rajeev Kumar
Consultant at Tata Group
Good Trainer and nice supportive
environment.One of the best
classroom institute in Bangalore
for working professionals looking
Read More Reviews
to change their domain to data
science.

View LinkedIn Page

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


One Year Flexible Subscription

About One Year Classroom Subscription:

One year Flexible Subscription program is designed for working professional so that you can learn
at your pace without missing any classes. With this program, you get access to attend multiple
classroom/Faculty led online batches for a period of 1 year.

Learn at your own pace with unlimited


flexible access of multiple batches.
Option to attend multiple batches from
different instructors in classroom/live
online mode
Backup classes from other batches.
You can attend weekdays batch or
weekend or both based on your
availability
Repeat or revise modules multiple
times.

Program Fee

CLASSROOM MODE :
Rs. 59,000 +taxes CLICK HERE
TO GENERATE
DISCOUNT
ONLINE MODE :
Rs. 49,000 +taxes COUPON

PAY IN 6 INTEREST FREE EMI

INTEREST FREE INSTANT LOAN NO COST EMI ON MAJOR


WITHOUT CREDIT CARD CREDIT CARDS
Aadhar Card and Pan Card required ICICI, HDFC, RBL, Standard Chartered,
Axis bank,Kotak credit cards

Click here to apply for INTEREST FREE LOAN

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


How to Apply For this Program ?

Step 1 :

Talk to Our Admission Executive


Contact our Admission Team for more details on course eligibility ,Queries
on course curriculum, Certification etc.. If your profile is suitable for this
course, you will be further guided for detailed counselling and Profile Review
sessions.

Request A Callback Whatsapp Now

Step 2 :

Apply For Profile Review & Personalized Counselling


Attend Personalised Career Counselling and profile review session with
expert. This session will help you to understand whether your profile is
suitable for data science and AI certification course.
Note:You can attend this session online or visting our HSR center (Bangalore)

Apply For Profile Review


Step 3 :

Pay and Enrol For this Program:


Contact our Admission Officer for discount coupon. Apply the discount
coupon and enrol for IBM certified Program .

Pay and Enroll for the program 

Admission Officer :

Abhishek Gupta Chetna Ahuja


Mail id: abhishek.gupta@learnbay.co Mail id: chetna.ahuja@learnbay.co
Mob Num : +91 7760071231 Mob Num : +91 8296432774
Syllabus | 4 Terms | 6 Months

TERM 1 :

Modules/Tools : Core Python + Numpy + Pandas + Matplotlib + Seaborn


Term Duration : 5 Weeks (40 hours) : : 1.5 Months

TERM 2 :
Modules/Tools : Statistics (3 weeks - 24 hrs) + Machine Learning ( 6 Week - 48 hrs ) +
Capstone Project
Term Duration : 9 Weeks (72 hours) : : 2 Months

TERM 3 :
Modules/Tools : Deep Learning using Tensor-flow (2 Weeks - 16 hours) + Natural
Language Processing & Text Analytics (3 Weeks - 20 hours) +
Capstone Project
Term Duration : 5 Weeks ( 40 hours) : : 1 Month

Final Exam for IBM Certification after Term 3

Important Note :
After Successful completion of term 1, term 2 and term 3, Candidates become eligible for Job
Assistance Program (2- 3 weeks) which includes :
Resume Session and Assistance
Interview Prep Session & Mock Interview
Participating in Live Kaggle Competitions
List of Important Interview Questions from each modules
Guaranteed Job Referrals for Data Science/ML engineer roles
Certification From IBM ( Upon scoring passing marks in IBM final Exam)
You can start attending interviews after Term 3 and keep learning other modules from
Term 4 simultaneously.

TERM 4 :
Modules/Tools : (SQL  + MongoDB ) + (Tableau + PowerBI) + Cloud Deployment of
ML Model using GCP +( Hadoop basics & Apache Spark )  + R
Programming
Term Duration : 9 Weeks ( 72 hours) : : 2 Months

Attend guided session for real time projects from multiple domain and get
project Support/Mentorship from expert instructors.

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Term 1
MODULE 1 : PYTHON FOR DATA SCIENCE | 40 hours
Python

1. Programming Basics & 2. Python Programming Overview


Environment Setup Python Overview
Installing Anaconda, Anaconda Python 2.7 vs Python 3
Basics and Introduction Writing your First Python Program
Get familiar with version control, Git Lines and Indentation, Python
and GitHub. Identifiers
Basic Github Commands. Various Operators and Operators
Introduction to Jupyter Notebook Precedence
environment. Basics Jupyter Getting input from
notebook Commands. User,Comments,Multi line
Programming language basics. Comments.

3. Strings, Decisions And Loop 4. Python Data Types


Control List,Tuples,Dictionaries 
Working With Numbers, Booleans Python Lists,Tuples,Dictionaries
and Strings,String types and formatting, Accessing Values,Basic Operations
String operations Indexing, Slicing, and Matrixes
Simple if Statement, if-else Statement Built-in Functions & Methods
if-elif Statement. Exercises on List,Tuples And Dictionary
Introduction to while Loops. Class hands-on :
Introduction to for Loops,Using Program to convert tuple to dictionary
continue and break. Remove Duplicate from Lists
Class hands-on : Python program to reverse a tuple
Program to add all elements in list.
6 programs/coding exercise on string,
+ 3 more programs to be covered in class
loop and conditions in classroom

5. Functions And Modules 6. File I/O And Exceptional Handling


Introduction To Functions – Why and Regular Expression
Defining Functions Opening and Closing Files
Calling Functions open Function,file Object Attributes
Functions With Multiple Arguments. close() Method ,Read,write,seek.
Anonymous Functions - Lambda Exception Handling, try-finally Clause
Using Built-In Modules,User-Defined Raising an Exceptions,User-Defined
Modules,Module Namespaces, Exceptions
Iterators And Generators Regular Expression- Search and Replace
Class hands-on : Regular Expression Modifiers
8+ Programs to be covered in class from Regular Expression Patterns,re module
functions, Lambda, modules, Generators Class hands-on :
and Packages. 10+ Programs to be covered in class from File
IO,Reg-ex and exception handling.

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Term 1
MODULE 1 : PYTHON FOR DATA SCIENCE | 32 hours
Python

7. Data Analysis Using Numpy And 8. Data Visualisation using Python:


Pandas Matplotlib and Seaborn
Introduction to Numpy. Array Matplotlib:
Creation,Printing Arrays, Basic Operation - Introduction,plot(),Controlling Line
Indexing,Slicing and Iterating, Shape Properties,Subplot with Functional
Manipulation - Changing shape,stacking Method, MUltiple Plot, Working with
and spliting of array Multiple Figures,Histograms
Vector stacking, Broadcasting with Numpy, Seaborn :
Numpy for Statistical Operation. Intro to Seaborn And Visualizing
Pandas : Introduction to Pandas statistical relationships , Import and
Importing data into Python Prepare data .Plotting with categorical
Pandas Data Frames,Indexing Data Frames data and Visualizing linear
,Basic Operations With Data relationships
frame,Renaming Columns,Subletting and Seaborn Exercise
filtering a data frame.

Real time Use cases in Python to be Covered in Class

3 Case Study on Numpy, Pandas , Matplotlib


1 Case Study on Pandas And Seaborn
Assessment Test in Python : 2 hour of Assesment Test in Python ( Coding & Objective Questions )

Assignment 1 (Week 1):


10 Coding exercises on Python Basics - Variables, Operators, Strings, Loops
Assignment 2 (Week 2):
10 Python Programs and practice set on List,Tuples ,Dictionaries & matrices operations

Assignment 3 (Week 3):


10 Coding exercises on Functions, File And Regular Expression

Assignment 4 (Week 4):


15 Programs and Practice set Questions on Numpy and Pandas

Assignment 5 (Week 5):


2 Case Studies using Numpy Pandas and Matplotlib.

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Term 2
MODULE 2 : STATISTICS FOR DATA SCIENCE | 24 hours
Stats & ML

1.  Fundamentals of Math and 2. Descriptive Statistics


Probability  Describe or sumarise a set of data
Basic understanding of linear algebra, Measure of central tendency and
Matrics, vectors measure of dispersion.
Addition and Multimplication of matrics The mean,median,mode, curtosis and
Fundamentals of Probability skewness
Probability distributed function and Computing Standard deviation and
cumulative distributed function. Variance.
Class Hand-on Types of distribution.
Problem solving using R for vector Class Handson:
manupulation 5 Point summary BoxPlot
Problem solving for probability Histogram and Bar Chart
assignments Exploratory analytics R Methods

3. Inferential Statistics
conti..
What is inferential statistics
Type-l error and Type-ll errors
Different types of Sampling techniques
P-Value and Z-Score Method
Central Limit Theorem
T-Test, Analysis of variance(ANOVA)
Point estimate and Interval estimate
and Analysis of Co variance(ANCOVA)
Creating confidence interval for
Regression analysis in ANOVA
population parameter
Class Hands-on:
Characteristics of Z-distribution and T-
Problem solving for C.L.T
Distribution
Problem solving Hypothesis Testing
Basics of Hypothesis Testing
Problem solving for T-test, Z-score
Type of test and rejection region
test
Type of errors in Hypothesis resting,
Case study and model run for ANOVA,
conti..
ANCOVA

4. Hypothesis Testing 5. Data Processing & Exploratory


Hypothesis Testing Data Analysis
Basics of Hypothesis Testing Introduction to Data Cleaning
Type of test and Rejection Region Data Pre-processing
Type o errors-Type 1 Errors,Type 2 What is Data Wrangling?
Errors How to Restructure the data?
P value method,Z score Method. What is Data Integration?
The Chi-Square Test of Independence Data Transformation
Regression EDA : Finding and Dealing with Missing
Factorial Analysis of Variance Values.What are Outliers? Using Z-
Pearson Correlation Coefficients in Depth scores to Find Outliers. Introduction to
Statistical Significance, Effect Size, and Bivariate Analysis,Scatter Plots and
Confidence Intervals Heatmaps. Introduction to Multivariate
Analysis

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Term 2
MODULE 3 : MACHINE LEARNING ALGORITHMS | 48 hours  Stats & ML

Introduction To Machine Learning 1. Supervised Learning


What is Machine Learning? Support Vector Machines
Introduction to Supervised and Linear regression
Unsupervised Learning Logistic regression
Introduction to SKLEARN Naive Bayes
(Classification, Regression, Linear discriminant analysis
Clustering, Dimensionality Decision tree
reduction, Model selection, k-nearest neighbor algorithm
Preprocessing) Neural Networks (Multilayer
What is Reinforcement Learning? perceptron)
Machine Learning applications Similarity learning
Difference between Machine
Learning and Deep Learning

2. Linear Regression 3. Logistic Regression


Introduction to Linear Regression Introduction to Logistic Regression.– Why
Linear Regression with Multiple Logistic Regression .
Variables Introduce the notion of classification
Disadvantage of Linear Models Cost function for logistic regression
Interpretation of Model Outputs Application of logistic regression to
Understanding Covariance and multi-class classification.
Colinearity Confusion Matrix, Odd's Ratio And ROC
Understanding Heteroscedasticity Curve
Advantages And Disadvantages of
Case Study – Application of
Logistic Regression.
Linear Regression for Housing
Case Study:To classify an email as spam
Price Prediction
or not spam using logistic Regression.

4. Decision Trees Case Study:


Decision Tree – data set 1 Business Case Study for Kart
How to build decision tree? Model
Understanding Kart Model 2 Business Case Study for  Random
Classification Rules- Overfitting Forest
Problem 3 Business Case Study for  SVM
Stopping Criteria And Pruning
How to Find final size of Trees?
Model A decision Tree.
Naive Bayes
Random Forests and Support Vector
Machines
Interpretation of Model Outputs

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Term 2
MODULE 3 : MACHINE LEARNING ALGORITHMS | 48 hours
Stats & ML

5. Unsupervised Learning 6. Natural language Processing


Hierarchical Clustering Introduction to natural Language
k-Means algorithm for clustering – Processing(NLP).
groupings of unlabeled data points. Word Frequency Algorithms for NLP
Principal Component Analysis(PCA)- Sentiment Analysis
Data  Case Study :
Independent components analysis(ICA) Twitter data analysis using NLP
Anomaly Detection
Recommender System-collaborative
filtering algorithm
Case Study– Recommendation Engine
for e-commerce/retail chain

7. Introduction to Time Series 8. ARIMA and Multivariate Time


Forecasting Series Analysis
Basics of Time Series Analysis and Introduction to ARIMA Models,ARIMA
Forecasting ,Method Selection in Model Calculations,Manual ARIMA
Forecasting Parameter Selection,ARIMA with
Moving Average (MA) Forecast Explanatory Variables
Example,Different Components of Understanding Multivariate Time
Time Series Data ,Log Based Series and Their Structure,Checking
Differencing, Linear Regression For for Stationarity and Differencing the
Detrending MTS 
Case Study : Performing Time Series
Analysis on Stock Prices 

Important Note :
All  Machine Learning Algorithms are covered in depth with Real time case studies for each Algorithm 
Once 60% of ML is completed , Capstone Project will be released for the batch.

Assignments :
Statistics Assignments : Total 4 practice set and Assignments from Statistics
Machine Learning Assignments : Total 3 Practice Set And 2 Real time use case as Assignments

Assessment Test For Term2 :


Duration : 3 hours
Question Type : Objective & ML Case Studies

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Term 3
MODULE 4 : TENSORFLOW & DEEP LEARNING | 16 hours Deep Learning
& NLP

1. Introduction to Deep Learning 2. Introduction to Tensor Flow


And Tensor Flow Installing TensorFlow
Neural Network Simple Computation ,Contants And
Understaing Neural Network Model Variables
Installing TensorFlow Types of file formats in TensorFlow
Simple Computation ,Contants And Creatting A Graph - Graph
Variables Visualization
Types of file formats in TensorFlow Creating a Model - Logistic
Creatting A Graph – Graph Regression
Visualization Model Building
Creating a Model  – Logistic Regression TensorFlow Classification Examples
Model Building using tensor flow
TensorFlow Classification Examples

3.. Understanding Neural 4. Convolutional Neural


Networks With Tensor Flow Network(CNN)
Basic Neural Network Convolutional Layer Motivation
Single Hidden Layer Model Convolutional Layer Application
Multiple Hidden Layer Model Architecture of a CNN
Backpropagation – Learning Pooling Layer Application
Algorithm Deep CNN
and visual representation Understanding and Visualizing a
Understand Backpropagation – Using CNN
Neural
Network Example Project : Building a CNN for Image
TensorBoard Classification
Project on backpropagation

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Term 3
MODULE 5 : NATURAL LANGUAGE PROCESSING | 20 hours
Deep Learning
& NLP

Information Extraction
Machine Translation Information Retrieval

NLP
Sentiment Analysis Question Answering

1. Introduction to NLP & Text 2. Text Pre Processing Techniques


Analytics Need of Pre-Processing
Introduction to Text Analytics Various methods to Process the Text
Introduction to NLP data
What is Natural Language Processing? Tokenization ,Challenges in
What Can Developers Use NLP Tokenization
Algorithms For? Stopping ,Stop Word Removal
NLP Libraries Stemming - Errors in Stemming
Need of Textual Analytics Types of Stemming Algorithms -
Applications of Natural Language Table
Procession lookup Approach ,N-Gram Stemmers
Word Frequency Algorithms for NLP
Sentiment Analysis

3. Distance Algorithms used in Text 4. Information Retrieval Systems


Analytics Information Retrieval -
String Similarity Precision,Recall,F- score
Cosine Similarity Mechanishm - TF-IDF
Similarity KNN for document retrieval
between Two text documents K-Means for document retrieval
Levenshtein distance - measuring the Clustering for document retrieval
difference between two sequences
5. Projects And Case Studies
Applications of Levenshtein distance
a. Sentiment analysis for twitter, web
LCS(Longest Common Sequence )
articles
Problems
b. Movie Review Prediction
and solutions ,LCS Algorithms
c. Summarization of Restaurant
Reviews

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Term 4
MODULE 6 : SQL & MONGODB | 14 hours Value Added
Skillset

1. RDBMS And SQL Operations : 2. NoSQL Databases : 


Introduction To RDBMS  Topics - What is HBase?
Single Table Queries - HBase Architecture, HBase
SELECT,WHERE,ORDER Components,
BY,Distinct,And ,OR  Storage Model of HBase,
Multiple Table Queries:  INNER, SELF, HBase vs RDBMS
CROSS, and OUTER, Join, Left Join, Introduction to Mongo DB, CRUD
Right Join, Full Join, Union  Advantages of MongoDB over
Advance SQL Operations: RDBMS
Data Aggregations  and summarizing Use cases
the data
Ranking Functions: Top-N Analysis
Advanced SQL Queries for Analytics

3. Programming with SQL :  4. MongoDB Overview :


Mathematical Functions Where MongoDB is used?
Variables MongoDB Structures
Conditional Logic
MongoDB Shell vs MongoDB Server
Loops
Data Formats in MongoDB
Custom Functions
Grouping and Ordering MongoDB Aggregation Framework
Partitioning Aggregating Documents
Filtering Data What are MongoDB Drivers?
Subqueries

5. Basics and CRUD Operation : 6. Introduction to MongoDB :


Databases, Collection & Documents What is MongoDB?
Shell & MongoDB drivers Charateristics and Features
What is JSON Data MongoDB Ecosystem
Create, Read, Update, Delete Installation process
Finding, Deleting, Updating, Connecting to MongoDB database
Inserting Elements Introduction to NoSQL
Working with Arrays Introduction of MongoDB module
Understanding Schemas and What are ObjectIds in MongoDb
Relations

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Term 4
MODULE 7 : TABLEAU AND  POWER BI | 16 hours Value Added
Skillset

1. Introduction to Tableau : 2. Visual Analytics :


Connecting to data source Getting Started With Visual Analytics
Creating dashboard pages Sorting and grouping
How to create calculated columns Working with sets, set action
Different charts Filters: Ways to filter, Interactive Filters
Hands-on : Forecasting and Clustering
Hands on on connecting data source Hands-on :
and data cleansing Hands on deployment of Predictive
Hands on various charts model in visualization

3. Dashboard and Stories : 4. Mapping :


Working in Views with Dashboards Coordinate points
and Stories Plotting Latitude and Longitude
Working with Sheets Custom Geocoding
Fitting Sheets Polygon Maps
Legends and Quick Filters WMS and Background Image
Tiled and Floating Layout
Floating Objects

5. Getting Started With Power BI : 6. Programming with Power BI : 


Installing Power BI Desktop and Working with Timeseries
Connecting to Data Understanding aggregation and
Overview of the Workflow in Power BI granularity
Desktop Filters and Slicers in Power BI
Introducing the Different Views of the Maps, Scatterplots and BI Reports
Data Mode Connecting Dataset with Power BI
Query Editor Interface Creating a Customer Segmentation
Working on Data Model Dashboard
Analyzing the Customer Segmentation
Dashboard

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Term 4
MODULE 8 : BIG DATA AND SPARK ANALYTICS | 12 hours Value Added
Skillset

1. Introduction To Hadoop :  2. Apache  Spark Analytics : 


Distributed Architecture - A Brief What is Spark
Overview Introduction to Spark RDD
Understanding Big Data Introduction to Spark SQL and
Introduction To Hadoop ,Hadoop Dataframes
Architecture Using R-Spark for machine learning
HDFS ,Overview of MapReduce Hands-on:
Framework installation and configuration of Spark
 Hadoop Master – Slave Architecture
MapReduce Architecture Using R-Spark for machine learning
Use cases of MapReduce programming

3. Apache  Spark Analytics :  Hands-on:


Getting to know PySpark Map reduce Use Case 1 : Youtube data
Pyspark Introduction analysis
Pyspark Environment Setup Map reduce Use Case 2:   Uber Data
pySpark - Spark context Analytics
RDD , Broadcast and
Accumulator Hands-on:
Sparkconf and Sparkfiles Spark RDD programming
Spark MLlib Overview Hands-on:
,Algorithms and utilities in Spark Spark SQL and Dataframe
Mlib programming

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Term 4
MODULE 9 : R PROGRAMMING | 12 hours Value Added
Skillset

1. Introduction To R :  2. Programming with R :


Installation Setup Creating an object
Quick guide to RStudio User Interface Data types in R
RStudio's GUI3 Coercion rules in R
Changing the appearance in RStudio Functions and arguments
Installing packages in R and using the Matrices
library Data Frame
Development Environment Overview Data Inputs and Outputs with R
Introduction to R basics Vectors and Vector operation
Building blocks of R Advanced Visualization
Core programming principles Using the script vs. using the
Fundamentals of R console

3. Manipulating Data : 4. Visualizing Data :


Data transformation with R - the Intro to data visualization
Dplyr package - Part Introduction to ggplot2
Data transformation with R - the Building a histogram with ggplot2
Dplyr package - Part Building a bar chart with ggplot2
Sampling data with the Dplyr Building a box and whiskers plot
package with ggplot2
Using the pipe operator in R Building a scatterplot with
Tidying data in R - gather() and ggplot2
separate()
Tidying data in R - unite() and
spread()

MODULE 10 : TRAINING AND DEPLOYING MACHINE LEARNING


MODEL USING GCP | 8 hours

1. Introduction To GCP Cloud ML 2. Training Machine Learning


Engine : Model :
Introduction to Google CloudML Developing a training application
Engine Packaging a training application
CloudML Engine in Machine Learning Running and monitoring a training
WorkFlow job
Components of Cloud ML Engine - Using hyperparameter tuning
Google Cloud Platform Console. Using GPUs for training models in
gcloud command-line tool and Rest the cloud
API

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Real Time Industry Projects

Domain - Banking & Finance Domain - Retail industry


DataSet : Banking Data DataSet : BigBazar/Future
Project : Loan Default Group
Prediction Project : Clustering Customers
The bank wants to improve their Big Bazaar has retail outlets across
services by finding interesting major metropolitan cities in India. With
groups of clients. Fortunately, the the help of machine learning algorithms
bank stores data about their we can better understand customer
clients, the accounts (transactions behaviour and understand their buying
within several months), the loans needs better.
already granted, the credit cards BigBazaar runs various loyalty
issued. This process of loan default programs, festive offers which provide
prediction can be done with their customer more opportunities to
machine learning algorithms. avail discounts.

Domain - Demand/Supply Domain - Demand/Supply


DataSet : IBM DataSet :  Uber & Rapido
Project - IBM HR Analytics  Project- Forecasting Uber
Demand
Applying analytic processes to the
human resource department of an The goal is to create an interactive
organization in the hope of dashboard using Tableau
improving employee performance This Tableau Dashboard can be used
and therefore getting a better to get historical insights into a
return on investment. neighborhood,
This is especially concerning if your For example,
business is customer facing, as see its upcoming forecasted demand,
customers often prefer to interact increase the accuracy,
with familiar people. decrease surge pricing events.

Domain -  Healthcare Domain - Banking & Finance


DataSet : Samsung DataSet : Banking Dataset
Project - Analyzing Health Data Project - Identify fraudulent
and tracking human activity credit card transactions.
The goal is to breakdown all the To recognize fraudulent credit
data that the Samsung Health card transactions so that
app has collected and see what customers are not charged for
useful insights we can gain by items that they did not purchase.
analyzing it. It involves various processes like
Data Cleaning, Data Visualization,
Insights generation, Model
generation, Feature Engineering
and so on.

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Domain - E-Commerce Domain - Travel & Hospitality
DataSet : Amazon Data DataSet : Airbnb
Project - Consumer Reviews of Project - Airbnb New User
Amazon Products Bookings
The goal is to analyze Amazon’s most The goal is to predict which country a
successful consumer electronics new user's first booking destination
product launches; discover insights will be.
into consumer reviews and assist with By accurately predicting where a new
machine learning models. user will book their first travel
What are the most reviewed Amazon experience, Airbnb can share more
products? personalized content with their
How do the reviews in the first 90 community, decrease the average
days after a product launch? time to first booking, and better
forecast demand.

Domain - Media and


Entertainment Domain - Retail
DataSet : Netflix DataSet :  Walmart
Project - Netflix Movies and TV Project - Walmart Sales
Shows Forecasting
Explore what all other insights can be This dataset contains the sales for
obtained from the list of tv shows and each department from the Walmart
movies available on Netflix as of dataset containing data of 45
2019. Understanding what content is Walmart stores, selected holiday
available in different countries markdown events are also included
Identifying similar content by . These markdowns are known to
matching text-based features affect sales, but it is challenging to
Network analysis of Actors / Directors predict which departments are
and find interesting insights. affected and the extent of the
impact.

Domain - Automation Domain - Manufacturing


DataSet : BMW dataset DataSet :  Bosch
Project -BMW Pricing Challenge Project - Bosch Production Line
To find a good statistical model to Performance
describe the value of a used car To predict internal failures using
depending on the basic thousands of measurements and
description  tests made for each component
How does the estimated value of along the assembly line. This would
a car change over time? Can you enable Bosch to bring quality
detect any patterns? products at lower costs to the end
How big is the influence of the user.
factors not represented in the The goal is to predict which parts
data on the price? will fail quality control

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Domain - Social Media Domain - Telecom
DataSet : youtube DataSet : Telecom
Project - Trending YouTube Video Project  - Identify And Predict
Statistics Customer churn in telecom
The dataset of this project are daily industry
record of the top trending YouTube The goal is to develop a churn
videos, to generate insights like : prediction model which assists
Sentiment analysis in a variety of telecom operators to predict
forms customers who are most likely
Categorising YouTube videos based on subject to churn.Also to
their comments and statistics understand the customer
Training ML algorithms like RNNs to behavior and reasons for
generate their own YouTube churn.Apply multiple
comments. classification models to predict
the customer churn in telecom
industry.

Domain - Supply Chain


DataSet : Dataco
Project - Smart Supply Chain for
Big Data Analysis
A DataSet of Supply Chains used by
the company DataCo Global is used
for the analysis. Dataset of Supply
Chain , which allows the use of
Machine Learning Algorithms and R
Software.
It also allows the correlation of
Structured Data with Unstructured
Data for knowledge generation.

Watch the videos to


know more about RAPIDO PROJECT FRAUD DETECTION
Projects :

CUSTOMER SEGMENTATION RETAIL PROJECT

click to Whatsapp @learnvista pvt. ltd. www.learnbay.co


Contact Us

Call Us Mail Us
7349-2222-63 contacts@learnbay.co

Visit Us Whatsapp Us
www.learnbay.co 7349-2222-63

click on icon to
Follow Us On

Marathahalli Office :
Learnbay,19/1,2nd Floor, Classic
Aura(Beside Aricent),Marathahalli -
Outer Ring Road,Kadubeesanahalli,
Bengaluru, Karnataka

HSR Office :
Learnbay ,147, 5th Main Rd, Rajiv
Gandhi Nagar, HSR Sector 7,Near
Salarpuria Serenity, Bengaluru,
Karnataka 560102
INDIA
+917349222263

You might also like