This presentation provides an overview of data science and a data science internship focused on using Python. It introduces data science and why Python is popular for it. It discusses understanding statistics, predictive modeling, machine learning basics and applications. The final project problem statement involves predicting if bank clients will subscribe to term deposits based on client and call data using machine learning techniques. Upon completion, an internship certificate will be provided.
• Python for Data Science.
• Understanding the Statistics for Data Science.
• Predictive Modeling and Basics of Machine Learning.
• The Final Project.

WHAT IS DATA SCIENCE?
• Data science is the study of data to extract meaningful insights for business.
• It is a multidisciplinary approach that combines principles and practices from the fields of mathematics, statistics, artificial intelligence, and computer engineering to analyze large amounts of data.

WHY IS PYTHON USED IN DATA SCIENCE?
• Python is a popular language for data science because it is easy to learn, has a large and active community, offers powerful libraries for data analysis and visualization, and has excellent machine-learning libraries.

APPLICATIONS OF DATA SCIENCE

UNDERSTANDING STATISTICS
• Statistics is the study and manipulation of data, including ways to gather, review, analyze, and draw conclusions from data.
• The two major areas of statistics are descriptive and inferential statistics.
• Three basic descriptive measures, known as measures of central tendency, are: (i) Mean, (ii) Median, and (iii) Mode.

PREDICTIVE MODELING
• Predictive modeling is a mathematical process used to predict future events or outcomes by analyzing patterns in a given set of input data.
• It is a crucial component of predictive analytics, a type of data analytics that uses current and historical data to forecast activity, behavior, and trends.
• Predictive modeling techniques include artificial neural networks, mathematical models, decision trees, linear regression, logistic regression, multiple regression, forecasting, and data mining.

BASICS OF MACHINE LEARNING
• Machine learning is a subset of artificial intelligence focused on building systems that can learn from historical data, identify patterns, and make logical decisions with little to no human intervention.
• Types of Machine Learning:
1. Supervised Machine Learning
2. Unsupervised Machine Learning
3. Semi-Supervised Machine Learning
4. Reinforcement Learning

APPLICATIONS OF MACHINE LEARNING
• Machine learning is used in:
• internet search engines,
• email filters, to sort out spam,
• websites, to make personalized recommendations,
• banking software, to detect unusual transactions,
• and many apps on our phones, such as voice recognition.

PROBLEM STATEMENT FOR FINAL PROJECT
• Your client is a retail banking institution. Term deposits are a major source of income for a bank.
• A term deposit is a cash investment held at a financial institution. Your money is invested for an agreed rate of interest over a fixed amount of time, or term.

PROBLEM STATEMENT FOR FINAL PROJECT (CONT.)
• The bank has various outreach plans to sell term deposits to its customers, such as email marketing, advertisements, telephonic marketing, and digital marketing.
• Telephonic marketing campaigns remain one of the most effective ways to reach people. However, they require a huge investment, as large call centers are hired to execute these campaigns. Hence, it is crucial to identify beforehand the customers most likely to convert, so that they can be specifically targeted via call.

PROBLEM STATEMENT FOR FINAL PROJECT (CONT.)
• You are provided with client data such as the age of the client, their job type, their marital status, etc. Along with the client data, you are also provided with information about the call, such as the duration of the call and the day and month of the call. Given this information, your task is to predict whether the client will subscribe to a term deposit.

RESULT OF THE GIVEN PROBLEM STATEMENT LOOKS LIKE THIS

INTERNSHIP COMPLETION CERTIFICATE
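
ILLUSTRATIVE SKETCH FOR THE FINAL PROJECT TASK
• The final project's prediction task (will a client subscribe to a term deposit?) is a binary classification problem, and logistic regression, one of the techniques listed under predictive modeling, is one plausible starting point. The sketch below is only an illustration under stated assumptions, not the project's actual solution: the eight records are invented, only two of the described inputs (client age and call duration) are used, and the min-max scaling, learning rate, and epoch count are arbitrary choices. It fits the model with hand-written gradient descent using only the Python standard library.

```python
import math

# Hypothetical toy records: (age in years, call duration in seconds, subscribed?).
# These values are invented for illustration; they are NOT from the project dataset.
RECORDS = [
    (25, 60, 0), (32, 90, 0), (41, 120, 0), (38, 150, 0),
    (29, 420, 1), (45, 480, 1), (52, 600, 1), (36, 540, 1),
]


def scale(records):
    """Min-max scale the two feature columns to the range [0, 1]."""
    columns = list(zip(*[r[:2] for r in records]))
    bounds = [(min(c), max(c)) for c in columns]
    rows = [
        [(v - lo) / (hi - lo) for v, (lo, hi) in zip(r[:2], bounds)]
        for r in records
    ]
    return rows, bounds


def sigmoid(z):
    """Map a real-valued score to a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(x, weights, bias):
    """Estimated probability that a client with scaled features x subscribes."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


def train(rows, labels, lr=0.5, epochs=2000):
    """Fit logistic regression weights with batch gradient descent."""
    weights = [0.0] * len(rows[0])
    bias = 0.0
    n = len(rows)
    for _ in range(epochs):
        grad_w = [0.0] * len(weights)
        grad_b = 0.0
        for x, y in zip(rows, labels):
            error = predict_proba(x, weights, bias) - y
            for j, xi in enumerate(x):
                grad_w[j] += error * xi
            grad_b += error
        # Step against the average gradient of the log loss.
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


rows, bounds = scale(RECORDS)
labels = [r[2] for r in RECORDS]
weights, bias = train(rows, labels)

# Classify each training record with a 0.5 probability threshold.
predictions = [int(predict_proba(x, weights, bias) >= 0.5) for x in rows]
accuracy = sum(p == y for p, y in zip(predictions, labels)) / len(labels)
print(f"training accuracy: {accuracy:.2f}")

# Score a hypothetical new client: 40 years old, 500-second call.
new_client = [(v - lo) / (hi - lo) for v, (lo, hi) in zip((40, 500), bounds)]
print(f"subscription probability: {predict_proba(new_client, weights, bias):.2f}")
```

• In a real project the model would be trained on the provided client and call data, evaluated on records held out from training, and would typically rely on an established machine-learning library rather than a hand-written training loop.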