Welcome to Scribd!

Data Science Projects

Uploaded by

0% found this document useful (0 votes)

24 views3 pages

An advanced practitioner's project walkthrough with ChatGPT would involve more hands-on exploration of the data through hypothesis testing and descriptive statistics. It would also have the practitioner generate some initial code and then use ChatGPT to refine it through iteration, while reviewing the changes to better understand the process. Focusing on areas like feature engineering that AI has not fully mastered yet helps practitioners continue building skills through hands-on projects.

Original Description:

Original Title

Data Science projects

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as docx, pdf, or txt

0% found this document useful (0 votes)

24 views3 pages

Data Science Projects

Uploaded by

Hanane Gríssette

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as docx, pdf, or txt

Jump to Page

You are on page 1of 3

Search inside document

An example of a beginner’s project walkthrough could look like this:

1. You feed ChatGPT the information about the rows and

columns of the data
2. You ask it to create boilerplate code to explore this data for
null values, outliers, and normality
3. You ask it what questions you should ask of this data
4. You ask it to clean the data and build the model for you to
make a prediction on the dependent variable

While it may seem like it is doing all the work for you, you still have to
get this project to run in your environment. You are also prompting and
problem solving as you go along.

There is no guarantee that it will work like there is when you’re copying
someone else’s project, so I feel like this is a nice learning middle
ground for involvement.

An Advanced Practitioner’s Project Walkthrough

Now, let’s think about how a more advanced practitioner would use
this:

1. You could follow the same steps of generating boilerplate code, but
this should be expanded upon. So, you might want to experiment with
more hands-on exploration of the data and hypothesis testing. Maybe,
choose one or two questions you want to answer with data and
descriptive statistics and start analyzing it.

2. For someone who has done a few projects, I recommend generating

some of the code yourself. Let’s say you made a simple bar chart in
plotly. You could feed that in and ask ChatGPT to reformat it, to change
the color or the scale, etc.

By doing this, you can rapidly iterate on visualizations, and you can see
in real time how different tweaks to the code change the graph. This
immediate feedback is great for learning.

3. I also think it is important that you review these changes and see
how they were made. Also if you don’t understand something, just ask
ChatGPT right there to expand on what it did.

4. More advanced practitioners should also focus more heavily on the

data engineering and the pipelines for productionizing code. These are
things that you still need to be fairly hands-on with. I found that
ChatGPT was able to get me part of the way there, but I needed to do a
lot of debugging myself.

5. From there, you may want to go through and have the AI run some
algorithms and do parameter tuning. To be honest, I think this will be
the part of data science that will be automated the fastest. I think
parameter tuning will see diminishing returns for normal practitioners,
but maybe not for the highest level Kagglers.
6. You should focus your time on feature engineering and feature
creation. This is also something that the AI models can help with, but
not completely master. After you’ve got some decent models, see what
data you can add, what features you can create, or what transforms you
can do to increase your results.

In a world with these advanced AI tools, I think it is even more

important to do projects than ever. You have to build things, and share
your work. Fortunately, with these AI tools, it is also easier than ever to
do that. It’s easier produce a web app. It’s easier to work with new
packages that you’ve never worked with before.

References

https://towardsdatascience.com/best-use-chatgpt-learn-data-science-easy-beginner-
b10299c49c4c

Ebook Prompt Engineering 101
Document26 pages
Ebook Prompt Engineering 101
FELIPE
100% (1)
The Crowdsourced Guide To The KPMG Virtual Internship PDF
Document15 pages
The Crowdsourced Guide To The KPMG Virtual Internship PDF
Shashank Singh
No ratings yet
Systems Thinking Book
Document42 pages
Systems Thinking Book
Arturo Andres
100% (4)
Create Your Custom ChatGPT With Transfer Learning
Document10 pages
Create Your Custom ChatGPT With Transfer Learning
Serena Pan
No ratings yet
5 Principles For Writing Effective Prompts
Document6 pages
5 Principles For Writing Effective Prompts
blkjack8
No ratings yet
CGPT For DS
Document24 pages
CGPT For DS
Sailendra Behera
100% (1)
ML Interview Questions
Document146 pages
ML Interview Questions
IndraneelGhosh
No ratings yet
Data Science Skills
Document4 pages
Data Science Skills
Jon
No ratings yet
Why Do AI Initiatives Fail
Document5 pages
Why Do AI Initiatives Fail
Md Ahsan Ali
No ratings yet
Machine Learning Project Example - Building A Model Step-By-Step PDF
Document9 pages
Machine Learning Project Example - Building A Model Step-By-Step PDF
SrinivasKannan
No ratings yet
How To Learn Data Science
Document8 pages
How To Learn Data Science
Atsal
100% (1)
Andrew Treadway - Software Engineering For Data Scientists (MEAP V03) - Manning Publications (2023)
Document319 pages
Andrew Treadway - Software Engineering For Data Scientists (MEAP V03) - Manning Publications (2023)
Erick Costa De Farias
No ratings yet
10 Things Know Before First Data Science Project
Document8 pages
10 Things Know Before First Data Science Project
shaikkulsum10
No ratings yet
Thesis 2 Logo Box
Document4 pages
Thesis 2 Logo Box
bk184deh
100% (2)
Non-Technical Database Design Errors: #1 Poor Planning
Document27 pages
Non-Technical Database Design Errors: #1 Poor Planning
prakba323109
No ratings yet
Manage Your Data Science Project Structure in Early Stage
Document7 pages
Manage Your Data Science Project Structure in Early Stage
Wellington Oliveira
No ratings yet
Week 2 - Select and Train A Model
Document29 pages
Week 2 - Select and Train A Model
Bhuwan Bhatt
No ratings yet
Ece3501 - Iot Fundamentals
Document41 pages
Ece3501 - Iot Fundamentals
Sudhir
No ratings yet
Preguntas Tests Udemy
Document338 pages
Preguntas Tests Udemy
David Clavijo
No ratings yet
Term Paper On Symbolic Debugging of Optimized Code
Document7 pages
Term Paper On Symbolic Debugging of Optimized Code
doawpfcnd
No ratings yet
CSC408Lab1.2 2
Document13 pages
CSC408Lab1.2 2
Sun Shine
No ratings yet
ML Projects For Final Year
Document7 pages
ML Projects For Final Year
Alia Khan
No ratings yet
ChatGPT For Nonfiction Authors - Leveraging AI for Impactful Writing: Series 1
From Everand
ChatGPT For Nonfiction Authors - Leveraging AI for Impactful Writing: Series 1
Anthony Joseph
No ratings yet
Life Cycle of DS Project
Document9 pages
Life Cycle of DS Project
Shivansh Ghelani
No ratings yet
6 Open Source Data Science Projects Interviewer
Document7 pages
6 Open Source Data Science Projects Interviewer
2016 Mat CORREA ALFARO SERGIO
No ratings yet
Builder Chat Log
Document124 pages
Builder Chat Log
hussainsyedmohammed7
No ratings yet
Building Arduino Projects For The Internet of Things
Document5 pages
Building Arduino Projects For The Internet of Things
santhosh n prabhu
No ratings yet
Can Ai Help Manage The Data Needed For Soc Verification?
Document3 pages
Can Ai Help Manage The Data Needed For Soc Verification?
Anoop Kumar
No ratings yet
Final_Report_Instructions
Document2 pages
Final_Report_Instructions
Eduardo Pereira Bártolo
No ratings yet
(Share - Extarnal) The Science Algorithm - Applicant Question
Document7 pages
(Share - Extarnal) The Science Algorithm - Applicant Question
adxn13
No ratings yet
AICE Milestone03 Safaa EL Haimoudi 31.03.2024
Document6 pages
AICE Milestone03 Safaa EL Haimoudi 31.03.2024
safaa el haimoudi
No ratings yet
Monetizing Machine Learning: Quickly Turn Python ML Ideas into Web Applications on the Serverless Cloud
From Everand
Monetizing Machine Learning: Quickly Turn Python ML Ideas into Web Applications on the Serverless Cloud
Manuel Amunategui
No ratings yet
RFC Research Paper
Document8 pages
RFC Research Paper
afednabte
100% (1)
Robotics Design Process:: Unit-Iii Robot Design
Document4 pages
Robotics Design Process:: Unit-Iii Robot Design
Nithya Paranthaman
No ratings yet
Executive Overview: What Exists Already
Document5 pages
Executive Overview: What Exists Already
rizzapps
No ratings yet
Introduction To Data Science: Dataset
Document13 pages
Introduction To Data Science: Dataset
yogesh
No ratings yet
Importance of MATLAB in Software Engineering
Document3 pages
Importance of MATLAB in Software Engineering
Ahsan Nawaz
No ratings yet
Hands On Machine Learning With Scikit Learn and Tensorflow
Document31 pages
Hands On Machine Learning With Scikit Learn and Tensorflow
sumasuthan
0% (1)
Sharepoint: 10 Things You Wish They Had Told You, Part 1: Found The Following
Document6 pages
Sharepoint: 10 Things You Wish They Had Told You, Part 1: Found The Following
semalaiappan
No ratings yet
A Learning Path To Becoming A Data Scientist - by Sara A. Metwalli - Oct, 2020 - Towards Data Science
Document9 pages
A Learning Path To Becoming A Data Scientist - by Sara A. Metwalli - Oct, 2020 - Towards Data Science
ritu
No ratings yet
(BONUS) Supercharge Your Programming With ChatGPT - QuickStart Guides
Document12 pages
(BONUS) Supercharge Your Programming With ChatGPT - QuickStart Guides
JabarH
100% (1)
Framework To Approach A Kaggle Problem: 1. Importing The Training / Test Population
Document2 pages
Framework To Approach A Kaggle Problem: 1. Importing The Training / Test Population
Govind Naik
No ratings yet
How To Prepare For An Interview at Google.
Document4 pages
How To Prepare For An Interview at Google.
Shashi Kolar
No ratings yet
2.1.1, 2.1.2, 2.1.3, 2.1.4 Hodder
Document20 pages
2.1.1, 2.1.2, 2.1.3, 2.1.4 Hodder
portable
No ratings yet
AICE Milestone03 Fenet Desta 31.03.24
Document9 pages
AICE Milestone03 Fenet Desta 31.03.24
Wordo bucha
No ratings yet
Calc MGR
Document38 pages
Calc MGR
nayakchandru2000
No ratings yet
Chatbot MSC
Document30 pages
Chatbot MSC
Wahid Khan
No ratings yet
7641 Assignment 1
Document4 pages
7641 Assignment 1
Muhammad Aleem
No ratings yet
How To Succeed in A System Design Interview
Document7 pages
How To Succeed in A System Design Interview
Bryan Lee
No ratings yet
Mastering Tableau 2023 (2023 - 4th)
Document60 pages
Mastering Tableau 2023 (2023 - 4th)
maan142219
No ratings yet
Refactoring A
Document2 pages
Refactoring A
api-3773276
100% (1)
10-Step Guide For Literally ANYONE To Land A 6-Figure FAANG Job
Document37 pages
10-Step Guide For Literally ANYONE To Land A 6-Figure FAANG Job
George
No ratings yet
CapStone Project
Document4 pages
CapStone Project
Manojay's Directionone
No ratings yet
ML Step by Step
Document10 pages
ML Step by Step
OUAFI Kheireddine
No ratings yet
Advice To Aspiring Data Engineers
Document2 pages
Advice To Aspiring Data Engineers
Henrique Santos
No ratings yet
2d66fc4-21a3-53b-4fd1-E4c1abe6e56 The Starter Guide For Modern Data
Document8 pages
2d66fc4-21a3-53b-4fd1-E4c1abe6e56 The Starter Guide For Modern Data
aiParacha
No ratings yet
Grumpy Testing Sample
Document14 pages
Grumpy Testing Sample
nermin_226
No ratings yet
Top Tableau Questionsand Answersin 2019
Document20 pages
Top Tableau Questionsand Answersin 2019
Vaishnavi Appaya
No ratings yet
Data Science Interview Resources
Document12 pages
Data Science Interview Resources
Krutika Sapkal
No ratings yet
PYTHON DATA SCIENCE: A Practical Guide to Mastering Python for Data Science and Artificial Intelligence (2023 Beginner Crash Course)
From Everand
PYTHON DATA SCIENCE: A Practical Guide to Mastering Python for Data Science and Artificial Intelligence (2023 Beginner Crash Course)
Calvert Long
No ratings yet
Mongodb DBA Homework 5.2
Document6 pages
Mongodb DBA Homework 5.2
g3r3cnd1
100% (1)
Guide 7 A
Document21 pages
Guide 7 A
engsamerhozin
No ratings yet
Functions of Several Variables (Part 3) : (S) Z 3x 3y - (S) F (X, Y)
Document8 pages
Functions of Several Variables (Part 3) : (S) Z 3x 3y - (S) F (X, Y)
Quang Dũng
No ratings yet
The Babylonian Zodiac - Robert Powell
Document6 pages
The Babylonian Zodiac - Robert Powell
be5992
No ratings yet
Geometry Viewed As A Difficult Mathematics
Document5 pages
Geometry Viewed As A Difficult Mathematics
International Journal of Innovative Science and Research Technology
No ratings yet
Aspen DMC3 Builder Jump Start Guide JSG
Document33 pages
Aspen DMC3 Builder Jump Start Guide JSG
nazmul hasan
No ratings yet
5th Sem
Document3 pages
5th Sem
Vinita Dahiya
No ratings yet
Squazzoni, Flaminio - Epistemological Aspects of Computer Simulation in The Social Sciences
Document190 pages
Squazzoni, Flaminio - Epistemological Aspects of Computer Simulation in The Social Sciences
Claudio Condori
100% (1)
Boolean Operations
Document5 pages
Boolean Operations
Kowsi Mathi
No ratings yet
Mil HDBK 727 PDF
Document569 pages
Mil HDBK 727 PDF
retrospect1000
No ratings yet
Beam Deflection Check
Document7 pages
Beam Deflection Check
ajith chandran
No ratings yet
TASK 6.5: Trigonometry 6
Document1 page
TASK 6.5: Trigonometry 6
dreaming cloudy
No ratings yet
The Atiyah-Hirzebruch Spectral Sequence: Caleb Ji
Document13 pages
The Atiyah-Hirzebruch Spectral Sequence: Caleb Ji
Caleb Ji
No ratings yet
Recent Advances in Antilock Braking Systems and Traction Control Systems
Document15 pages
Recent Advances in Antilock Braking Systems and Traction Control Systems
david_luz
No ratings yet
Senior High School: Rizal ST., Guimbal, Iloilo
Document4 pages
Senior High School: Rizal ST., Guimbal, Iloilo
Leah Labelle
No ratings yet
Partial Fraction
Document7 pages
Partial Fraction
Anonymous VASS3z0wTH
No ratings yet
04 Chep 11 Chemical Kinetics SET Final E
Document2 pages
04 Chep 11 Chemical Kinetics SET Final E
mridul
No ratings yet
Partial Fractions: Like Our Page For More Entry Test Materials & Admission Updates
Document4 pages
Partial Fractions: Like Our Page For More Entry Test Materials & Admission Updates
SAK
No ratings yet
Test Loop ADT Machine Communication-02.08.2017
Document3 pages
Test Loop ADT Machine Communication-02.08.2017
Sagar Pawar
No ratings yet
CBSE Class 10 Mathematics Sample Paper 1 Solutions PDF
Document7 pages
CBSE Class 10 Mathematics Sample Paper 1 Solutions PDF
Shona Khattar
No ratings yet
Sample-Tables Hands-On Activity
Document14 pages
Sample-Tables Hands-On Activity
concept tual
No ratings yet
Probabilistic Earthquake Hazard Assessment For Peninunsular India
Document27 pages
Probabilistic Earthquake Hazard Assessment For Peninunsular India
inigo38
No ratings yet
How To Influence Consumer Mindset: A Perspective From Service Recovery
Document13 pages
How To Influence Consumer Mindset: A Perspective From Service Recovery
zoya
No ratings yet
Crash Box Crash-Wothiness
Document5 pages
Crash Box Crash-Wothiness
abhishekverma5101459
No ratings yet
Technical Lettering
Document12 pages
Technical Lettering
thrift flip
No ratings yet
Unit Digits, Exponents, - Remainder Problems
Document5 pages
Unit Digits, Exponents, - Remainder Problems
Duc Anh
No ratings yet
Gurney Equations: Underlying Physics
Document7 pages
Gurney Equations: Underlying Physics
Dizzixx
No ratings yet
Senturk D. - Covariate-Adjusted Varying Coefficient Models (2006)
Document17 pages
Senturk D. - Covariate-Adjusted Varying Coefficient Models (2006)
Anonymous idBsC1
No ratings yet
Important Topics For GATE 2024 by S K Mondal
Document19 pages
Important Topics For GATE 2024 by S K Mondal
roseri
100% (1)