Data Science Projects
While it may seem like it is doing all the work for you, you still have to
get this project to run in your environment. You are also prompting and
problem solving as you go along.
Unlike when you copy someone else’s finished project, there is no
guarantee that the generated code will work, so I feel like this is a nice
middle ground for hands-on learning.
Now, let’s think about how a more advanced practitioner would use
this:
1. You could follow the same steps of generating boilerplate code, but
you should expand on them. For example, you might experiment with
more hands-on exploration of the data and hypothesis testing. Choose
one or two questions you want to answer with the data, then start
analyzing it with descriptive statistics.
By doing this, you can rapidly iterate on visualizations, and you can see
in real time how different tweaks to the code change the graph. This
immediate feedback is great for learning.
3. I also think it is important that you review these changes and see
how they were made. If you don’t understand something, just ask
ChatGPT right there to expand on what it did.
5. From there, you may want to go through and have the AI run some
algorithms and do parameter tuning. To be honest, I think this will be
the part of data science that will be automated the fastest. I think
parameter tuning will see diminishing returns for typical practitioners,
but maybe not for the highest-level Kagglers.
6. You should focus your time on feature engineering and feature
creation. AI models can help with this too, but they have not
completely mastered it. After you’ve got some decent models, see what
data you can add, what features you can create, or what transforms you
can apply to improve your results.
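The steps above can be sketched in a few lines of Python. This is a minimal illustration, not the workflow from the original article: the housing data is synthetic and made up for the example, the names (width, length, has_garden, price, tuned_cv_score) are hypothetical, and Ridge regression with a small alpha grid stands in for whatever algorithms and parameters you would actually tune. The data is deliberately built so that an engineered feature matters, so the numbers it prints show the mechanics only and say nothing about real projects.

```python
import numpy as np
from scipy import stats
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV

# Synthetic data, made up for illustration: price depends on floor area
# (width * length) plus a fixed bonus for having a garden.
rng = np.random.default_rng(0)
n = 400
width = rng.uniform(2, 20, n)
length = rng.uniform(2, 20, n)
has_garden = rng.integers(0, 2, n)
price = 3.0 * width * length + 40.0 * has_garden + rng.normal(0, 20, n)

# Step 1: pick one concrete question ("do homes with a garden cost more?")
# and answer it with descriptive statistics and a hypothesis test.
with_garden = price[has_garden == 1]
without_garden = price[has_garden == 0]
mean_gap = with_garden.mean() - without_garden.mean()
# Welch's t-test: does not assume the two groups have equal variance.
t_stat, p_value = stats.ttest_ind(with_garden, without_garden, equal_var=False)
print(f"mean price gap: {mean_gap:.1f}, t = {t_stat:.2f}, p = {p_value:.4f}")


def tuned_cv_score(X, y):
    """Grid-search Ridge's alpha with 5-fold CV; return (best alpha, mean R^2)."""
    search = GridSearchCV(
        Ridge(),
        param_grid={"alpha": [0.01, 0.1, 1.0, 10.0, 100.0]},
        cv=5,
        scoring="r2",
    )
    search.fit(X, y)
    return search.best_params_["alpha"], search.best_score_


# Step 5: parameter tuning on the raw features only.
X_raw = np.column_stack([width, length, has_garden])
alpha_raw, r2_raw = tuned_cv_score(X_raw, price)

# Step 6: add one engineered feature (floor area) and tune again.
X_eng = np.column_stack([X_raw, width * length])
alpha_eng, r2_eng = tuned_cv_score(X_eng, price)

print(f"raw features:        best alpha = {alpha_raw}, CV R^2 = {r2_raw:.3f}")
print(f"with area feature:   best alpha = {alpha_eng}, CV R^2 = {r2_eng:.3f}")
```

Comparing the two cross-validated scores side by side is the useful habit here: it shows whether a new feature or a different parameter setting is actually responsible for a change in results.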
References
https://towardsdatascience.com/best-use-chatgpt-learn-data-science-easy-beginner-b10299c49c4c