Professional Documents
Culture Documents
Product Proposal 1
Product Proposal 1
Product Proposal 1
PRODUCT
PROPOSAL
MARCH • 15 • 2021
VANISHA SWABHANAM
ASTROPHYSICS
INTRODUCTION AND STATEMENT OF PURPOSE
The product I wish to create is a model that basically takes in a certain point in
space that is bright and detects whether that point is a star or a galaxy. The output
will be the chance that that point could be one or the other (percentage). Since I
will use many different Python libraries and databases for this project that real
world astrophysicists use, I will gain astronomical data analysis experience. It will
create a foundation for when I get a job in the future since many astrophysicists
spend most of their time on the computer programming data functions. In doing
so, I hope to obtain a wider understanding of the data science and analysis part of
astrophysics. This project will be an application of physical data skills I learn.
This could be used as the foundation to a useful tool in the astrophysics world.
An astronomical machine learning model that distinguishes and classifies two
similar groups between datasets could be applied to many other research areas in
astronomy for astrophysicists to use.
METHODOLOGY
I will execute this project by inputting stellar and galactic surveys from public
databases such as NASA.gov and CERN laboratories into a Pandas dataframe, which
is used to make the dataset easier to read. Then, I will clean the dataset, the
process of fixing or removing incorrect, corrupted, incorrectly formatted, duplicate,
or incomplete data within a dataset. I will build this program using a machine
learning model called Random Forest Algorithm using astroanalysis libraries such as
SciKit Learn and NumPy in the Python programming package. 80% of the data from
the set will be used to train the AI model and the rest will be used to test it.
MATERIALS
MS. PRIYA
LINGUTLA
NIELSEN CO
DATA SCIENTIST