Ml-Data Wrangling-Assignment 01
Machine Learning
Assignment 01 (CLO-02)
Course Instructor: Aqsa Afzal
Date: Dated:
INSTRUCTIONS
Instructor Signature
Gathering of Data
Gathered three datasets in three different formats, each obtained in a different way:
1. Downloaded the [twitter_archive_enhanced.csv](https://d17h27t6h515a5.cloudfront.net/topher/2017/August/59a4e958_twitter-archive-enhanced/twitter-archive-enhanced.csv) file and read it into a pandas dataframe.
2. Programmatically downloaded the second file, 'image-predictions.tsv', from the provided [url here](https://d17h27t6h515a5.cloudfront.net/topher/2017/August/599fd2ad_image-predictions/image-predictions.tsv) using the Requests library.
3. Sourced additional data from the Twitter API using the Tweepy library, saved it into a text file, 'tweet_json.txt', and read that file line by line into a pandas dataframe.
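The three gathering steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the helper names (`download_file`, `read_tweet_json`, `gather`) are hypothetical, the archive CSV is assumed to be already downloaded by hand, and 'tweet_json.txt' is assumed to already exist with one JSON tweet object per line (the Tweepy querying step is not shown). Keeping only `id`, `retweet_count`, and `favorite_count` is also an assumption about which fields are of interest.

```python
import json
from pathlib import Path

import pandas as pd
import requests

# URL taken from the assignment text.
IMAGE_PREDICTIONS_URL = (
    "https://d17h27t6h515a5.cloudfront.net/topher/2017/August/"
    "599fd2ad_image-predictions/image-predictions.tsv"
)


def download_file(url, dest):
    """Download `url` with Requests, write it to `dest`, return the path."""
    response = requests.get(url, timeout=30)
    response.raise_for_status()  # stop on HTTP errors instead of saving an error page
    dest = Path(dest)
    dest.write_bytes(response.content)
    return dest


def read_tweet_json(path):
    """Read a file holding one JSON object per line into a dataframe.

    Assumption: each line is a tweet object with `id`, `retweet_count`
    and `favorite_count` keys; missing keys become None.
    """
    columns = ["tweet_id", "retweet_count", "favorite_count"]
    records = []
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:  # skip blank lines
                continue
            tweet = json.loads(line)
            records.append(
                {
                    "tweet_id": tweet.get("id"),
                    "retweet_count": tweet.get("retweet_count"),
                    "favorite_count": tweet.get("favorite_count"),
                }
            )
    return pd.DataFrame(records, columns=columns)


def gather(archive_path, data_dir="."):
    """Load all three datasets; `archive_path` points at the manually downloaded CSV."""
    data_dir = Path(data_dir)
    # 1. Manually downloaded CSV
    archive = pd.read_csv(archive_path)
    # 2. TSV downloaded programmatically with Requests
    tsv_path = download_file(IMAGE_PREDICTIONS_URL, data_dir / "image-predictions.tsv")
    predictions = pd.read_csv(tsv_path, sep="\t")
    # 3. Line-delimited JSON previously saved from the Twitter API
    tweets = read_tweet_json(data_dir / "tweet_json.txt")
    return archive, predictions, tweets
```

Calling `gather` with the path of the downloaded archive CSV would return the three dataframes in the order listed above; it needs network access for the second step.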
Most of the work has already been done; you only have to analyze the code and follow the text and comments provided.
Export the Python notebook as a PDF and submit it. For VS Code, install Markdown support (for comments, text, and PDF).
Do not include long outputs in the hard copy.
Outputs for the issues (mentioned in the notebook) should be visible.