Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 25

Programming for Data Analysis

Presented by: Presented To:


Sagar Paudel (NP000418) Mr. R.N Thakur
Module Leader
BSc.IT 4rd Semester
Introduction

• R programming language developed in 1993 by Ross Ihaka and Robert


Gentleman.

• R is used for statistical computation, analysis of data, and research projects.

• most widely used languages for retrieving, cleaning, analyzing, visualizing,


and presenting data by scientists, computer engineers, experts, and
marketers.
Assumption

For the sake of greater comprehension, many assumptions have been made across
the dataset. Assumptions like columns with a lot of Not Available (NA) numbers
that only provide you the most basic information about the weather. Appropriate
programming practices such as comments, variable name standards, and indentation
should all be used while creating a good data set. At least 14 analytical examples
spanning data visualization, exploration, and manipulation should be included, with
at least two extra features outside of the course that might improve the findings.
Objective and Aim

Objective
• The data will be analyzed and visualized.
• To get the most out of data.
• To foresee and arrive at the correct decision for reference purposes.
Aim
• By analyzing a significant quantity of data contained in a csv file,
you may help forecast the future.
Problem Statement

Despite the fact that many organizations utilize excel files for data analysis, it is
causing problems with appropriate data analysis. So, in this task, R programs are
used to do R-script activities for visualization, manipulation, and exploration of
data approaches, which aid in forecasting purposes.
Hardware and software specification
Hardware requirement
• Laptop with minimum 2 GB of RAM and 2 core CPU Processor.
Software requirement
• Latest version RStudio, R Tools, Windows OS.
Code and Documentation
Installed Packages and load library
Analysis
Analysis 1
Source code:

Output:
Conti….
Analysis 2
Source code:

Output:
Conti….
Analysis 3
Code:

Output:
Conti….
Analysis 4

Code:

Output:
Conti….
Analysis 5

Code:

Output:
Conti….
Analysis 6
Code:

Output:
Conti….
Analysis 7

Code:

Output:
Conti….
Analysis 8
Code:

Output:
Conti….
Analysis 9

Code:

Output:
Conti….
Analysis 10

Code:

Output:
Conti….
Analysis 11

Code:

Output:
Conti….
Analysis 12

Code:

Output:
Conti….
Analysis 13

Code:

Output:
Conti…..
Analysis 14

Code:

Output:
Extra Features
Extra Features Analysis 1

Code:

Output:
Conti….
Extra Feature Analysis 2

Code:

Output:
Extra Features for improvement of results

While analyzing a particular data set, additional features such as the polar bar of the
dataset and the violin plot are generated to aid in the exploration of data analysis
expertise. Violin plots have comparable features to box plots, but they additionally
display the kernel probability density of the data at various levels. Polar bars, on the
other hand, help explain data in the polar coordinate system by constructing a graph
in the shape of a circular circle out of the data columns.
Conclusion

The "Hourlyweatherdata.CSV" file contains weather data information. For the


forecast, all of the available data is analyzed. The RStudio and r-script languages are
used for all data visualization, data exploration, data import, and data manipulation
tasks. Many diagrams are made, including bar plots, bar graphs, scatter plots,
histograms, tiles, and line charts. As a result, I've learned the value of data
gathering, as well as why and how data is analyzed.

You might also like