Professional Documents
Culture Documents
Dafm Cia
Dafm Cia
DONE BY
PRANESH R B
SECTION – N
2228139
DATASET SOURCES:
This dataset consist of data From 1985 Ward's Automotive Yearbook. This data set consists
of three types of entities: (a) the specification of an auto in terms of various characteristics,
(b) its assigned insurance risk rating, (c) its normalized losses in use as compared to other
cars. The second rating corresponds to the degree to which the auto is more risky than its
price indicates. Cars are initially assigned a risk factor symbol associated with its price. Then,
if it is more risky (or less), this symbol is adjusted by moving it up (or down) the scale.
Actuarians call this process "symboling". A value of +3 indicates that the auto is risky, -3 that
it is probably pretty safe. The third factor is the relative average loss payment per insured
vehicle year. This value is normalized for all autos within a particular size classification (two-
door small, station wagons, sports/speciality, etc…), and represents the average loss per car
per year.
USING THESE DATA, MULTIPLE ANALYSIS HAVE BEEN MADE;
1]To find the number of cars based on the fuel type.
2] To identify the body types and its count.
3] To identify the moving average of sale price of the car
4]To find the horsepower relating it to engine size.
Count of fuel-
Fuel type type
Diesel 20
Gas 185
Grand Total 205
Total
20
diesel
gas
185
INTERPRETATION:
This pie chart represents the number of gas and diesel car from the available data From
the above pie chart we can understand significant number of people use gas cars compared to
diesel. So this tells people prefer gas cars to diesel cars.
2. ORDINAL DATA:
The order of the values is important and significant in ordinal scales, but the
differences between them are not well understood. Ordinal data do not specify the
accurate difference between the data given.
Number of body
Car body type type
convertible 6
hardtop 8
hatchback 70
sedan 96
wagon 25
Grand Total 205
120
100 96
count of body style
80
70
60
Total
40
25
20
6 8
0
convertible hardtop hatchback sedan wagon
Body styles
INTERPRETATION:
The above bar graph represents the number of cars in each body type. We can
conclude by saying, sedan is the most preferred body type.
3.INTERVAL DATA:
The frequency distribution of a collection of data reveals how frequently each
unique value appears. The most typical graph for displaying frequency distribution is a
histogram. Although it resembles a bar chart in appearance there are significant variations.
Frequenc
Bin y
5118 1
7995.286 60
10872.57 46
13749.86 28
16627.14 22
19504.43 21
22381.71 5
25259 4
28136.29 1
31013.57 3
33890.86 3
36768.14 6
39645.43 2
42522.71 2
More 1
70
Histogram
60
50
40
Frequency
Frequency
30
Moving average (Frequency)
20
10
0
18 14 43 14 86 57 29 59 71 43 14 86 57 29 ore
51 857 57 1 85 7 142 428 71 4 252 285 57 1 85 7 14 2 428 71 4 M
.2 2. 9. 7. 4. 1. 6. 3. 0. 8. 5. 2.
9 9 5 087 374 66 2 95 0 238 8 1 3 101 389 676 96 4 252
7 1 1 1 1 2 2 3 3 3 3 4
Bin
INTERPRETATION:
The histogram is compared with bin and frequency to know the moving average of price of
the car. Prices are in euros.
4. RATIO DATA:
The most informative measurement scale is the ratio scale. It is an interval scale with the additional
property of indicating the absence of the quantity being measured at its zero position.
350
300
250
HORSE POWER
200
150
100
50
0
0 50 100 150 200 250 300 350
ENGINE SIZE
INTERPRETAION:
The above scatterplot diagram depicts the relationship between the engine size and
horse power. Hence both are directly proportionate.