Professional Documents
Culture Documents
Steps Assignment
Steps Assignment
What is a Feature?
A feature refers to one unique attribute or variable in our data set. Since
data is often stored in rows and columns, a feature can often be defined
as a single column.
For example, if we would like to predict the price of a car, the target
variable would be the Market Value. The predictor variables start as a long
list of attributes that, through feature engineering, is slimmed down and
manipulated to produce a set of effective predictor variables.
1. Data Cleansing
Data cleansing prepares the data to be readable by the model; this means
that all missing values are appropriately handled and that all features are
in the correct data type. A typical data cleansing decision can be regarding
outliers. In some cases, removing outliers in the data will result in the best
model, while, in other cases, the outliers should be kept as the outliers
provide the model with valuable information about edge cases.
2. Data Transformation
3. Feature Extraction