ML UNIT 1
Kaggle Datasets
UCI Machine Learning Repository
Indian Government dataset
◦ https://data.gov.in/
A Dataset is a table with the data from which the machine learns. The dataset contains the features and the target to predict.
When used to induce a model, the dataset is called training data.
An Instance is a row in the dataset. Other names for ‘instance’ are: (data) point, example, observation.
The Features are the inputs used for prediction or classification. A feature is a column in the dataset.
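The terms above can be illustrated with a small sketch. The table below is a made-up toy dataset (the column names and values are assumptions chosen for illustration, not data from these notes): each row is an instance, each column except the last is a feature, and the last column is the target.

```python
# Hypothetical toy dataset: every row is one instance; the columns are
# the features, except the last column, which is the target to predict.
feature_names = ["outlook", "humidity", "wind"]
target_name = "play_tennis"

dataset = [
    # outlook,   humidity, wind,     play_tennis
    ("sunny",    "high",   "weak",   "no"),
    ("overcast", "high",   "weak",   "yes"),
    ("rain",     "normal", "strong", "no"),
]


def split_features_target(rows):
    """Separate each instance into its feature values and its target value."""
    X = [row[:-1] for row in rows]   # feature values, one tuple per instance
    y = [row[-1] for row in rows]    # target value, one per instance
    return X, y


X, y = split_features_target(dataset)
print(len(dataset), "instances,", len(feature_names), "features")
print("first instance:", dict(zip(feature_names, X[0])), "->", target_name, "=", y[0])
```

When such a table is handed to a learning algorithm to induce a model, it plays the role of the training data.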
In this example, the agent is given two options: a path with water or a path with fire.
A reinforcement learning algorithm works on a reward system: if the agent takes the fire path, points are subtracted from its reward, and the
agent learns that it should avoid the fire path.
If it had chosen the water path (the safe path), points would have been added to its reward, and the agent
would learn which path is safe and which is not.
In essence, the agent uses the rewards it obtains to improve its knowledge of the environment and to select its next action.
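The reward idea can be sketched in a few lines. This is a minimal illustration only, not a full reinforcement learning algorithm: the reward values (+1 for water, -1 for fire), the learning rate `alpha`, and the exploration rate `epsilon` are all assumptions chosen for the example.

```python
import random

# Assumed reward values: the notes only say points are added or subtracted.
REWARDS = {"water": +1.0, "fire": -1.0}


def train(episodes=200, epsilon=0.1, alpha=0.5, seed=0):
    """Learn a value estimate for each path from the rewards received."""
    rng = random.Random(seed)
    value = {path: 0.0 for path in REWARDS}   # the agent's current knowledge
    total_reward = 0.0
    for _ in range(episodes):
        if rng.random() < epsilon:
            action = rng.choice(list(REWARDS))    # explore: try a random path
        else:
            action = max(value, key=value.get)    # exploit: best path so far
        reward = REWARDS[action]                  # environment gives feedback
        total_reward += reward
        # Move the estimate for the chosen path toward the observed reward.
        value[action] += alpha * (reward - value[action])
    return value, total_reward


value, total_reward = train()
best_path = max(value, key=value.get)
print("learned values:", value)
print("preferred path:", best_path)
```

After training, the estimate for the fire path is never above zero and the agent prefers the water path, which mirrors the behaviour described above.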
A successful learning system must be designed properly and effectively.
The following steps are involved in designing a learning system:
1. Choosing the Training Experience
2. Choosing the Target Function
3. Choosing a Representation for the Target Function
4. Choosing a Function Approximation Algorithm
5. The Final Design
Step 1. Choosing the Training Experience
(i) Type of training experience, i.e. direct or indirect
(ii) Degree to which the learner can control the sequence of
training examples
(iii) How well the training experience represents the distribution
of examples over which the final system performance P must be measured
13-03-2024 43
E.g. learning to drive: with a trainer's help, or completely on one's own
The notation ChooseMove : B -> M indicates that this function
accepts as input any board from the set of legal board
states B and produces as output some move from the
set of legal moves M.
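The signature ChooseMove : B -> M can be written down as a typed function. The sketch below is only an illustration under stated assumptions: the 9-cell board, the `legal_moves` helper, and the `prefer_center` scoring rule are hypothetical stand-ins, not the game or the evaluation function used in these notes.

```python
from typing import Callable, List, Tuple

Board = Tuple[str, ...]   # one element of B (hypothetical 9-cell board)
Move = int                # one element of M (index of the cell to play)


def legal_moves(board: Board) -> List[Move]:
    """Return the legal moves for a board: here, the indices of empty cells."""
    return [i for i, cell in enumerate(board) if cell == " "]


def choose_move(board: Board, score: Callable[[Board, Move], float]) -> Move:
    """ChooseMove : B -> M, picking the legal move with the highest score."""
    moves = legal_moves(board)
    if not moves:
        raise ValueError("no legal moves on this board")
    return max(moves, key=lambda m: score(board, m))


def prefer_center(board: Board, move: Move) -> float:
    """Hypothetical scoring rule used only to make the example runnable."""
    return 1.0 if move == 4 else 0.0


board: Board = ("X", " ", " ",
                " ", " ", "O",
                " ", " ", " ")
move = choose_move(board, prefer_center)
print("chosen move:", move)
```

The scoring function is passed in separately to show that choosing a move reduces to evaluating the candidates, which is what the later design steps are about.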
LIST-THEN-ELIMINATE General-to-Specific
Bottom-up Negative
A hypothesis is an idea or tentative theory that the researcher sets as the goal of a study and examines;
if the study's conclusion supports the hypothesis, it is accepted as a theory.
A hypothesis (plural: hypotheses), in a scientific context, is a testable statement about the relationship
between two or more variables or a proposed explanation for some observed phenomenon.
Hypothesis testing is the core of the scientific method.
A statistical hypothesis is an explanation about the relationship between data populations that is interpreted
probabilistically.
A machine learning hypothesis is a candidate model that approximates a target function for mapping
inputs to outputs.
Example:
If you increase the duration of light, (then) corn plants will grow more each day.
The hypothesis establishes two variables: the length of light exposure and the rate of plant
growth. An experiment could be designed to test whether the rate of growth depends on
the duration of light.
c(x) = Target Function
h(x) = hypothesis
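The relationship between c(x) and h(x) can be made concrete with a small check of whether a hypothesis agrees with the target function on the training examples. Both functions below, and the attribute names `sky` and `wind`, are invented for illustration; in practice c is unknown and only its values on the training examples are available.

```python
def c(x):
    """Target function (hypothetical; unknown to the learner in practice)."""
    return x["sky"] == "sunny" and x["wind"] == "strong"


def h(x):
    """Candidate hypothesis (hypothetical): it ignores the wind attribute."""
    return x["sky"] == "sunny"


instances = [
    {"sky": "sunny", "wind": "strong"},
    {"sky": "rainy", "wind": "strong"},
    {"sky": "sunny", "wind": "weak"},
]

# Training examples are pairs (x, c(x)).
training_data = [(x, c(x)) for x in instances]


def is_consistent(hypothesis, examples):
    """A hypothesis is consistent if it equals c(x) on every example."""
    return all(hypothesis(x) == label for x, label in examples)


errors = sum(h(x) != label for x, label in training_data)
print("h consistent with training data:", is_consistent(h, training_data))
print("examples where h(x) != c(x):", errors)
```

Here h disagrees with c on the third instance, so it is not consistent with the training data and a learner would look for a better candidate.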
Internal Node
Terminal/Leaf Node
Splitting of Branches:
2 – two Splitting
3 – three Splitting
3 or more – Multi way Splitting
This tree classifies Saturday mornings according to whether or not they are
suitable for playing tennis.
For example, the above instance would be sorted down the leftmost branch of this decision
tree and would therefore be classified as a negative instance (i.e., the tree predicts that
PlayTennis = no)
Decision trees represent a disjunction of conjunctions of constraints on the attribute values of
instances.
Each path from the tree root to a leaf corresponds to a conjunction of attribute tests, and the
tree itself to a disjunction of these conjunctions.
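The equivalence between a tree and a disjunction of conjunctions can be checked directly. The tree figure is not reproduced in these notes, so the sketch below assumes the standard textbook PlayTennis tree (Outlook at the root, Humidity tested under Sunny, Wind tested under Rain); the attribute values and the example instance are likewise assumptions based on that textbook example.

```python
from itertools import product


def play_tennis_tree(x):
    """Assumed PlayTennis tree, written as nested attribute tests."""
    if x["Outlook"] == "Sunny":
        return x["Humidity"] == "Normal"
    if x["Outlook"] == "Overcast":
        return True
    if x["Outlook"] == "Rain":
        return x["Wind"] == "Weak"
    raise ValueError("unknown Outlook value")


def play_tennis_dnf(x):
    """Same tree as a disjunction of conjunctions: one conjunction per 'yes' path."""
    return (
        (x["Outlook"] == "Sunny" and x["Humidity"] == "Normal")
        or x["Outlook"] == "Overcast"
        or (x["Outlook"] == "Rain" and x["Wind"] == "Weak")
    )


# Compare both forms on every combination of the attribute values.
all_instances = [
    {"Outlook": o, "Humidity": hum, "Wind": w}
    for o, hum, w in product(
        ["Sunny", "Overcast", "Rain"], ["High", "Normal"], ["Weak", "Strong"]
    )
]
forms_agree = all(play_tennis_tree(x) == play_tennis_dnf(x) for x in all_instances)

instance = {"Outlook": "Sunny", "Humidity": "High", "Wind": "Strong"}
print("forms agree on all instances:", forms_agree)
print("PlayTennis for the example instance:", "yes" if play_tennis_tree(instance) else "no")
```

Each `and` clause corresponds to one root-to-leaf path ending in a "yes" leaf, and the `or` joins those paths into the whole tree.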
Decision trees are best suited to problems with the following characteristics: