Internet ....

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 15

STUDENT MARKS EVALUATION

PROJECT INTRODUCTION

 The advent of information technology in various fields has lead the


large volumes of data storage in various formats like records, files,
documents, images, sound, videos, scientific data and many new
data formats. The data collected from different applications require
proper method of extracting knowledge from large repositories for
better decision making. Knowledge discovery in databases (KDD),
often called data mining, aims at the discovery of useful information
from large collections of data [1]. The main functions of data mining
are applying various methods and algorithms in order to discover
and extract patterns of stored data [2]. Data mining and knowledge
discovery applications have got a rich focus due to its significance in
decision making and it has become an essential component in
various organizations. Data mining techniques have been
introduced into new fields of Statistics, Databases, Machine
Learning, Pattern Reorganization, Artificial Intelligence and
Computation capabilities etc.
PROJECT INFORMATION

 Introduction to Machine Learning:Machine learning


is a field of computer science that gives computers
the ability to learn without being explicitly
programmed.
 PYTHON:Python is a high-level, interpreted,
interactive and object-oriented scripting language.
Python is designed to be highly readable. It uses
English keywords frequently where as other
languages use punctuation, and has fewer syntactical
const
MODEL INTRODUCTION

 DATA MINING DEFINITION AND TECHNIQUES Data mining, also popularly known as
Knowledge Discovery in Database, refers to extracting or “mining" knowledge from large
amounts of data. Data mining techniques are used to operate on large volumes of data to
discover hidden patterns and relationships helpful in decision making. While data
mining and knowledge discovery in database are frequently treated as synonyms, data
mining is actually part of the knowledge discovery process. The sequences of steps
identified in extracting knowledge from data are shown in

 Various algorithms and techniques like Classification, Clustering, Regression,


Artificial Intelligence, Neural Networks, Association Rules, Decision Trees,
Genetic Algorithm, Nearest Neighbor method etc., are used for knowledge
discovery from databases. These techniques and methods in data mining need
brief mention to have better understanding
1. Classification

2. Clustering

3. Predication

4. Association rule

5. Neural networks

6. Decision Trees

7. Nearest Neighbor Method


DATA SET COLLECTION
MODEL IMPLEMENTATION

 Linear Regression
 The term "linearity" in algebra refers to a linear
relationship between two or more variables. If we draw
this relationship in a two dimensional space (between
two variables, in this case), we get a straight line.
 to determine the linear relationship between the
numbers of hours a student studies and the percentage of
marks that student scores in an exam. We want to find
out that given the number of hours a student prepares for
a test, about how high of a score can the student achieve?
If we plot the independent variable (hours) on the x-axis
and dependent variable (percentage) on the y-axis, linear
regression gives us a straight line that best fits the data
points,
LINEAR
REGRESSION
1. y = mx + b

2. Where b is the intercept


and m is the slope of the
line.

3. The y and xvariables


remain the same, since
they are the data features
and cannot be changed.
The values that we can
control are the intercept
and slope. There can be
multiple straight lines
depending upon the values
of intercept and slope.

4. A regression model involving


multiple variables can be
represented as:

y = b0 + m1b1 + m2b2 + m3b3 +


... ... mnbn
Linear Regression with Python
Scikit Learn

1. Simple Linear Regression: In this


regression task we will predict the
percentage of marks that a student is
expected to score based upon the number
of hours they studied. This is a simple
linear regression task as it involves just
two variables.

2. Importing Libraries

To import necessary libraries for this task,


execute the following import statements:

import pandas as pd

import numpy as np

import matplotlib.pyplot as
plt%matplotlib inline

3. Dataset

dataset=pd.read_csv('student_scores.csv'
)

dataset.head()
DATASET
To see statistical details
of the dataset, we can
use describe():

dataset.describe()
DATASET
1.

dataset.plot(x='Hours',
y='Scores', style='o')

plt.title('Hours vs
Percentage')
plt.xlabel('Hours
Studied')
plt.ylabel('Percentage
Score') plt.show()
Evaluating the Algorithm
MAKING PREDICTIONS
MATPLOTLIB
THANK YOU

You might also like