Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 26

PROJECT First Review REPORT ON

Performances Analysis on Student Feedback Using Machine Learning Algorithms

MASTER OF COMPUTER APPLICATIONS

Presented by:

M. ANAND RAO (22091F0004)

Under the Guidance of

Mr.M.VINAY KUMAR, M.Tech, (Ph.D)


Assistant Professor, Dept. of MCA

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

RAJEEV GANDHI MEMORIAL COLLEGE OF ENGINEERING & TECHNOLOGY


(AUTONOMOUS)
AFFILIATED TO J.N.T UNIVERSITY ANANTAPUR. ACCREDITED BY NBA (TIER-1)
& NAAC OF UGC. NEW DELHI, WITH A+ GRADE RECOGNIZED UGC-DDU
KAUSHAL KENDRA NANDYAL-518501, (Estd-1995)
YEAR: 2024
RAJEEV GANDHI MEMORIAL COLLEGE OF ENGINEERING &
TECHNOLOGY
(Autonomous)
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

Project Title : Performances Analysis on Student


Feedback Using Machine Learning
Algorithms

Name of the Student : M. ANAND RAO

Reg. No : 22091F0004

Name of the Guide : Mr.M.VINAY KUMAR, M.Tech,(Ph.D)

Review : First

Signature of the Student Signature of the Guide

PRC Members Signature’s:

1.

2.

3.
Performance Analysis On Student Feedback Using Machine Learning Algorithms

ABSTRACT

It investigates opinion mining by means of supervised learning techniques to search out the
emotion of the student input bolstered characterized choices of instructing and learning. The
examination led includes the apparatus of a blend of AI and common language preparing
systems on understudy input data accumulated from module investigation overview
consequences of VR Siddhartha Engineering College, Vijayawada. Additionally, to offer a
grade by grade clarification of the technique of accomplishment of opinion mining on or after
scholar remarks using the open source tool Python, the work additionally offers a comparative
overall performance take a look remarks supported, extracted alternatives like examination,
teaching and so on. The consequences are as compared to be trying to find out higher overall
performance with relevance several evaluation standards designed for the various techniques.

1. INTRODUCTION

DEPT OF MCA RGMCET NANDYAL 1


Performance Analysis On Student Feedback Using Machine Learning Algorithms

The total of information that whichever organization within the humankind managing
is huge. This “Big Data” is often characterizing by three V’s i.e. volume, velocity, and
selection. It is usually a lot of concerning of these three dimensions coming back
along than considering any of them on an individual basis. The existence of enormous
amount of ordered records much like grades, conscription information, and
development quotes likewise as unstructured in order like scholar opinions articulated
via surveys, it will become time and useful source overwhelming to précis the facts by
hand to gain information lead conclusions and alternatives.
Studying analytics can be discipline of information analytics to procedures,
analyzes, reviews and predicts facts regarding learners for the aim of optimizing coaching
and studying. On the opposite side, it is essential to recognize the outline generated by
means of the information like scholar feedback to correctly develop the overall routine of
the established order and to form plans to reinforce institutions’ coaching and mastering
understanding. From among the numerous systems for aggregation pupil comments,
surveys contain a vital position and most of the establishments commence surveys in
several paperwork.
The focal point is on utilizing Opinion Mining approach for categorizing the
student’s feedback received all through component estimate survey that is accomplished
each semester to understand comments of scholars in regards to several options of
coaching and knowledge like module, teaching, assessments, and so forth. The mined and
preprocessed datasets be subjected to several supervised opinion mining method like aid
Vector gadget (SVM), Naive Bayes (NB), Artificial Neural Networks (NN) applied the
usage of Python, the open supply device accessible for opinion mining. The proportional
effectiveness of the algorithms in the selected utility circumstance is evaluated
exploitation accuracy; take into account and accuracy measures.

Accurateness is outlined because of the quantitative relation of entire


classifications that place unit specifically to the entire style of knowledge set. Accuracy is
that the quantitative relation of true positives toward the whole type of positives that
vicinity unit foretold while keeping in mind is that the quantitative relation of actual
positives by means of the complete positives inside the dataset.

2. LITERATURE REVIEW

DEPT OF MCA RGMCET NANDYAL 2


Performance Analysis On Student Feedback Using Machine Learning Algorithms

The paper spotlight on victimization Opinion Mining method for classifying the
student's comments acquired during module evaluation survey. The mined and
pre-processed datasets have been based to numerous supervised evaluations
taking out rule like aid Vector Naïve Bayes (NB), Nearest Neighbour (KNN) and
Neural Networks (NN) enforced [1].

It is visible currently that there is a rise of expertise availableness, the alleged


statistics deluge, controlled by accomplice inflated amount of electronic motion
accomplished, and additionally the revolutionary pervasive attain of IT
altogether gadgets. the number one of those developments is that the supposed
open statistics association, distinguished by the manner that the complete method
throughout European and also the united states, governments area unit more and
more e-book their information repositories for humans to admittance and use it
any other pattern issues the inconceivable amount of knowledge is formed handy
by electorate through participatory sensing": commonplace play a practical
position in booklet commentary and grumbling online, and increasingly more
make use of novelty to report further in sequence[2].

The paper projected a web feedback system that is concerning automating the
method of recording student’s feedback. The projected system collects the
feedback submitted by students and so classifies them as positive feedbacks or
negative feedbacks victimization SVM classifier. Then, it generates a
performance outline of a coach for the themes he or she schooled there in
academic term employing a delimited account rule. In my view, the projected
system may be helpful for the educational establishments and may additionally
facilitate academics to know their teaching performance in an exceedingly
summarized manner. This projected a web feedback system which might record
the student’s feedback and analyse the teacher's performance supported opinion
mining victimization SVM classifier and so summarizes the teacher performance
supported delimited outline rule i.e. count frequency of positive and negative
reviews[3].
The work aims to dig deeper into the feedback information of an establishment. Presently the
feedback information is employed to report solely the performance of the teacher. The paper
proposes ways to research the feedback information victimization data processing techniques
for a higher understanding of the college, course, and student. The format of feedback varies

DEPT OF MCA RGMCET NANDYAL 3


Performance Analysis On Student Feedback Using Machine Learning Algorithms

from establishment to establishment, thus there cannot be a general technique which will
appropriate all. The feedback information from the scholars is analysed by victimization
completely different data processing techniques. The feedback information is used for
analysing all the parameters thought of for feedback which might facilitate management in
creating policy choices in teaching learning method. This Paper surveys all data processing
technique that is applied for analysing feedback information [4].

A collection of associated information including disconnect data fundamentals stored,


retrieved, or else organized and indulgence as a unit, i.e., folder. For our project, we are using
a comma-separated value data structure. Datasets are of two types’ linear dataset and non-
linear dataset. The linear dataset is the one which is having equal properties whereas the
nonlinear dataset is the one which is having non-equal properties. Machine learning works
well for linear datasets. The datasets contain reviews made by students. This dataset contains
30,000 reviews with their student _id, student_ name, review, and their emotion. The review
contains textual input about the college.

3. ANALYSIS
3.1 EXISTINGSYSTEM
• The paper spotlight on victimization Opinion Mining method for classifying the
student's comments acquired during module evaluation survey. The mined and

DEPT OF MCA RGMCET NANDYAL 4


Performance Analysis On Student Feedback Using Machine Learning Algorithms

preprocessed datasets have been based to numerous supervised evaluations taking out
rule like aid Vector Naïve Bayes (NB), Nearest Neighbour (KNN) and Neural
Networks (NN) enforced [1].
• It is visible currently that there is a rise of expertise availableness, the alleged
statistics deluge, controlled by accomplice inflated amount of electronic motion
accomplished, and additionally the revolutionary pervasive attain of IT altogether
gadgets. the number one of those developments is that the supposed open statistics
association, distinguished by the manner that the complete method throughout
European and also the united states, governments area unit more and more e-book
their information repositories for humans to admittance and use it any other pattern
issues the inconceivable amount of knowledge is formed handy by electorate through
participatory sensing": commonplace play a practical position in booklet commentary
and grumbling online, and increasingly more make use of novelty to report further in
sequence.[2]
• The paper projected a web feedback system that is concerning automating the method
of recording student’s feedback. The projected system collects the feedback submitted
by students and so classifies them as positive feedbacks or negative feedbacks
victimization SVM classifier. Then, it generates a performance outline of a coach for
the themes he or she schooled there in academic term employing a delimited account
rule. In my view, the projected system may be helpful for the educational
establishments and may additionally facilitate academics to know their teaching
performance in an exceedingly summarized manner. This projected a web feedback
system which might record the student’s feedback and analyse the teacher's
performance supported opinion mining victimization SVM classifier and so
summarizes the teacher performance supported delimited outline rule i.e. count
frequency of positive and negative reviews.[3]
• The work aims to dig deeper into the feedback information of an establishment.
Presently the feedback information is employed to report solely the performance of
the teacher. The paper proposes ways to research the feedback information
victimization data processing techniques for a higher understanding of the college,
course, and student. The format of feedback varies from establishment to
establishment, thus there cannot be a general technique which will appropriate all. The
feedback information from the scholars is analysed by victimization completely
different data processing techniques. The feedback information is used for analysing
DEPT OF MCA RGMCET NANDYAL 5
Performance Analysis On Student Feedback Using Machine Learning Algorithms

all the parameters thought of for feedback which might facilitate management in
creating policy choices in teaching-learning method. This Paper surveys all data
processing technique that is applied for analysing feedback information.[4]

• 3.1.1 Disadvantages
• In the existing work, the system does not calculate large amount of data sets.
• This system is less performance due to lack of learning, Natural Language Processing.

3.2 Proposed System


• The data is collected from the students in the form of the feedback of the college. The
feedback consists of the textual reviews. Then the data pre-processing is done on the
data that is collected. Later the Opinion classification is performed on the dataset and
the results are obtained.
• The review contains textual input about the college. Pre-processing is the progression
of concentrated effort the data from redundant elements. It enlarges the accurateness
of the results by dropping errors in the data. Not by means of pre-processing, such as
enchantment corrections, may lead the system to disregard important words.
Preprocessing and concentrated effort of data are one of the most important tasks that
must be one before dataset be able to be used for machine learning. The real-world
statistics is strident, incomplete and incompatible. So, it is necessary to be cleaned.
There are many general pre-processing techniques, of which the majority common is:
tokenization, convert text to lower or upper case, eliminate punctuation, take away
numbers, take out repeated letters, and get rid of stop words, stemming and negation.
The data that is obtained after the pre-processing is done is given to the Machine
learning algorithm.

• Later the data is skilled through the machine learning algorithms. The built Machine
learning models present the confusion matrix and the accurateness of the models. The
dealing out time for running this module is also demonstrated. The output of the
algorithm is that it results in a new column which shows the predicted emotion of the
review.
• A collection of associated information including disconnect data fundamentals stored,
retrieved, or else organized and indulgence as a unit, i.e., folder. For our project, we
DEPT OF MCA RGMCET NANDYAL 6
Performance Analysis On Student Feedback Using Machine Learning Algorithms

are using a comma-separated value data structure. Datasets are of two types’ linear
dataset of two types’ linear dataset and non-linear dataset. The linear dataset is the one
which is having equal properties whereas the non-linear dataset is the one which is
having non-equal properties. Machine learning works well for linear datasets. The
datasets contain reviews made by students. This dataset contains 30,000 reviews with
their studentfiid, student_ name, review, and their emotion.

• 3.1.2. Advantages
• Accurateness is outlined because of the quantitative relation of entire classifications
that place unit specifically to the entire style of knowledge set.
• The proposed system projected a web feedback system that is concerning automating
the method of recording student’s feedback.

4. SYSTEM MODELS

DEPT OF MCA RGMCET NANDYAL 7


Performance Analysis On Student Feedback Using Machine Learning Algorithms

4.1 SDLC (Umbrella Model):

SDLC is nothing but Software Development Life Cycle. It is a standard which is used
by software industry to develop good software.

Stages in SDLC:

Requirement Gathering
Analysis
Designing
Coding
Testing
Maintenance

4.1.1 Requirements Gathering stage:

The requirements gathering process takes as its input the goals identified in the high
level requirements section of the project plan. Each goal will be refined into a set of one or
more requirements. These requirements define the major functions of the intended
application, define operational data areas and reference data areas, and define the initial data
DEPT OF MCA RGMCET NANDYAL 8
Performance Analysis On Student Feedback Using Machine Learning Algorithms

entities. Major functions include critical processes to be managed, as well as mission critical
inputs, outputs and reports. A user class hierarchy is developed and associated with these
major functions, data areas, and data entities. Each of these definitions is termed a
Requirement. Requirements are identified by unique requirement identifiers and, at minimum,
contain a requirement title and textual description.

These requirements are fully described in the primary deliverables for this stage: the
Requirements Document and the Requirements Traceability Matrix (RTM). The requirements
document contains complete descriptions of each requirement, including diagrams and
references to external documents as necessary. Note that detailed listings of database tables
and fields are not included in the requirements document.

The title of each requirement is also placed into the first version of the RTM, along
with the title of each goal from the project plan. The purpose of the RTM is to show that the
product components developed during each stage of the software development lifecycle are
formally connected to the components developed in prior stages.
In the requirements stage, the RTM consists of a list of high-level requirements, or
goals, by title, with a listing of associated requirements for each goal, listed by requirement
title. In this hierarchical listing, the RTM shows that each requirement developed during this
stage is formally linked to a specific product goal. In this format, each requirement can be
traced to a specific product goal, hence the term requirements traceability.

The outputs of the requirements definition stage include the requirements document, the
RTM, and an updated project plan.

DEPT OF MCA RGMCET NANDYAL 9


Performance Analysis On Student Feedback Using Machine Learning Algorithms

• Feasibility study is all about identification of problems in a project.

• No. of staff required to handle a project is represented as Team Formation, in this case
only modules are individual tasks will be assigned to employees who are working for
that project.

• Project Specifications are all about representing of various possible inputs submitting
to the server and corresponding outputs along with reports maintained by
administrator.

4.1.2 Analysis Stage:

The planning stage establishes a bird's eye view of the intended software product, and
uses this to establish the basic project structure, evaluate feasibility and risks associated with
the project, and describe appropriate management and technical approaches.

DEPT OF MCA RGMCET NANDYAL 10


Performance Analysis On Student Feedback Using Machine Learning Algorithms

The most critical section of the project plan is a listing of high-level product
requirements, also referred to as goals. All of the software product requirements to be
developed during the

Requirements definition stage flow from one or more of these goals. The minimum
information for each goal consists of a title and textual description, although additional
information and references to external documents may be included. The outputs of the project
planning stage are the configuration management plan, the quality assurance plan, and the

DEPT OF MCA RGMCET NANDYAL 11


Performance Analysis On Student Feedback Using Machine Learning Algorithms

project plan and schedule, with a detailed listing of scheduled activities for the upcoming
Requirements stage, and high level estimates of effort for the out stages.

4.1.3 Designing Stage:

The design stage takes as its initial input the requirements identified in the approved
requirements document. For each requirement, a set of one or more design elements will be
produced as a result of interviews, workshops, and/or prototype efforts. Design elements
describe the desired software features in detail, and generally include functional hierarchy
diagrams, screen layout diagrams, tables of business rules, business process diagrams, pseudo
code, and a complete entity-relationship diagram with a full data dictionary. These design
elements are intended to describe the software in sufficient detail that skilled programmers
may develop the software with minimal additional input.

4.1.4 Development (Coding) Stage:

The development stage takes as its primary input the design elements described in the
approved design document. For each design element, a set of one or more software artefacts
will be produced. Software artefacts include but are not limited to menus, dialogs, and data
management forms, data reporting formats, and specialized procedures and functions.
Appropriate test cases will be developed for each set of functionally related software artefacts,
and an online help system will be developed to guide users in their interactions with the
software.

DEPT OF MCA RGMCET NANDYAL 12


Performance Analysis On Student Feedback Using Machine Learning Algorithms

The RTM will be updated to show that each developed artefact is linked to a specific
design element, and that each developed artefact has one or more corresponding test case
items. At this point, the RTM is in its final configuration. The outputs of the development
stage include a fully functional set of software that satisfies the requirements and design
elements previously documented, an online help system that describes the operation of the
software, an implementation map that identifies the primary code entry points for all major
system functions, a test plan that describes the test cases to be used to validate the correctness
and completeness of the software, an updated RTM, and an updated project plan.

4.1.5 Integration & Test Stage:

During the integration and test stage, the software artefacts, online help, and test data
are migrated from the development environment to a separate test environment. At this point,
all test cases are run to verify the correctness and completeness of the software. Successful
execution of the test suite confirms a robust and complete migration capability. During this
stage, reference data is finalized for production use and production users are identified and
linked to their appropriate roles. The final reference data (or links to reference data source
files) and production user list are compiled into the Production Initiation Plan.

DEPT OF MCA RGMCET NANDYAL 13


Performance Analysis On Student Feedback Using Machine Learning Algorithms

The outputs of the integration and test stage include an integrated set of software, an
online help system, an implementation map, a production initiation plan that describes
reference data and production users, an acceptance plan which contains the final suite of test
cases, and an updated project plan.

4.1.6 Installation & Acceptance Test:

During the installation and acceptance stage, the software artefacts, online help, and
initial production data are loaded onto the production server. At this point, all test cases are
run to verify the correctness and completeness of the software. Successful execution of the
test suite is a prerequisite to acceptance of the software by the customer.

After customer personnel have verified that the initial production data load is correct
and the test suite has been executed with satisfactory results, the customer formally accepts
the delivery of the software.

DEPT OF MCA RGMCET NANDYAL 14


Performance Analysis On Student Feedback Using Machine Learning Algorithms

The primary outputs of the installation and acceptance stage include a production
application, a completed acceptance test suite, and a memorandum of customer acceptance of
the software. Finally, the PDR enters the last of the actual labour data into the project
schedule and locks the project as a permanent project record. At this point the PDR "locks"
the project by archiving all software items, the implementation map, the source code, and the
documentation for future reference.

4.2 UML DIAGRAMS:

UML stands for Unified Modelling Language. UML is a standardized general-purpose


modelling language in the field of object-oriented software engineering. The standard is
managed, and was created by, the Object Management Group.
The goal is for UML to become a common language for creating models of
objectoriented computer software. In its current form UML is comprised of two major
DEPT OF MCA RGMCET NANDYAL 15
Performance Analysis On Student Feedback Using Machine Learning Algorithms

components: a Meta-model and a notation. In the future, some form of method or process may
also be added to; or associated with, UML.
The Unified Modelling Language is a standard language for specifying, Visualization,
Constructing and documenting the artefacts of software system, as well as for business
modelling and other non-software systems.
The UML represents a collection of best engineering practices that have proven
successful in the modelling of large and complex systems.
The UML is a very important part of developing objects-oriented software and the
software development process. The UML uses mostly graphical notations to express the
design of software projects.
GOALS:

The Primary goals in the design of the UML are as follows:


1. Provide users a ready-to-use, expressive visual modelling Language so that they can develop
and exchange meaningful models.
2. Provide extensibility and specialization mechanisms to extend the core concepts.
3. Be independent of particular programming languages and development process.
4. Provide a formal basis for understanding the modelling language.
5. Encourage the growth of object oriented tools market.
6. Support higher level development concepts such as collaborations, frameworks, patterns and
components.

4.3 USE CASE DIAGRAM:


A use case diagram in the Unified Modelling Language (UML) is a type of
behavioural diagram defined by and created from a Use-case analysis. Its purpose is to present
a graphical overview of the functionality provided by a system in terms of actors, their goals
(represented as use cases), and any dependencies between those use cases. The main purpose
of a use case diagram is to show what system functions are performed for which actor. Roles
of the actors in the system can be depicted.

DEPT OF MCA RGMCET NANDYAL 16


Performance Analysis On Student Feedback Using Machine Learning Algorithms

Fig1: Use case Diagram

DEPT OF MCA RGMCET NANDYAL 17


Performance Analysis On Student Feedback Using Machine Learning Algorithms

4.4 CLASS DIAGRAM:


In software engineering, a class diagram in the Unified Modeling Language (UML) is
a type of static structure diagram that describes the structure of a system by showing the
system's classes, their attributes, operations (or methods), and the relationships among the
classes.

Fig2: Class Diagram

DEPT OF MCA RGMCET NANDYAL 18


Performance Analysis On Student Feedback Using Machine Learning Algorithms

4.5 SEQUENCE DIAGRAM:

A sequence diagram in Unified Modeling Language (UML) is a kind of interaction


diagram that shows how processes operate with one another and in what order. It is a
construct of a Message Sequence Chart. Sequence diagrams are sometimes called event
diagrams, event scenarios, and timing diagram.

Fig 3: Sequence diagram

DEPT OF MCA RGMCET NANDYAL 19


Performance Analysis On Student Feedback Using Machine Learning Algorithms

4.6 ACTIVITY DIAGRAM:

Activity diagram is another important behavioural diagram in UMLdiagram to


describe dynamic aspects of the system. Activity diagram is essentially an advanced version
of flow chart that modelling the flow from one activity to another activity.

Fig 4: Activity Diagram

4.7 INPUT AND OUTPUT DESIGN

DEPT OF MCA RGMCET NANDYAL 20


Performance Analysis On Student Feedback Using Machine Learning Algorithms

4.7.1 INPUT DESIGN

The input design is the link between the information system and the user. It
comprises the developing specification and procedures for data preparation and those steps
are necessary to put transaction data in to a usable form for processing can be achieved by
inspecting the computer to read data from a written or printed document or it can occur by
having people keying the data directly into the system. The design of input focuses on
controlling the amount of input required, controlling the errors, avoiding delay, avoiding extra
steps and keeping the process simple. The input is designed in such a way so that it provides
security and ease of use with retaining the privacy. Input Design considered the following
things:

 What data should be given as input?


 How the data should be arranged or coded?
 The dialog to guide the operating personnel in providing input.
 Methods for preparing input validations and steps to follow when error occur.

4.7.2 OBJECTIVES

1. Input Design is the process of converting a user-oriented description of the


input into a computer-based system. This design is important to avoid errors in the
data input process and show the correct direction to the management for getting
correct information from the computerized system.

2. It is achieved by creating user-friendly screens for the data entry to handle


large volume of data. The goal of designing input is to make data entry easier and to
be free from errors. The data entry screen is designed in such a way that all the data
manipulates can be performed. It also provides record viewing facilities.

3. When the data is entered it will check for its validity. Data can be entered
with the help of screens. Appropriate messages are provided as when needed so that
the user will not be in maize of instant. Thus the objective of input design is to create
an input layout that is easy to follow

4.8 OUTPUT DESIGN

DEPT OF MCA RGMCET NANDYAL 21


Performance Analysis On Student Feedback Using Machine Learning Algorithms

A quality output is one, which meets the requirements of the end user and
presents the information clearly. In any system results of processing are communicated to the
users and to other system through outputs. In output design it is determined how the
information is to be displaced for immediate need and also the hard copy output. It is the most
important and direct source information to the user. Efficient and intelligent output design
improves the system’s relationship to help user decision-making.

1. Designing computer output should proceed in an organized, well thought out


manner; the right output must be developed while ensuring that each output
element is designed so that people will find the system can use easily and
effectively. When analysis design computer output, they should Identify the
specific output that is needed to meet the requirements.

2. Select systems for presenting information.

3. Create document, report, or other formats that contain information produced by


the system.

The output form of an information system should accomplish one or more of the
following objectives.

• Convey information about past activities, current status or projections of the


• Future.
• Signal important events, opportunities, problems, or warnings.
• Trigger an action.
• Confirm an action.

4.9 SYSTEM SPECIFICATION:

4.9.1 H/W System Configuration: -

DEPT OF MCA RGMCET NANDYAL 22


Performance Analysis On Student Feedback Using Machine Learning Algorithms

➢ Processor - Pentium–IV

➢ RAM - 4 GB (min)

➢ Hard Disk - 20 GB

➢ Keyboard - Standard Windows Keyboard

➢ Mouse - Two or Three Button Mouse

➢ Monitor - SVGA

4.9.2 SOFTWARE REQUIREMENTS:


 Operating system : Windows 7 Ultimate.

 Coding Language : Python.

 Front-End : Python.

DEPT OF MCA RGMCET NANDYAL 23

You might also like