Movie Recommendation System: Using Machine Learning

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 7

Movie Recommendation System

Using Machine Learning

Project Synopsis
Of Minor Project

BACHELOR OF TECHNOLOGY
Electronics and Communication Branch

Submitted By Guided by

➢ Yash Kumar . ➢ Dr. Reema Buddhiraja


19102145 (A5) (Internal Supervisor)

➢ Priyanshu Jain
19102148 (A5)

Jaypee Institute of Information Technology


Noida
Introduction

The evolution of technology brings us many advanced platforms such as Machine


Learning, Deep Learning, Data Mining, the Internet of Things (IoT), etc. As we
probably are aware, the world is becoming quicker and everybody is moving
towards accomplishing their objectives. Individuals need more time to go to the
market and purchase things, not simply that, they don't have the opportunity to
pick between things. What's more, this has prompted the innovation of
recommendation systems. Recommendation systems have become well known
nowadays, be it in the field of entertainment, education, etc. Earlier, the users
needed to settle on choices on what books to purchase, what music to tune in to,
what motion pictures to watch and so on.

Commercial movie libraries effectively exceed 15 million films, which


boundlessly exceeds the visual ability of any single individual. With a large
number of motion pictures to browse, individuals now and then get overpowered.
Therefore, an efficient recommendation system is necessary for the enthusiasm
of both movie service providers and customers. With the improvement of
recommendation systems, the customers will have no agony in settling on choices
and organizations can keep up their client gathering and draw in new clients by
improving users' satisfaction. Additionally, nowadays the modern technologies
like machine learning and deep learning also plays a vital role in the process
flexible technologies for day-to-day operations. In this manuscript, we discuss
about the recommendation by using machine learning. Now, we discuss a
method that has been previously implemented.

What is Data Science?


“The ability to take data — to be able to understand it, to process it, to extract
value from it, to visualize it, to communicate it — that’s going to be a hugely
important skill in the next decades.”

What is Machine Learning?


Machine learning is an application of artificial intelligence (AI) that provides
systems the ability to automatically learn and improve from experience without
being explicitly programmed. Machine learning focuses on the development of
computer programs that can access data and use it to learn for themselves.

The process of learning begins with observations or data, such as examples,


direct experience, or instruction, in order to look for patterns in data and make
better decisions in the future based on the examples that we provide. The
primary aim is to allow the computers learn automatically without human
intervention or assistance and adjust actions accordingly.
What is recommendation system?
Recommender systems are the systems
that are designed to recommend things to
the user based on many different factors.
These systems predict the most likely
product that the users are most likely to
purchase and are of interest to. Companies
like Netflix, Amazon, etc.

Types of recommendation system:

User-Based Collaborative Filtering


User-Based Collaborative Filtering (UBCF ) is a technique used to predict the
items that a user might like on the basis of ratings given to that item by the
other users who have similar taste with that of the target user.
Many websites use collaborative filtering for building their recommendation
system.

In general UBCF approach, k-nearest neighbour (k-NN) approach is proved to be


the best method to generate a neighbourhood, which picks the k most similar
(nearest) users from candidate neighbour to comprise the neighbourhood for an
active user (Herlocker and Konstan 2002). So we consider the k-NN UBCF
approach as the traditional UBCF approach in the rest of this paper.
K-NN Algorithm
The K-Nearest Neighbor Algorithm assumes that similar things exist in close
proximity. In other words, similar things are near to each other.

Notice in the image above that “Birds of a feather flock together.”


most of the time, similar
data points are close to each
other. The KNN algorithm
hinges on this assumption
being true enough for the
algorithm to be useful. KNN
captures the idea of
similarity (sometimes called
distance, proximity, or
closeness) with some
mathematics we might have
learned in our childhood—
calculating the distance
between points on a graph.

There are other ways of calculating distance, and one way might be preferable
depending on the problem we are solving. However, the straight-line distance
(also called the Euclidean distance) is a popular and familiar choice.

The KNN Algorithm


1. Load the data
2. Initialize K to your chosen number of neighbours

Advantages
1. The algorithm is simple and easy to implement.
2. There’s no need to build a model, tune several parameters, or make
additional assumptions.
3. The algorithm is versatile. It can be used for classification, regression, and
search (as we will see in the next section).

Disadvantages
1. The algorithm gets significantly slower as the number of examples and/or
predictors/independent variables increase.
Methodologies
This contains a series of steps and the methodology of the proposed system. How
the system is going to operate, and events that are going to occur is briefly
explained in Table. And with the help of a flowchart as shown below.

Collaborative filtering works based on users that have similar tastes. In Table
below, since users A and B have given similar ratings to ‘Reggae’ they both are
considered users having similar likes and dislikes. A has rated 4 for ‘Trance’, so
next time when user B requests for a recommendation, the system will
recommend ‘Trance’ to B since user A has rated 4 to ‘Trance’.

A key advantage of the collaborative filtering approach is that it does not rely on
machine analysable content and therefore it is capable of accurately
recommending complex items such as movies without requiring an
"understanding" of the item itself. Many algorithms have been used in
measuring user similarity or item similarity in recommender systems. For
example, the k-nearest neighbour (k-NN) approach and the Pearson Correlation
as first implemented by Allen. When building a model from a user's behaviour, a
distinction is often made between explicit and implicit forms of data collection.
Flowchart of the proposed system

Facilities required for this project work

We would be using Python as our main coding language.


➢ As it have huge number of python libraries and is an open source language, also it is easy
and useably flexible.

Software we’d be using would mainly be •PyCharm •Google Collaborator.

We would be preparing a dataset of our own, as there are not any good datasets
we could fetch for our programme. (Creating an elaborate and extensive dataset of
famous movies is under progress)
References:

1. Data science,
https://www.ibm.com/cloud/learn/data-science-introduction
2. Data science explained,
https://www.oracle.com/in/data-science/what-is-data-science/
3. Machine Learning,
https://searchenterpriseai.techtarget.com/definition/machine-learning-
ML
4. Recommendation System,
https://en.wikipedia.org/wiki/Recommender_system
5. Type of RS,
https://www.bluepiit.com/blog/classifying-recommender-
systems/#:~:text=There%20are%20majorly%20six%20types,system%20and
%20Hybrid%20recommender%20system.
6. UBCF algorithm,
http://rstudio-pubs-
static.s3.amazonaws.com/248530_18970dc8eb4046a6b4f2fba987fe2a50.ht
ml
7. k-NN Algorithm,
https://en.wikipedia.org/wiki/K-nearest_neighbors_algorithm

8. https://towardsdatascience.com/how-to-build-a-movie-recommendation-
system-67e321339109
9. https://www.geeksforgeeks.org/python-implementation-of-movie-
recommender-system/
10. https://www.kaggle.com/ibtesama/getting-started-with-a-movie-
recommendation-system
11. https://www.inderscience.com/info/inarticle.php?artid=107079

References for software;


12. PyCharm: https://www.jetbrains.com/pycharm/
13. G. Colab: https://colab.research.google.com/

You might also like