David Magar 20048430
I confirm that I understand my coursework needs to be submitted online via Google Classroom under the relevant
module page before the deadline for my assignment to be accepted and marked. I am fully aware that late
submissions will be treated as non-submission and a mark of zero will be awarded.
Table of Contents
1. Introduction
Synonymy
Privacy
Sparsity
2. Background
3. Solution
Flowchart
Pseudocode
4. Conclusion:
References
DAVID MAGAR CU6051NT
1. Introduction
Artificial intelligence is the simulation of human intellectual processes by machines,
especially computer systems. Specific applications of AI include expert systems, natural
language processing, speech recognition, and computer vision.
As the hype around AI grows, vendors have scrambled to promote how their
products and services use AI. Writing and training machine learning algorithms
requires specialized hardware and software platforms. No programming language is
synonymous with AI, but a few are popular, including Python, R, and Java. Typically, AI
systems work by ingesting large amounts of labeled training data, analyzing the data for
correlations and patterns, and using those patterns to make predictions about future
states. In this way, a chatbot that is fed examples of text chats can learn to produce
lifelike exchanges with people, and an image recognition engine can learn to identify and
describe objects in images by examining millions of examples.
1.1. Explanation of the Topic and AI Concept used
The topic is recommendation systems, and the main part of the work is a music
recommendation system. Given a dataset of user ratings and song names, the
algorithm predicts ratings for items and shows each user the items they would be
likely to rate highly. Several lists and methods are needed for the system to work
properly and produce its output. Sites like Spotify are examples of music
recommendation systems. A recommendation system is a system designed to
recommend things to a user based on a variety of factors. These systems predict the
product that a user is most likely to buy or be interested in. The
recommendation system processes a large amount of available information, filtering
out the most important items based on user-supplied data and other factors of interest
to the user. It finds matches between users and items and bases its recommendations
on the similarity between them.
Both the users and the service providers benefit from these types of systems. Quality
and decision making also improve with these types of systems.
Sparsity
Data sparsity refers to the difficulty of finding reliably similar users, because active users
typically rate only a small portion of the items.
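As a rough illustration of sparsity, the sketch below measures what fraction of a user-item rating table is empty. The rating data, the user and song names, and the `sparsity` helper are all made up for this example; they are not taken from the project's dataset.

```python
def sparsity(ratings, n_items):
    """Return the fraction of user-item cells that have no rating."""
    n_users = len(ratings)
    filled = sum(len(user_ratings) for user_ratings in ratings.values())
    return 1.0 - filled / (n_users * n_items)


# Hypothetical ratings: each user has rated only a few of the 5 songs.
ratings = {
    "user_a": {"song_1": 5, "song_2": 3},
    "user_b": {"song_2": 4},
    "user_c": {"song_3": 2, "song_4": 5},
}

# 5 of the 15 possible cells are filled, so two thirds are empty.
print(round(sparsity(ratings, n_items=5), 3))
```

The more cells are empty, the fewer songs any two users have rated in common, which is what makes reliable similar users hard to find.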
2. Background
Much research has been done on this topic. Algorithms such as Pearson correlation,
Manhattan distance, and k-means clustering have been used and studied.
Today, music streaming platforms have become the dominant
medium through which people listen to the songs they want. Music streaming services are
now able to collect huge amounts of data on the listening patterns of their customers.
Streaming sites like Apple Music, Spotify, and Pandora use their users' data to
provide recommendations to their customers. These music systems filter the data to
predict a user's preference for a certain item. There are two approaches to
recommender systems: collaborative filtering and content-based filtering.
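The two similarity measures named above, Manhattan distance and Pearson correlation, can be sketched as follows. This is a minimal illustration that compares two users only on the songs both have rated; the function names and the sample ratings are assumptions made for this example.

```python
from math import sqrt


def manhattan(a, b):
    """Sum of absolute rating differences over the songs both users rated."""
    common = set(a) & set(b)
    return sum(abs(a[i] - b[i]) for i in common)


def pearson(a, b):
    """Pearson correlation over co-rated songs (0.0 if it is undefined)."""
    common = sorted(set(a) & set(b))
    n = len(common)
    if n < 2:
        return 0.0
    xs = [a[i] for i in common]
    ys = [b[i] for i in common]
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


# Hypothetical ratings on a 1 to 5 scale.
alice = {"song_1": 5, "song_2": 3, "song_3": 1}
bob = {"song_1": 4, "song_2": 2, "song_3": 1, "song_4": 5}

print(manhattan(alice, bob))  # smaller distance means more similar
print(pearson(alice, bob))    # closer to 1 means more similar taste
```

Manhattan distance is sensitive to how harshly each user rates, while Pearson correlation compares the shape of the two rating patterns, so the two measures can rank neighbours differently.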
Collaborative Filtering
Collaborative filtering is the best-known implementation of a recommendation engine and is
based on a calculated guess: people who liked a product will like the same kind of product in
the future. This type of algorithm is also known as product-based collaborative filtering.
In this filtering, users are filtered and associated with other users instead of with items.
In this system, only the users' behavior is taken into account; their content and
profile information alone is not enough. Users who give a positive review for a product are
linked to other users who give a similar review.
The main idea behind this approach is to offer new items based on the behavioral
proximity of similar customers.
Collaborative filtering relies on a record of each user's historical selections.
This system does not require many product features to work. An embedding, or
feature vector, describes each item and user, and places both the items and the users in
the same embedding space. The system creates these embeddings for items and users on its
own. Feedback from other buyers is considered when recommending a particular product to
the target user. The system tracks the behavior of all users before suggesting which items
the user is most likely to like. It also connects users with similar interests and behaviors
towards a similar product when offering the product to the target customer.
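The idea of recommending from the behavior of similar users can be sketched as a small user-based collaborative filter. This is an illustration under stated assumptions, not the method used in this project: it uses cosine similarity over co-rated songs, a similarity-weighted average to predict ratings, and invented user names and ratings.

```python
from math import sqrt


def cosine(a, b):
    """Cosine similarity of two users over the songs both have rated."""
    common = set(a) & set(b)
    if not common:
        return 0.0
    dot = sum(a[i] * b[i] for i in common)
    norm_a = sqrt(sum(a[i] ** 2 for i in common))
    norm_b = sqrt(sum(b[i] ** 2 for i in common))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(target, ratings, top_n=2):
    """Rank songs the target has not rated by similarity-weighted ratings."""
    scores, weights = {}, {}
    for other, other_ratings in ratings.items():
        if other == target:
            continue
        sim = cosine(ratings[target], other_ratings)
        if sim <= 0:
            continue
        for item, rating in other_ratings.items():
            if item in ratings[target]:
                continue  # only suggest songs the target has not rated yet
            scores[item] = scores.get(item, 0.0) + sim * rating
            weights[item] = weights.get(item, 0.0) + sim
    predicted = {item: scores[item] / weights[item] for item in scores}
    return sorted(predicted, key=predicted.get, reverse=True)[:top_n]


# Hypothetical ratings on a 1 to 5 scale.
ratings = {
    "ana": {"song_a": 5, "song_b": 4},
    "ben": {"song_a": 5, "song_b": 4, "song_c": 5, "song_d": 2},
    "cara": {"song_a": 1, "song_b": 5, "song_c": 2, "song_d": 4},
}

print(recommend("ana", ratings))
```

Here ben rates the shared songs exactly as ana does, so his opinion of the unrated songs carries more weight than cara's.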
User-product interactions are recorded from two sources. First, through implicit
feedback, users' likes and dislikes are inferred from their actions, such as
clicking, listening to music, searching, making purchases, and viewing pages.
On the other hand, explicit feedback occurs when customers state that they
like or dislike a particular product by rating or reacting to it on a scale of 1 to 5 stars.
These are direct user reviews that express likes and dislikes for the product, and they
include both positive and negative feedback.
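The difference between the two feedback sources can be shown with a small sketch. Explicit feedback is stored directly as a star rating, while implicit feedback has to be turned into a score from logged actions. The action weights, event log, and function name below are hypothetical and chosen only for illustration.

```python
# Hypothetical weights: how strongly each implicit action signals interest.
IMPLICIT_WEIGHTS = {"click": 1.0, "listen": 2.0, "search": 0.5, "purchase": 3.0}


def implicit_scores(events):
    """Sum the weighted actions for each (user, item) pair."""
    scores = {}
    for user, item, action in events:
        key = (user, item)
        scores[key] = scores.get(key, 0.0) + IMPLICIT_WEIGHTS.get(action, 0.0)
    return scores


# Implicit feedback: a made-up log of (user, item, action) events.
events = [
    ("ana", "song_a", "click"),
    ("ana", "song_a", "listen"),
    ("ana", "song_a", "listen"),
    ("ben", "song_b", "search"),
]

# Explicit feedback: star ratings given directly by the user (1 to 5).
explicit = {("ana", "song_a"): 5}

print(implicit_scores(events))
```

Implicit scores are plentiful but noisy, whereas explicit ratings are reliable but rare, which is why systems often combine the two.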
3. Solution
3.1. Proposed Solution / approach to solving the problem
Most people enjoy music. A music recommendation system is useful in many
respects. It guides the user to products or music similar to what they already
like or enjoy, which also helps companies reach listeners with their content and
attract more people. People like to share music and music genres with other
people, and by providing similar music the recommendation system helps to
keep the user engaged with the platform.
Clustering is the process of dividing a population or set of data points into several groups so
that the data points in the same group are similar to each other
and different from the data points in other groups.
The k-means clustering algorithm is used in this project. K-means is a simple
unsupervised learning algorithm used to solve clustering problems. It follows a simple
procedure for partitioning a given data set into a number of clusters, denoted by the letter
"k", which is fixed in advance.
It tries to make the intra-cluster data points as similar as possible while also keeping the
clusters as different (far) as possible. It assigns data points to a cluster such that the
sum of the squared distance between the data points and the cluster’s centroid
(arithmetic mean of all the data points that belong to that cluster) is at the minimum. The
less variation we have within clusters, the more homogeneous (similar) the data points
are within the same cluster.
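The quantity that k-means tries to minimize, the sum of squared distances between each data point and its cluster's centroid, can be computed as in the sketch below. The two example groupings and the helper names are assumptions made for illustration.

```python
def centroid(points):
    """Arithmetic mean of a list of equal-length coordinate tuples."""
    n = len(points)
    return tuple(sum(p[d] for p in points) / n for d in range(len(points[0])))


def within_cluster_ss(clusters):
    """Sum of squared distances from every point to its cluster centroid."""
    total = 0.0
    for points in clusters:
        c = centroid(points)
        total += sum(
            sum((p[d] - c[d]) ** 2 for d in range(len(c))) for p in points
        )
    return total


# The same four hypothetical points grouped in two different ways.
tight = [[(1.0, 1.0), (1.0, 2.0)], [(8.0, 8.0), (9.0, 8.0)]]
mixed = [[(1.0, 1.0), (8.0, 8.0)], [(1.0, 2.0), (9.0, 8.0)]]

print(within_cluster_ss(tight))  # small: nearby points share a cluster
print(within_cluster_ss(mixed))  # large: distant points share a cluster
```

The grouping with the smaller value has less variation inside each cluster, so it is the one k-means prefers.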
Flowchart
Pseudocode
Choose the number of clusters, K
Initialize K centroids at random
REPEAT
    FOR each datapoint
        Compute the distance between the datapoint and each centroid
        Assign the datapoint to the closest centroid
    END FOR
    Recompute each centroid as the mean of the datapoints assigned to it
UNTIL the centroids don't change
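The pseudocode above can be turned into a short runnable sketch in plain Python. This is a minimal illustration, not the project's implementation: the sample points, the `max_iter` limit, and the fixed random `seed` are assumptions, and the centroids are initialized by picking K of the data points at random.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def k_means(points, k, max_iter=100, seed=0):
    """Cluster points into k groups; return (centroids, assignments)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initialize K centroids at random
    assignments = None
    for _ in range(max_iter):
        # Assign each datapoint to its closest centroid.
        new_assignments = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_assignments == assignments:
            break  # nothing moved, so the centroids will not change either
        assignments = new_assignments
        # Recompute each centroid as the mean of its assigned datapoints.
        for j in range(k):
            members = [p for p, a in zip(points, assignments) if a == j]
            if members:
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centroids, assignments


# Hypothetical 2-D points forming two well-separated groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
centroids, labels = k_means(points, k=2)
print(labels)
print(centroids)
```

Because the starting centroids are random, k-means can settle on different clusterings for different seeds on harder data; a common remedy is to run it several times and keep the result with the lowest within-cluster sum of squares.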
4. Conclusion:
Analysis of the work done
10 | P a g e ARTIFICIAL INTELLEGENCE
References
(2021). Retrieved from Javatpoint: https://www.javatpoint.com/k-means-clustering-algorithm-in-machine-learning