Professional Documents
Culture Documents
Music Recommendation System
Music Recommendation System
Music Recommendation System
Internship Seminar
on
Music Recommender System
• Music dataset is too big while life is short!!!! You need someone to
teach you how to manage and give you wise suggestions
according to your taste!
• Music service providers need a more efficient system to attraction their clients!
OUR SYSTEM
User‘s Music
listening Prediction of
history
&music Recommender songs that
user will listen
informatio to
n
System
Item-based User-based
🞂 COLLABORATIVE FILTERING
This method utilizes preferences and behaviors of other users to come up with
recommendations.
The general operation of these systems is to pair users with similar tastes together into
groups. Recommendations are then made based on the collective preferences of the users
within each group.
Thus if user x and user y are both members of the same preference group, and user x
likes a particular sample, then there should be a high probability that user y also like this
sample. Obviously, this method requires a significantly large initial dataset to provide
relevant predictions.
1. Based on item’s description Similarity !=
and user’s preference profile recommendation (no
2. Not based on choices of notion of personalization)
other users with
similar interests
3. We make recommendations
by looking for music Majority of songs have too
whose features are very few listeners, so difficult to
similar to the tastes of the “collaborate”
user
Idea
Exploratory Data Exploratory Data Analysis
EDA is an approach to analyzing the data using visual techniques. It is used to discover
trends, and patterns, or to check assumptions with the help of statistical summaries and
graphical representations.
The dataset we have contains around 14 numerical columns but we cannot visualize
such high-dimensional data. But to solve this problem t-SNE comes to the rescue. t-
SNE is an algorithm that can convert high dimensional data to low dimensions and uses
some non-linear method to do so which is not a concern of this article.
S System Architecture
🞂
Data Collection and Understanding Process:
The real dataset is used for the research. We have taken music data which
contains 2000 records and 15 fields, including categorical and numeric
features. Each record in the music information, and each field in the record
represents a feature of that particular employee.
Data Preparation and Pre-processing:
After the process of data collection is finished, the process of preparing
the data is performed. It is important to refine this data so that it can be
suitable for the models and generate better results. In this phase we
performed tasks like cleaning, filling the missing data, and removing
unwanted data. The data of Spotify had various attributes which were not
relevant, i.e., as not giving any useful information, like Title, Artist, Top
Genre, Energy, BPM, Liveness, etc.; hence these attributes are removed
in this phase.
Feature Selection:
Feature
🞂 selection is one of the main concepts of DM and Machine Learning. Where it
is a process of selecting necessary useful variables in a dataset to improve the results
of machine learning and make it more accurate, there are a lot of columns in the
predictor variable. So, the correlation coefficient is calculated to see which of them
are important and these are then used for training methods. From there, we get the top
factors that affect performance.
Results:
First latent factors capture properties of the most popular items,
while the additional latent factors represent more refined
features related to unpopular items.
[2] Aiolli, F. (2012). A preliminary study on a recommender system for the million
songs dataset challenge. PREFERENCE LEARNING: PROBLEMS AND APPLICATIONS IN AI
[4] Cremonesi, Paolo, Yehuda Koren, and Roberto Turrin. "Performance of recommender
algorithms on top-n recommendation tasks." Proceedings of the fourth ACM
conference on Recommender systems. ACM, 2010