Download as pdf or txt
Download as pdf or txt
You are on page 1of 3

International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169

Volume: 4 Issue: 7 267 - 269


____________________________________________________________________________________________________________________
Recommendation Item Based on Keyword Search using Big Data

Jafar Sadik Kamadod Prof. Shrivatsa Koulgi


Mtech IT, SDMCET, Dharwad SDMCET, Dharwad
Email: m.jafarsk@gmail.com Email: shrivatsa.koulgi@gmail.com

Abstract- Design the methodology for piloting the Book Recommendation System. Provide responses on the suitability of Books with respect to
content and likes by users. Determine the suitability of existing technology whether there is need for other recommendations. Moreover, to
improve the scalability and efficiency of service recommendation system in Big Data environment, we have implemented it on a Map Reduce
framework in Hadoop platform. Finally, we recommend books to users based on similarity.

Keywords-web search, big data, mapreduce, collaborative filtering,recommendation system.


__________________________________________________*****_________________________________________________
I. INTRODUCTION keyword to see if there are similar books. We believe that the
With the growing number of alternative online abandoned keywords cover a lot of book demands
services, recommending services that users preferred information. These demands can serve as a reference for
effectively has become an important research issue. Service purchasing books.
recommender systems have shown to be valuable tools to help
users deal with services overload and provide appropriate A. COLLABORATIVE FILTERING:
recommendations to them. Examples of such practical It is a technique used by some recommender systems.
applications include CDs, books, web pages and various other Collaborative filtering has two senses, a narrow one and a more
products now use recommender systems [1], [2]. general one. In general, collaborative filtering is the process of
In recent years, the amount of data in our world has filtering for information or patterns using techniques involving
been increasing explosively and analyzing large data setsso collaboration among multiple agents, viewpoints, data sources,
calledBig Databecome a key basis of competition etc. [5]. Applications of collaborative filtering typically involve
underpinning new waves of productivity growth, innovation, very large data sets. Collaborative filtering methods have been
and consumer surplus [3]. Then, what is Big Data?, Big data applied to many different kinds of data including: sensing and
refers to data sets whose size is beyond the ability of current monitoring data, such as in mineral exploration, environmental
technology, method and theory to capture, manage, and process sensing over large areas or multiple sensors; financial data,
the data within a tolerable elapsed time. Today, Big Data such as financial service institutions that integrate many
management stands out as a challenge for IT companies. The financial sources; or in electronic commerce and web
solution to such a challenge is shifting increasingly from applications where the focus is on user data, etc. The remainder
providing hardware to provisioning more manageable software of this discussion focuses on collaborative filtering for user
solutions [4]. data, although some of the methods and approaches may apply
A search system is designed to search required to the other major applications as well [6].
information like searching a book using book title, using the
searched data, recommend the book to the user with similar IV. METHODOLOGY/ PLANNING OF WORK:
taste. To recommend the books we use collaborative filtering The recommendation system based on keyword search works
approach. solely on the basis of explicit user feedbacks and ratings.

II. RELATED WORK


In the recent years the amount of data has increased in World
Wide Web that yields to big data problem. In Earlier
approach we face problems of scalability and efficiency
because of increasing data. Earlier the recommendation system
is focused on ranking and rating of the user to provide the
services. Now purposed system focused on item of service. The
collaborative filtering technique is used to implement the
recommendation of the active user, The Map Reduce
framework is used to calculate ratings in parallel to increase the
scalability and efficiency of the system because of the large
dataset.

III. BOOK RECOMMENDATION MODEL


Recommendation system grounded itself on the traditional
bibliographic retrieval system. Readers used to look up a book
using the retrieval system. Sometimes the system will return an
empty set, which means there is no book that contains the
readers keyword. In this case, readers have to change the Fig.1 Architecture

267
IJRITCC | July 2016, Available @ http://www.ijritcc.org
_______________________________________________________________________________________
International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169
Volume: 4 Issue: 7 267 - 269
____________________________________________________________________________________________________________________
The main steps of Recommendation system based on keyword recommendations and the human-computer interaction, RSs
search are depicted in Fig. 1, which are described in detail as exploit a range of information about the users. This information
follows: can be structured in various ways and again the selection of
what information to model depends on the recommendation
A. Capture user preferences by a keyword-aware technique.
approach: In this step, the preferences of active users and
previous users are formalized into their corresponding G. Pearson correlation coefficient algorithm
preference keyword sets respectively. An active user refers to a Correlation between the set of important items is calculated
current user needs recommendation. using the Pearson coefficient algorithm.

1) Preferences of a current user. A current user can give Sim(APK, PPK)=kk (ra,k ra)(rp,k rp)
his/her preferences about candidate services by selecting kk (ra,k ra) kk (rp,k rp)
keywords from a keyword-candidate list or Search a book
H. Map Reduce
2) Preferences of existing users. The preferences of a previous It is used to process the input in parallel way. Here we compute
user for a candidate service are extracted from his/her reviews similarity and generate the highest rated item and calculate the
for the service according to the keyword-candidate list or ratings and ranking in parallel. [7]
search a book for recommendation
V. MODULES
B. Preprocessing
The reviews of previous users are stored into the A. Login and book details
Dataset. The Dataset was downloaded from the informatics The Login Form module presents site visitors with a form with
site. Preprocessing is carried out on a Dataset and gets the username and password fields. If the user enters a
previous user rating. validusername/password combination they will be granted
access to additional resources on your application.
C. Similarity computation. The second step is to identify the Whichadditional resources they will have access to can be
reviews of previous users who have similar tastes to an active configured separately [8].
user by finding neighborhoods of the active user based on the
similarity of their preferences. Before similarity computation, In this module admin can add new Book titles and Images
the reviews unrelated to the active users preferences will be details. These details will add tothe existing details. User can
filtered out by the intersection concept in set theory. If the select the books details added in this module and they will rate
intersection of the preference keyword sets of the active user the Book based on their reviews. These details are used for
and a previous user is an empty set, then the preference cluster the data based on their ratings [9].
keyword set of the previous user will be filtered out.
B. MapReduce Programming Model
sim (APK, PPK) = Jaccard(APK, PPK) = |APKPPK| 1) Map, written by the user, takes an input pair and
|APKPPK| produces a set of intermediate key/value pairs. The MapReduce
library groups together all intermediate values associated with
D. Compute ratings and generate recommendations. The the same intermediate key I and passes them to the Reduce
rating can be calculated based on the similarity of the active function.
user and previous users, further filtering will be conducted and
calculate the personalized ratings and collects the highest 2) It processes the reviews of the existing user with their
ratings using nearest neighbors algorithm. preference and compute average rating for each user.

Top S with nearest neighbors algorithm 3) The Reduce function, also written by the user, accepts
G a,k= ra + S(ru,k-ru)xW(s1,s2) an intermediate key I and a set of values for that key. It merges
S(s1,s2) together these values to form a possibly smaller set of
E. Items: values.Typically just zero or one output value is produced per
Items are the objects that are recommended. Items may be Reduce invocation. The intermediate values are supplied to the
characterized by their complexity and their value or utility. The users reduce function via an iterator. This allows us to handle
value of an item may be positive if the item is useful for the lists of values that are too large in memory.
user or negative if the item is not appropriate and the user made
a wrong decision when selecting it. We note that when a user is 4) The Reduce task calculates the personalized ratings
acquiring an item she will always incur in a cost, which and recommends the books with similar taste.
includes the cognitive cost of searching for the item and the
real monetary cost eventually paid for the item. 5) Map Reduce Specification Object. Contains names of
input/output less and optional tuning parameters. The user then
F. Users: invokes the MapReduce function, passing it the specification
Users of a recommended system may have very diverse goals object. The users code is linked together with the MapReduce
and characteristics. In order to personalize the library.

268
IJRITCC | July 2016, Available @ http://www.ijritcc.org
_______________________________________________________________________________________
International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169
Volume: 4 Issue: 7 267 - 269
____________________________________________________________________________________________________________________
Example: In table1 the three users rated the books commonly,
which means they have similar taste, but the user not rated the
book (HTML), which means he/she hasnt saw that book. So
by the ratings of user A and C recommend the book to user B.

VII. CONCLUSION
In this paper, we propose Online Book Recommendation
System Based on Keyword Search Using Big Data Analysis,
keywords are used to indicate users preferences, and a user
based Collaborative Filtering algorithm is adopted to generate
appropriate recommendations. More specifically, The active
user gives his/her preferences by selecting the keywords from
the keyword-candidate list, and the preferences of the past
users can be extracted from their reviews and recommending
the most appropriate service(s) to the users. Moreover, to
improve the scalability and efficiency of our approach in big
data environment, we implement it in a Map-Reduce
framework on Hadoop platform.

VIII. REFERENCES
[1] G. Linden, B. Smith, and J. York, Amazon.com
Fig.2 MapReduce implementation model Recommendations: Item to-Item Collaborative Filtering,
IEEE Internet Computing, Vol. 7, No.1, pp. 76-80, 2003.
6) MapReduce computation done parallel to improve the [2] B. N. Miller, I. Albert, S. K. Lam, J. A. Konstan, and J.
performance of the system Riedl, MovieLens Unplugged: Experiences with an
Occasionally Connected Recommender System, In IUI 03
Proceedings of the 8th International Conference on
7) MapReduce work on key-value pairs, Key i and value
Intelligent User Interfaces, pp.263-266, 2003.
i is the pairs of existing user reviews input to MapReduce [3] J. Manyika et al., Big Data: The Next Frontier for
framework, where key j and value j are the pair input of the Innovation,Competition, and Productivity, 2011.
current users. Using this MapReduce compute the rating in [4] C. Lynch, Big Data: How Do Your Data Grow? Nature,
parallel to increase the scalability and efficiency of the vol. 455,no. 7209, pp. 28-29, 2008.
recommendation system. [5] K.G.S. Venkatesan, Dr. Kathir. Viswalingam, N.G. Vijitha,
Associate Adaptable Transactions Information store in the
VI. REQUIREMENTS ON ONLINE BOOKS cloud using Distributed storage and meta data manager,
RECOMMENDATION International Journal of Innovative Research in computer &
communication Engineering, Vo l . 3 , Issue3 , PP. 1548
New user must register first to get one users account ID. The
15 55, March 2015
given information includes username, first name, last name, [6] M. A. Beyer and D. Laney, The importance of big data:
email address and password. After registration the registration A definition, Gartner, Tech. Rep., 2012.
details are stored in database, which we can retrieve the [7] Y. Zhu and Y. Hu. 2006. Enhancing search performance on
information at the time of login. One user can only have only Gnutella-like P2P systems, IEEE Transactions on Parallel
one account. and Distributed Systems, Vol. 17, No. 12, pp. 1482-1495.
[8] K.G.S. Venkatesan and M. Elamurugaselvam, Using the
The book details are identified by ISBN. For each book, the conceptual cohesion of classes for fault prediction in object-
system also needs to record its authors name, title, edition, oriented system, International journal of Advanced &
Innovative Research, Vol. 2, Issue 4, PP. 75 80, April
year, category, publisher etc.
2013.
[9] K.G.S. Venkatesan, R. Resmi, R. Remya, Anonymizimg
After login the user search for books using keyword, Geographic routing for preserving location privacy using
collaborating filtering technique is used to recommend the user unlinkability and unobservability, International Journal of
with similar taste that the user is looking for the record. Advanced Research in computer science & software Engg.,
Vol. 4, Issue 3, PP. 523 528, March 2014.
TABLE I.
USER- ITEM RATINGS
Books Bytes of Bokeh Pandas Ipython HTML
Users Python

A 8 8 8 8 9
B 9 8 8 9 -
C 8 7 8 10 10

269
IJRITCC | July 2016, Available @ http://www.ijritcc.org
_______________________________________________________________________________________

You might also like