Professional Documents
Culture Documents
Conference
Conference
Conference
Mrs. B. Narmada,
Assistant Professor,
I. INTRODUCTION
In generally document search method is based on user need
to access in less time process. In every user point of view they
need to search a document frequently and meaningfully
constructed. Document type is like as .ppt (PowerPoint
Presentation), .pdf (Portable Document Format), .docx
(Microsoft Word 2007/2010/2013 document) and other formats.
This research providing the search engine for to search a
document like .ppt, .pdf, .docx to based on the user
recommendation. In this search engine based on the Google
Scholar. It will be provide the only research paper based on
the keywords. The result only based on .pdf documents. It is
also providing the .pdf direct link of document to refer on the
IEEE website.
Similar concept is based on the SlideShare PPT website
is provide the only type of .ppt (PowerPoint Presentation)
documents. In this SlideShare website also provides the
privacy for every user to maintain their history. It used for
easily retrieve the previous document already user visited on
about the topic and the link will be added on the user history.
It will be used for ranking concept at next time of this user
visits the same topic in the search engine.
New user is fresher for this search engine to search the
topic about Introduction about Data Mining. Then search
engine also provide some result for to relevant the topic in
result page. The new user is visits the some of the tab on page.
Its also added to the user history database. In this concept
represented on figure 1 show in on above. This figure based on
same topic to search and based on various users.
C. Pattern Clustering
The snippets are consider the user enter the query has been
keyword for the searching document. Here the keyword refers
the snippets. Snippets are used for to extract the document
from keyword to matching formats. Pattern extraction
algorithm considers all the words in a snippet, and is not
limited to extracting patterns only from the mid fix. The
consideration of gaps enables us to capture relations between
distant words in a snippet. We use a modified version of the
prefix span algorithm to generate subsequences from a text
snippet. We use the constraints (2-4) to prune the search space
of candidate subsequences. We showed how to extract lexical
patterns from snippets to represent numerous semantic
relations that exist between two words. In this section, we
describe a machine learning approach to combine both page
counts-based co-occurrence measures, and snippets-based
lexical pattern clusters to construct a cosine similarity [2] [3]
[4] based measure.
A. User Module
V. CONCLUSION
In this approach user registration system and its
functionalities are implemented. Pattern extraction is
exercised using cosine similarity in which keyword produced
by user plays a major role. When the user provides keyword, it
is used as snippets which are matched with already maintained
database. On matching of the snippets with the database
information the exact result is produced. The history of
previous documents viewed by the user is stored for later
access and analysis.
In the above approach the result brings both related and
unrelated information for the given search.
In order to
produce only related documents for the search, the concept of
clustering is proposed. A cluster of information is developed
based on the close relation exhibited documents. Thus for
given search documents which highly related for the
information is being searched is produced. Based on the user
history, the information is rated by using Visit of Link (VOL)
algorithm. This approach is applied only for document type
content. The proposed system increase the efficiency and time
conception for the search of information in web is reduced.
Future work of this approach has the above mentioned
ranking result is based on individual user. But if two links are
having same counting at to produce a result will be based on
time or some other approaches are going too implemented.
REFERENCES
[1].