Professional Documents
Culture Documents
Major
Major
4/11/12
OUTLINE
OBJECTIVE INTRODUCTION OF WEB CRAWLER USES OF CRAWLER WORKING OF CRAWLER PROBLEM SPECIFICATION PROBLEM SOLUTION ANALYSIS OF PROPOSED SYSTEM STRUCTURE CONCLUSION
4/11/12
OBJECTIVE
4/11/12
Introduction of crawler
AWeb
crawleris a computer program that browses theWorld Wide Webin a methodical, automated manner or in an orderly fashion. Crawler is also known as web spider, ants,automatic indexers , bots,Web spiders,Web robots.
Web
4/11/12
Uses of crawler
q
to create a copy of all the visited pages for later processing by a search engine that willindexthe downloaded pages to provide fast searches. for automating maintenance tasks on a Web site, such as checking links or validatingHTMLcode. to gather specific types of information from Web pages.
4/11/12
4/11/12
4/11/12
Problem Specification
Need Pages
4/11/12
Problem Solution
Designing
crawler.
This
will provide fast data retrieval and thus will result in fast searching.
4/11/12
Multisystem
Multisystem refers to being able to run on multiple systems. we are using Java technology hence it will be able to run on various systems having Java Platform.
4/11/12
Since
Contd..
Multithrea
ded :
4/11/12
Fetching Parsing
URL
Stoplisting
HTML
Multi-threaded
Conclusion
Due
to the dynamism of the Web, crawling forms the back-bone of certain web applications. facilitates Web information retrieval. the typical use of crawlers has been for creating and maintaining indexes for general purpose search-engine. usage of crawlers is emerging both for client and server based applications.
It
While
Diverse
4/11/12
Queries
4/11/12