Download as doc, pdf, or txt
Download as doc, pdf, or txt
You are on page 1of 4

SRS DOCUMENT ON WEB GRABBER BY N.SRINIVASA RAO A.KARTHIK M.HARINATHA REDDY D.

ANIL KUMAR (08N81A1240) (08N81A1247) (08N81A1246) (08N81A1253)

UNDER THE ESTEEMED GUIDANCE OF


INTERNAL GUIDE: MISS. B. ALEKYA(ASSISTANTPROFESSOR,CSE)

DEPARTMENT OF INFORMATION TECHNOLOGY SPHOORTHY ENGINEERING COLLEGE


(Affiliated to Jawaharlal Nehru Technological University, Hyderabad, A.P) Nadergul, Saroornagar mandal, Hyderabad-5010510

Web Grabber
Introduction: The proposed project is a high-speed, multi-threading website download and viewing program. By making multiple simultaneous server requests, the program can quickly download entire website or part of a site including HTML, graphics, Java Applets, sound and other user definable files, and saves all the files in the hard drive in their native format. After downloading, all links within the website are reconstructed creating a complete hard drive copy of the site that you can view at your own pace without being connected to the Internet. Additionally project should duplicate the original directory structure of a site making it easy to download and transfer a site to another server. If Internet connection is lost while downloading a site, the Resume Session allows picking up a session. If one wish to update a previously downloaded site, the Update Session feature allows him to revisit a site using new search parameters to make sure he has the most current files .The program will have an embedded quick-view browser window in the main program screen. The user can launch his default browser for viewing any downloaded projects. Existing System: Here in the existing system the downloading can be performed as a single thread i.e. the entire size data is combinedly processed at a time Here the Internet connection is must until the download of is complete. We can transfer only website link addresses and it doesnt download the part of the website.

Disadvantages:

If Internet connection is lost while downloading a site, there is no Resume Session to allow picking up a session. Here the speed of downloading is very slow because of no multi-threading applied. Difficulty in updating, resume the previous download sites. Since the existed system doesnt maintain any Update session feature.

Proposed Systems: Here the proposed system is to replace an existing system, with a solution called Web Grabber, which is a high-speed, multi-threading website download and viewing program. So that after downloading, all links within the website are reconstructed creating a complete hard drive copy of the site that you can view at your own pace without being connected to the Internet. And if one wishes to update a previously downloaded site, the Update Session feature allows him to revisit a site using new search parameters to make sure he has the most current files.

Advantages:

High-speed, multi-threading website downloads. Resume feature to pick up a session where left off. Update feature to download new or modified files. Built-in file viewer in onboard browser window to view files offline. User-selectable recursion levels, retrieval threads, timeout and proxy support. Accesses password-protected sites

FUNCTIONAL REQUIREMENTS : This web site requires the following functional abilities in Server (computer), clients (computers) and users (Humans) Server-side Requirements: Operation System Web Server Framework Database Minimum Space : Windows 2003 or XP or higher : tomcat 5.5 or above : J2EE Framework : Oracle 9i : 5GB (including Database space)

Client-side Requirements: Operation System Browser : Windows 2003 or XP or higher : Internet Explorer or Any web browser.

NON FUNCTIONAL REQUIRWMWNTS : The system should be web-based system. Users should use the system via internet. Each user should have a user account. The system should ask the username and password to users. It doesnt permit to unregistered user to access for WEB GRABBER. The system should have Role based System functions access. Approval Process has to be defined. The system should have Modular customization components so that they can be reused across the implementation.

These are the mainly following: Secure access of confidential data (users details). SSL (Secure Sockets Layer) can be used. 24 X 7 availability Better component design to get better performance at peak time Flexible service based architecture will be highly desirable for future extension

MODULES: GUI module: This is a graphical user interface module to view programs and other data. Thread Algorithm module: This module helps to download entire website or part of a site including HTML, graphics, Java Applets, sound and other user definable files. Connection module: All information about a particular topic from different sites can be found at one place. Environment: Software Requirements: Technologies: java, jsp, servlet, JDBC. IDE: Eclipse 3.5. Server: Web logic /Tomcat. Database: Oracle 11i. Hardware Requirements: RAM: 512 MB. HD: 40GB. Processor: Pentium IV.

You might also like