Image Scrapper

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 14

Image Scrapper

Using Python

Arun Chaudhary
1/20/FET/BCS/086

Mohit Chaudhary
1/20/FET/BCS/087

Shashank Rai
1/20/FET/BCS/106

Supervisor Arnav Lakha


Dr. Ravindra Chahar 1/20/FET/BCS/112

Shourya Ahuja
1/20/FET/BCS/115
Table of Contents
• 1)Introduction
• 2) Problem Statements
• 3) Objectives
• 4) Hardware and software requirements
• 5) Literature Review
• 6)System Design
• 7) Methodology
• 8) Expected Outcome Of project /Result
• 9) Conclusion & Future Scope
• 10)References
Introduction
• Scraping: the actual extraction of data / information from a web page.

• Image scraping is a subset of the web scraping technology. While web


scraping deals with all forms of web data extraction, image scraping
only focuses on the media side – images, videos, audio, and so on.

• There are many different tools for scraping available, which differ in
their functionality and use.
PROBLEM STATEMENT
• Data collection is an infrequently talked about topic in the machine
learning/deep learning space. While there are a number of preloaded datasets
on libraries such as PyTorch and Scikit-Learn, one might need to collect and
curate custom datasets for a specific project.

• There are a number of ways to go about data collection such as taking readings
from data collection instruments or manually recording observations where
suitable. In a computer vision context, the low hanging fruit for data collection is
scraping pre-existing images from web pages.
Project Objectives
• The project objectives for image scraping will depend on the specific
goals of your project. However, here are some common objectives
that image scraping can help achieve:

• 1. Collecting data for research: Image scraping can be used to collect


data for various research purposes, such as analyzing trends,
identifying patterns, or studying consumer behavior.

• 2. Creating a dataset for machine learning: Image scraping can be


used to create a dataset for machine learning models, such as image
recognition or object detection algorithms.
Hardware and software requirements
• HTTP: the communication protocol
• HTML: the language in which web pages are defined
• JS: javascript (code executing in the browser)
• Python : Language used to extract images from the webpage
• CSS: style sheets, how web pages are styled. Important, but does not contain
data.
• JPG, PNG, BMP: images
• CSV / TXT / JSON / XML: data
3. Building a product catalog: E-commerce companies can use image
scraping to build a product catalog for their online store. This can help
automate the process of collecting product images, descriptions, and
prices.

4. Brand monitoring: Companies can use image scraping to monitor social


media platforms and online forums for brand mentions and visual content
related to their products or services.

5. Creative projects: Image scraping can be used for creative projects, such
as creating a digital art collection, generating visuals for social media
posts, or building a stock image library.
Literature Review
• Data has a vital role in business, marketing, engineering, social sciences, and
other disciplines of study since it may be utilized as a starting point for any
operations activities that include the exploitation of information and knowledge.
The initial step of research is data collecting, followed by the systematic
measurement of information about important factors, allowing one to answer
inquiries, formulate research questions, test hypotheses, and assess outcomes.
Data collection methods differ depending on the subject or topic of study, the
type of data sought, and the user’s aims. Depending on the goals and conditions,
the method’s application methodology can also change without jeopardizing data
integrity, correctness, or reliability [1]. There are numerous data sources on the
Internet that might be employed in the design process. The technique of
extracting data from websites is often known as web scraping, web extraction,
web harvesting, web crawler.
Methodology
Expected Outcome
• The expected outcome for image scraping will depend on the specific
objectives of your project. However, here are some possible outcomes
that can be achieved through image scraping:

• 1. A dataset of images: One of the most common outcomes of image


scraping is a dataset of images that can be used for various purposes,
such as research, machine learning, or creative projects.

• 2. Insights and analysis: Image scraping can provide insights and


analysis on various topics, such as consumer behavior, visual trends,
or social media conversations related to specific products or brands.
• 3. Automated data collection: Image scraping can help automate the
process of collecting and organizing visual content, reducing the
amount of manual work required.

• 4. Improved product catalog: E-commerce companies can use image


scraping to improve their product catalog, ensuring that it is up-to-
date and accurate.

• 5. Competitive intelligence: Image scraping can provide insights into


competitors' visual content, helping companies better understand
their market position and identify areas for improvement.
References
• https://www.publications.scrs.in/uploads/final_menuscript/863dc562
8ae9215e611c22943d061742.pdf

• https://towardsdatascience.com/image-scraping-with-python-a96fed
a8af2d
Thank You!

You might also like