Welcome to Scribd!

Big Data Searching FIRST Review

Uploaded by

0% found this document useful (0 votes)

7 views10 pages

This document discusses building a big data search engine using machine learning. It aims to develop a search engine that can retrieve the most relevant textual data from a collection of documents based on user queries. The proposed search engine is intended to be more accurate than existing search engines by using machine learning techniques. It will also address limitations of current approaches like storage requirements, data cleaning needs, quality control issues, and security/privacy concerns that often come with big data.

Original Description:

Copyright

Available Formats

PPTX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pptx, pdf, or txt

0% found this document useful (0 votes)

7 views10 pages

Big Data Searching FIRST Review

Uploaded by

Ash

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pptx, pdf, or txt

Jump to Page

You are on page 1of 10

Search inside document

BIG DATA SEARCH USING MACHINE

LEARNING

Presented By: Guided by:

V.Aarsan (210919104001) Dr.G.Bhuvaneswari,

H.Ashish Joyson (210919104007 ) Head Of The Departmrnt(CSE)

CONTENT
 DOMAIN DESCRIPTION
 AIM AND OBJECTIVES
 LIMITATIONS
 PROBLEM STATEMENT
 PROPOSED SYSTEM
 SCOPE OF PROPOSED SYSTEM
DOMAIN DESCRIPTION

Machine learning is an application of AI that enables systems to learn and improve

from experience without being explicitly programmed.
Machine learning focuses on developing computer programs that can access data and
use it to learn for themselves.
The robot-depicted world of our not-so-distant future relies heavily on our ability to
deploy artificial intelligence (AI) successfully.
AIM AND OBJECTIVE

 The Big Data revolution promises to transform how we live, work, and think by
enabling process optimization, empowering insight discovery and improving decision
making.
 Today, the amount of data is exploding at an unprecedented rate as a result of
developments in Web technologies, social media, and mobile and sensing devices.
The ML will be one of the main drivers of the Big Data revolution.

For example, Twitter processes over 70M tweets per day, thereby generating over
8TB daily.
EXISTING SYSTEM
Information retrieval is to retrieve the information resources that we are interested in
or extract whatever information we need.
• Information Retrieval (IR) may deal with the organization, storage,
retrieval and evaluation of information from documents, particularly
textual information.
• But we cannot give the ranks to those documents.
Various sources report that 65-100% of Big Data Analytics projects fail. Gartner, a
research and advisory company, claims that 60% of big data projects would fail to move
past preliminary stages in 2017 (Gartner Inc., 2015).
LIMITATIONS

• Storage: datasets can require considerable resources to store

• Formatting and data cleaning: advanced computer science can be required before
the data is analyzable
• Quality control: can be difficult and often has to be done through small
representative samples
• Security and privacy concerns: often more complex than for traditional datasets
• Accuracy and consistency of methods: many approaches are relatively new and
imperfect, although these may continue to improve
PROBLEM STATEMENT

A Unified from of database software that can search text (numeric, alphanumeric,
alphabetic) or word within any from of data/file such as jpeg, png, pdf, word, excel,
SQL and other formats/proforma which also includes interconnectivity with
predefined key word within our database.
PROPOSED SYSTEM
The main focus of our system is to build a search engine using machine
learning technique for increasing accuracy compare to available search engine.
• The proposed search engine is very useful for finding out more relevant URLs for given
keywords.
• Anyone can easily identify the important documents in a collection of documents and
retrieve the related data.
• It proposes a novel model
We design and implement an in-memory index and extensively evaluate it in
comparison with several representative indexes, including B+ tree, skip list, Adaptive Radix
Tree. Experiment results outperforms the indexes
SCOPE OF PROPOSED SYSTEM

 The proposed system aims to develop a search engine using machine learning
techniques that can retrieve the most relevant textual data from a collection of
documents based on user queries.
 The proposed search engine can be useful for a variety of applications, including
academic research, business intelligence, and general information retrieval.
 The system's novel model can be a significant contribution to the field of
information retrieval.
Thank You

RS485 - MODBUS Communication Protocol - Solis Inverters
Document47 pages
RS485 - MODBUS Communication Protocol - Solis Inverters
Anh Đinh Vương
No ratings yet
Unit-Iii CC&BD CS71
Document89 pages
Unit-Iii CC&BD CS71
Hael
No ratings yet
Big Data History: Ideal Institute of Technology
Document11 pages
Big Data History: Ideal Institute of Technology
Abhi Stan Lee
No ratings yet
file1
Document3 pages
file1
rathna
No ratings yet
Big Data and Data Analysis: Offurum Paschal I Kunoch Education and Training College, Owerri
Document35 pages
Big Data and Data Analysis: Offurum Paschal I Kunoch Education and Training College, Owerri
Sixtus Okoro
No ratings yet
Getting Started With Hadoop Planning Guide
Document24 pages
Getting Started With Hadoop Planning Guide
Mohammed Zahir Al-Kilani
No ratings yet
Ccs 334
Document16 pages
Ccs 334
Amsaveni .amsaveni
No ratings yet
Big Data and Analytics
Document23 pages
Big Data and Analytics
mailyouranand
No ratings yet
Project Work
Document36 pages
Project Work
Orah Seun
No ratings yet
Big Data Manual - Edited
Document69 pages
Big Data Manual - Edited
Prabakaran Subramanian
No ratings yet
Design Principle For Big Data
Document4 pages
Design Principle For Big Data
Firdaus Adib
No ratings yet
Big Data: Concepts, Techniques, Storage and Challenges
Document9 pages
Big Data: Concepts, Techniques, Storage and Challenges
Rasmika Selvam
No ratings yet
Research Paper On Data Mining 2016
Document7 pages
Research Paper On Data Mining 2016
jizogol1siv3
100% (1)
Deep Learning
Document107 pages
Deep Learning
Mukund Tiwari
No ratings yet
Hand Book: Ahmedabad Institute of Technology
Document103 pages
Hand Book: Ahmedabad Institute of Technology
Bhavik Sanghar
No ratings yet
Big Data
Document3 pages
Big Data
nam trần
No ratings yet
Big Data
Document6 pages
Big Data
Divyasri
No ratings yet
Big Data: Spot Business Trends, Prevent Diseases, C Ombat Crime and So On"
Document8 pages
Big Data: Spot Business Trends, Prevent Diseases, C Ombat Crime and So On"
Renuka Pandey
No ratings yet
Document
Document5 pages
Document
yusufmuhammadii013
No ratings yet
1+-+Introduction+to+Data+Science
Document28 pages
1+-+Introduction+to+Data+Science
smallikarjun713
No ratings yet
Data Engineering
Document48 pages
Data Engineering
saisuvarnayatham
No ratings yet
Mtech Scheme
Document54 pages
Mtech Scheme
sirishaksnlp
No ratings yet
Big Data Seminar
Document27 pages
Big Data Seminar
Alemayehu Getachew
100% (2)
Data Science Vs Big Data
Document34 pages
Data Science Vs Big Data
poi.tamrakar
No ratings yet
03 - Data Engineering
Document5 pages
03 - Data Engineering
Laura Saglieti
No ratings yet
Research Papers On Big Data 2014 PDF
Document7 pages
Research Papers On Big Data 2014 PDF
qghzqsplg
100% (1)
Big Data Analytics: Free Guide: 5 Data Science Tools To Consider
Document8 pages
Big Data Analytics: Free Guide: 5 Data Science Tools To Consider
Keeme
No ratings yet
R Programming UNIT-1
Document48 pages
R Programming UNIT-1
padma
No ratings yet
Big Datapptfina1
Document25 pages
Big Datapptfina1
vedang patel
No ratings yet
AD3491 UNIT 1 NOTES EduEngg
Document35 pages
AD3491 UNIT 1 NOTES EduEngg
Aravind Samy
No ratings yet
Unit 1 Data Science and Big Data
Document23 pages
Unit 1 Data Science and Big Data
Pranav Sai Aditya
No ratings yet
Unit-III CC&BD Cs62 Ab
Document85 pages
Unit-III CC&BD Cs62 Ab
dhanrajpandya26
No ratings yet
04 Data Mining-Applications
Document6 pages
04 Data Mining-Applications
Raj Endran
No ratings yet
Towards Methods For Systematic Research On Big Data
Document10 pages
Towards Methods For Systematic Research On Big Data
Rosa Quelal Mora
No ratings yet
A Seminar Report: Big Data
Document22 pages
A Seminar Report: Big Data
lavhack
No ratings yet
Big Data Analytics: Recent Achievements and New Challenges
Document5 pages
Big Data Analytics: Recent Achievements and New Challenges
ATS
No ratings yet
QB Bda Solution
Document46 pages
QB Bda Solution
Avinash
No ratings yet
Big Data
Document16 pages
Big Data
Shruti Patawar
No ratings yet
Big Data Analytics Using Apache Hadoop
Document33 pages
Big Data Analytics Using Apache Hadoop
AbinBabyElichirayil
No ratings yet
Data Profiling Screen
Document4 pages
Data Profiling Screen
zipzapdhoom
No ratings yet
Business Intelligence Exam II Answers
Document24 pages
Business Intelligence Exam II Answers
tuchi
0% (1)
Fda 1
Document5 pages
Fda 1
noopur jadhav
No ratings yet
Unit 2 Da
Document69 pages
Unit 2 Da
aadityapawar210138
No ratings yet
Bigdata Documentation
Document20 pages
Bigdata Documentation
Babu Giri
No ratings yet
Review Paper On Big Data Analytics in Cloud Computing: July 2017
Document6 pages
Review Paper On Big Data Analytics in Cloud Computing: July 2017
Ogbodu Ejiro Desmond
No ratings yet
Chapter-1-2, EMC DSA Notes
Document8 pages
Chapter-1-2, EMC DSA Notes
akragnarock
No ratings yet
Foundation of Data Science
Document143 pages
Foundation of Data Science
JANILA J.
100% (2)
Sns College of Engineering: Big Data Analytics
Document17 pages
Sns College of Engineering: Big Data Analytics
rajianand2
No ratings yet
Lecture 1
Document21 pages
Lecture 1
Muhammad Akhtar
No ratings yet
Big Data Analytics
Document73 pages
Big Data Analytics
ayushgoud1234
No ratings yet
Da Unit-1
Document23 pages
Da Unit-1
jaganbecs
No ratings yet
Various Big Data Tools
Document33 pages
Various Big Data Tools
Vishal Gupta
No ratings yet
Introduction To Big Data BS (CS) 6 Lecture # 2: Dr. Syed Attique Shah (PH.D.)
Document28 pages
Introduction To Big Data BS (CS) 6 Lecture # 2: Dr. Syed Attique Shah (PH.D.)
Ahsan Iqbal
No ratings yet
What Is Big Data ?
Document6 pages
What Is Big Data ?
Meet Mahida
No ratings yet
Big Data - Iv Bda
Document143 pages
Big Data - Iv Bda
Jefferson Aaron
No ratings yet
Unit 1
Document14 pages
Unit 1
heimmer369
No ratings yet
Internet Technologies: By: Nandish Rao A
Document16 pages
Internet Technologies: By: Nandish Rao A
malini.kuppuraj1802
No ratings yet
Issues in Information Systems: Big Data Analytics
Document10 pages
Issues in Information Systems: Big Data Analytics
Luis Felipe García Diaz
No ratings yet
CS-701 BigDataHadoop Unit-1
Document23 pages
CS-701 BigDataHadoop Unit-1
efsadf
No ratings yet
Jsaer2016 03 01 21 24
Document4 pages
Jsaer2016 03 01 21 24
jsaereditor
No ratings yet
Navigating Big Data Analytics: Strategies for the Quality Systems Analyst
From Everand
Navigating Big Data Analytics: Strategies for the Quality Systems Analyst
William D. Mawby
No ratings yet
Manual,: TAPS Traction Auxiliary Power Supply
Document43 pages
Manual,: TAPS Traction Auxiliary Power Supply
ElputoAmo XD
No ratings yet
Kahoot!: Kahoot! Property of STI Page 1 of 5
Document5 pages
Kahoot!: Kahoot! Property of STI Page 1 of 5
Klarisi Vidal
No ratings yet
Untitled
Document2 pages
Untitled
Mohitrajranikashyap
No ratings yet
Quick Charge Device List
Document22 pages
Quick Charge Device List
PuRe Sp3ctre
No ratings yet
Ivo Ngala
Document3 pages
Ivo Ngala
ashok
No ratings yet
Online Lawyers Case Management System
Document7 pages
Online Lawyers Case Management System
சக்திவேல் கந்தசாமி
No ratings yet
FM - Get - SNP - Qty
Document3 pages
FM - Get - SNP - Qty
munaf
No ratings yet
ROS Cheat Sheet Melodic
Document1 page
ROS Cheat Sheet Melodic
matiasgramos
No ratings yet
Low-Noise Block Downconverter: Satellite Dishes Satellite TV
Document12 pages
Low-Noise Block Downconverter: Satellite Dishes Satellite TV
Getachew Mekonnen
No ratings yet
Cisco Certified Devnet Associate Training and Certification Program
Document2 pages
Cisco Certified Devnet Associate Training and Certification Program
JOHN
No ratings yet
RNS Institute of Technology, Bengaluru: (AICTE Approved, VTU Affiliated, NAAC A' Grade Accredited)
Document4 pages
RNS Institute of Technology, Bengaluru: (AICTE Approved, VTU Affiliated, NAAC A' Grade Accredited)
Rakesh R
No ratings yet
Led TV : MFL718464322202REV01
Document25 pages
Led TV : MFL718464322202REV01
guja mate
No ratings yet
Getting Started Guide For IBM Rational Robot
Document3 pages
Getting Started Guide For IBM Rational Robot
geongeo
No ratings yet
Revised Techno Commercial Offer of CSSD Equipment, Grand Port Hospital, Wadala, 08.02.2022
Document20 pages
Revised Techno Commercial Offer of CSSD Equipment, Grand Port Hospital, Wadala, 08.02.2022
Pranali Mhatre
No ratings yet
Wireless Survey
Document272 pages
Wireless Survey
Joao P
No ratings yet
Cooling Problems and Thermal Issues in High Power Electronics - A Multi Faceted Design Approach
Document9 pages
Cooling Problems and Thermal Issues in High Power Electronics - A Multi Faceted Design Approach
iq option
No ratings yet
Esab Weldding Edw610d
Document19 pages
Esab Weldding Edw610d
Bassim Wagih
No ratings yet
Shivika Itt
Document36 pages
Shivika Itt
Shivika Garg
No ratings yet
Universal Analog Converter PDF
Document2 pages
Universal Analog Converter PDF
Margaret Daugherty
No ratings yet
Usability Best Practices
Document37 pages
Usability Best Practices
Mashal Pk
No ratings yet
Imaxem Profile - 2023
Document53 pages
Imaxem Profile - 2023
AHMED SAED
No ratings yet
Microstation Powerdraft: Drafting Software For Your Most Demanding Projects
Document2 pages
Microstation Powerdraft: Drafting Software For Your Most Demanding Projects
Annadasankar Bera
No ratings yet
101 Cdma Basics
Document34 pages
101 Cdma Basics
Abdul Khader
No ratings yet
Bahan Ajar OP-AMP Part 2
Document15 pages
Bahan Ajar OP-AMP Part 2
Hanif Alhafizh
No ratings yet
Revision History: Table of Contents
Document19 pages
Revision History: Table of Contents
imaarha
No ratings yet
SQL Data Definition: Database Systems Lecture 5 Natasha Alechina
Document26 pages
SQL Data Definition: Database Systems Lecture 5 Natasha Alechina
Rana Gaballah
No ratings yet
BIPUBLISHER
Document49 pages
BIPUBLISHER
Selim Tanrısever
No ratings yet
Fundamentals of Robot
Document11 pages
Fundamentals of Robot
Vishal
No ratings yet
Ersatzleilliste Hydraulikpumpe
Document3 pages
Ersatzleilliste Hydraulikpumpe
aliwa
No ratings yet