Professional Documents
Culture Documents
تعديل ch5
تعديل ch5
تعديل ch5
Chapter 5
Predictive Analytics II: Text,
Web, and Social Media
Analytics …
Knowledge
Feedback Feedback
The inputs to the process The output of Task 1 is a The output of Task 2 is a flat The output of Task 3 is a
include a variety of relevant collection of documents in file called term-document number of problem-specific
unstructured (and semi- some digitized format for matrix where the cells are classification, association,
structured) data sources such as computer processing populated with the term clustering models and
text, XML, HTML, etc. frequencies visualizations
Sentiment A statement
Analysis Step 1
Calculate the
Process Lexicon
O – S Polarity
Is there a
No sentiment? Yes
O–S
polarity
Yes measure
Step 2
Calculate the N – P N-P Polarity
Polarity of the
sentiment
Lexicon Record the Polarity,
Strength, and the
Target of the
Step 3 sentiment.
Step 4
Tabulate & aggregate
the sentiment
analysis results
Page Rank Information Retrieval Graph Mining Social Analytics Clickstream Analysis
Search Engine Optimization Social Network Analysis Social Media Analytics Weblog Analysis
Marketing Attribution Customer Analytics 360 Customer View Voice of the Customer
Metadata
Index
Cashed / Indexed
Documents DB
User Responding Cycle Development Cycle
World Wide Web
ed
tch Pro sse
d
Or Rank a
f M es ce c e s
de Document Document pro Page
red ed- to
Lis Pag Pag ssed n
Pa es U eb
ges Matcher/Ranker Indexer W
1. Development Cycle
– Web Crawler
is a piece of software that systematically browses (crawls through) the
World Wide Web for the purpose of finding and fetching Web pages.
– Document Indexer
The document indexer is responsible for processing the documents
(Web pages or document files) and placing them into the document
database.
2. Response Cycle
– Query Analyzer
– Document Matcher/Ranker
Slide 5-28 Copyright © 2018 Pearson Education Ltd.
Search Engine Optimization
• It is the intentional activity of affecting the visibility
of an e-commerce site or a Web site in a search
engine’s natural (unpaid or organic) search results
• Part of an Internet marketing strategy
• Based on knowing how a Search Engine works
– Content, HTML, keywords, external links, …
• Indexing based on …
– Webmaster submission of URL
– Proactively and continuously crawling the Web
Critics
Joiners
Collectors
Spectators
Inactives
Time
Slide 5-36 Copyright © 2018 Pearson Education Ltd.
End of Chapter 5
• Questions / Comments