Professional Documents
Culture Documents
Mini Project
Mini Project
Mini Project
ra s
v i ve e w
D e - C o f N
id e a la ig h
n g o
a t h
E
u m t H g i a n S
G .K a e n a g s w - C
ic l T V i a r
. M Effi c ia e s ta l ye
rs r : S o
ti c l ka na
M ge a r e n F i
ag .V h.
hT CH ec
a s B .T
H
Timeline flow
1.Data Collection
2. The choice of
Modelling
Data Collection :
Streaming
Search Search k
Search Live Tweets
past most
recent fromTwitter
articles similar Streaming
tweets from articles API
clusters
Algorithm 2. ColdStart1: Searching Recent Tweets
Method:
Ta_search = { } ;
w = 60 * 12 //Timewindow for past tweets: 12h
Tw = SubsetTweets(D,w) //Retrieve all tweets within time window w.
for tweet belongs Tw do
for phrase belongs Qa do
If phrase belongs tweet then
//If tweet contains the keyphrase phrase.
Ta_search = Ta_search U (tweet)
break
Return Ta_search ;
The Choice of Modelling includes the following steps :