Professional Documents
Culture Documents
Improving Web Clustering by Cluster Selection: By-Vishal Rathore Regd. No. 0721215022 (+91) 9861084119
Improving Web Clustering by Cluster Selection: By-Vishal Rathore Regd. No. 0721215022 (+91) 9861084119
Improving Web Clustering by Cluster Selection: By-Vishal Rathore Regd. No. 0721215022 (+91) 9861084119
Web Clustering by
Cluster Selection
Web Search
Iterative Process
Solution
Identify and Present Implicit Clusters
3
Web Clustering
Search Results for: Jaguar 1 – 6 of 70,000,000
4. Jaguar
1.
Clusters Official worldwide
General information
web
from
siteBig
of Cats
Jaguar
Online.
Cars.
1. Car 6. Jaguar
2. Apple - --
MacDefenders
OS X of Wildlife
2. Animal The Apple
Size, appearance,
Mac OSlife
X product
span and
page.
diet.
Clean Pages
Rank/Select Clusters
6
car
5
mac os x
24
car model 10
12
Select Best N
9
Scores
Poor Cluster Quality Measure
Selection
Poor Coverage
Excessive Overlap
10
Incremental
Greedy
Look-ahead Protection
Evaluation Method
Gold Standard - Ideal Clustering
2 Searches and 2 Types of Input Data
Jaguar and Salsa
Snippets and Full Text
Precision
Cluster accuracy against the best matching ideal cluster
Recall
Coverage of ideal cluster in matched clusters
F-measure
Combination of precision and recall
16
Conclusions
ESTC has
A new cluster scoring
A new cluster selection algorithm
Future Work
Make improvements to other stages of STC
Particularly Combining Base Clusters