Welcome to Scribd!

Specific Page Rank

Uploaded by

0% found this document useful (0 votes)

5 views6 pages

The document discusses topic-specific PageRank, which is a way to measure the popularity of web pages within a particular topic. It does this by biasing the random walk of the standard PageRank algorithm to preferentially teleport to a set of pages relevant to the topic. This allows search results to be ranked based on both popularity and topical relevance. The algorithm works by updating the teleportation step of PageRank to a topic-specific set of pages. Examples are given of how different teleport sets impact the resulting PageRank values.

Original Description:

Original Title

specific_page_rank

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

5 views6 pages

Specific Page Rank

Uploaded by

justin courtois

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 6

Search inside document

Mining of Massive Datasets

Leskovec, Rajaraman, and Ullman

Stanford University
Instead of generic popularity, can we
measure popularity within a topic?
Goal: Evaluate Web pages not just according
to their popularity, but by how close they are
to a particular topic, e.g. “sports” or “history”
Allows search queries to be answered based
on interests of the user
Example: Query “Trojan” wants different pages
depending on whether you are interested in
sports, history and computer security

J. Leskovec, A. Rajaraman, J. Ullman (Stanford University) Mining of Massive Datasets 63

Random walker has a small probability of
teleporting at any step
Teleport can go to:
Standard PageRank: Any page with equal probability
To avoid dead-end and spider-trap problems
Topic Specific PageRank: A topic-specific set of
“relevant” pages (teleport set)
Idea: Bias the random walk
When walker teleports, she pick a page from a set S
S contains only pages that are relevant to the topic
E.g., Open Directory (DMOZ) pages for a given topic/query
For each teleport set S, we get a different vector rS
J. Leskovec, A. Rajaraman, J. Ullman (Stanford University) Mining of Massive Datasets 64
To make this work all we need is to update the
teleportation part of the PageRank formulation:
= +( )/| | if
otherwise
A is stochastic!
We weighted all pages in the teleport set S equally
Could also assign different weights to pages!
Random Walk with Restart: S is a single element
Compute as for regular PageRank:
Multiply by M, then add a vector
Maintains sparseness

J. Leskovec, A. Rajaraman, J. Ullman (Stanford University) Mining of Massive Datasets 65

0.2 Suppose S = {1}, = 0.8
0.5 1 0.5
Node Iteration
0.4 0.4 0 1 2 stable
1 1 0.25 0.4 0.28 0.294
2 0.8
3 2 0.25 0.1 0.16 0.118
1 3 0.25 0.3 0.32 0.327
1
4 0.25 0.2 0.24 0.261
0.8 0.8

4 S={1,2,3,4}, =0.8:
r=[0.13, 0.10, 0.39, 0.36]
S={1}, =0.90: S={1,2,3} , =0.8:
r=[0.17, 0.07, 0.40, 0.36] r=[0.17, 0.13, 0.38, 0.30]
S={1} , =0.8: S={1,2} , =0.8:
r=[0.29, 0.11, 0.32, 0.26] r=[0.26, 0.20, 0.29, 0.23]
S={1}, =0.70: S={1} , =0.8:
r=[0.39, 0.14, 0.27, 0.19] r=[0.29, 0.11, 0.32, 0.26]
J. Leskovec, A. Rajaraman, J. Ullman (Stanford University) Mining of Massive Datasets 66
Create different PageRanks for different topics
The 16 DMOZ top-level categories:
arts, business, sports,…
Which topic ranking to use?
User can pick from a menu
Classify query into a topic
Can use the context of the query
E.g., query is launched from a web page talking about a
known topic
History of queries e.g., “basketball” followed by “Jordan”
User context, e.g., user’s bookmarks, …
J. Leskovec, A. Rajaraman, J. Ullman (Stanford University) Mining of Massive Datasets 67

Cisco Viptela System Properties - IP With Ease
Document2 pages
Cisco Viptela System Properties - IP With Ease
mvsnkrao
No ratings yet
Computer Network Answers
Document4 pages
Computer Network Answers
ammad ahmad
No ratings yet
Likelihood Function: Coin Landing Heads-Up
Document8 pages
Likelihood Function: Coin Landing Heads-Up
Ivan Carbalo
No ratings yet
Tarea 2
Document6 pages
Tarea 2
Luis Angel Arevalo Gomez
No ratings yet
Ejercicios Minima R
Document8 pages
Ejercicios Minima R
Jose Francisco Ovando Alvarado
No ratings yet
Ejercicios Minima R
Document8 pages
Ejercicios Minima R
Jose Francisco Ovando Alvarado
No ratings yet
Ejercicios MM
Document8 pages
Ejercicios MM
Jose Francisco Ovando Alvarado
No ratings yet
HW 2 CHAP 2 - Yepez
Document9 pages
HW 2 CHAP 2 - Yepez
Elvira
No ratings yet
Properties of Section Described by Nodes Coordinates - Girder End
Document5 pages
Properties of Section Described by Nodes Coordinates - Girder End
Alok Mehta
No ratings yet
EEE 4601 (Signals & Systems)
Document84 pages
EEE 4601 (Signals & Systems)
AO
No ratings yet
Data Pesawat 1
Document3 pages
Data Pesawat 1
Irma Rohmatillah
No ratings yet
Graphing Data
Document8 pages
Graphing Data
mpho
No ratings yet
Onda Sinusoidale
Document6 pages
Onda Sinusoidale
api-27553218
No ratings yet
Markov
Document61 pages
Markov
blahblah435
No ratings yet
Data Science 01 - Basics
Document52 pages
Data Science 01 - Basics
lomeroaia
No ratings yet
Pset 2 Solutions 3.044 2013: Problem 1
Document11 pages
Pset 2 Solutions 3.044 2013: Problem 1
Dak Kaiz
No ratings yet
Assignment 11
Document5 pages
Assignment 11
geetha megharaj
No ratings yet
Cluster Validation L25
Document25 pages
Cluster Validation L25
Roy Deep
No ratings yet
Microwave Optics (1) : Sleman Nabeel Sa'ad
Document10 pages
Microwave Optics (1) : Sleman Nabeel Sa'ad
Sleman Saad
No ratings yet
Reinforcement Learning: Part I - Definitions
Document26 pages
Reinforcement Learning: Part I - Definitions
jeanderiut
No ratings yet
Control Engineering-I Lab-1 Dated: 24-10-2007 1. What Is MATLAB
Document9 pages
Control Engineering-I Lab-1 Dated: 24-10-2007 1. What Is MATLAB
api-19807868
No ratings yet
Tabela de Lajes
Document9 pages
Tabela de Lajes
Karen Moreira
No ratings yet
11.cluster Validation PDF
Document37 pages
11.cluster Validation PDF
Tanmay Khandelwal
No ratings yet
Medical Image Steganography Using Nuclear Spin Generator 2020 Stoyanov
Document12 pages
Medical Image Steganography Using Nuclear Spin Generator 2020 Stoyanov
Sujith
No ratings yet
Mov en Una Dimension Graficas
Document4 pages
Mov en Una Dimension Graficas
Damarina
No ratings yet
Referat: Lucrarea de Laborator nr.3
Document7 pages
Referat: Lucrarea de Laborator nr.3
Mike Brash
No ratings yet
Geor Intro
Document24 pages
Geor Intro
Sílvio De Castro Silveira
No ratings yet
Engineers Guide To Matlab 3rd Edition Magrab Solutions Manual Full Chapter PDF
Document34 pages
Engineers Guide To Matlab 3rd Edition Magrab Solutions Manual Full Chapter PDF
anselmthangxu5eo0
100% (16)
Other Numerical Integration Examples
Document19 pages
Other Numerical Integration Examples
siswanto.mico
No ratings yet
Parallel
Document9 pages
Parallel
Sagar Rawal
No ratings yet
Sean Sithole Physics Cala A
Document4 pages
Sean Sithole Physics Cala A
Sean Sithole
No ratings yet
Engineers Guide To MATLAB 3rd Edition Magrab Solutions Manual instant download all chapter
Document36 pages
Engineers Guide To MATLAB 3rd Edition Magrab Solutions Manual instant download all chapter
bivahenje
100% (3)
Matriz de Rigidez Armadura - 3d Examen (Recuperado)
Document56 pages
Matriz de Rigidez Armadura - 3d Examen (Recuperado)
Andersson Aldair Ventura Fernández
No ratings yet
Chemical Kinetics Simulator: An Interactive Graphical Approach
Document8 pages
Chemical Kinetics Simulator: An Interactive Graphical Approach
nurul ismi
No ratings yet
Tabla de Frecuencias
Document14 pages
Tabla de Frecuencias
Ainer Shuan
No ratings yet
Lab 1
Document5 pages
Lab 1
Dafne
No ratings yet
FDMcode
Document9 pages
FDMcode
Viswanath Kapavarapu
No ratings yet
Practica 01 Lab.
Document8 pages
Practica 01 Lab.
Darla Fernanda Ledezma Romero
No ratings yet
EJEMPLO: Calcular Los Desplazamientos de La Siguiente Estructura
Document3 pages
EJEMPLO: Calcular Los Desplazamientos de La Siguiente Estructura
Alex Bustamante Lara
No ratings yet
Chapter 3 - Data Visualization Chapter 4 - Summary Statistics
Document38 pages
Chapter 3 - Data Visualization Chapter 4 - Summary Statistics
jay
No ratings yet
Solutions Manual For Matlab Based Electromagnetics by Branislav M Notaros 0132857944
Document3 pages
Solutions Manual For Matlab Based Electromagnetics by Branislav M Notaros 0132857944
ThomasVancemckn
98% (45)
DS-500 CCU Performance Analysis Via QS
Document7 pages
DS-500 CCU Performance Analysis Via QS
Ruslan Crab
No ratings yet
Matlab
Document57 pages
Matlab
mikyyeheyes
No ratings yet
Data Mining: Clustering Validation Minimum Description Length Information Theory Co-Clustering
Document67 pages
Data Mining: Clustering Validation Minimum Description Length Information Theory Co-Clustering
Arul Kumar Venugopal
No ratings yet
LSA, pLSA, and LDA Acronyms, Oh My!
Document114 pages
LSA, pLSA, and LDA Acronyms, Oh My!
Kausik Das
No ratings yet
Dti Fun On Scalar
Document11 pages
Dti Fun On Scalar
Fabian Scheipl
No ratings yet
Bending Beam Part 4 Result
Document5 pages
Bending Beam Part 4 Result
Syafiq Nasir
No ratings yet
Problem Set 1 Solution Numerical Methods
Document32 pages
Problem Set 1 Solution Numerical Methods
Ariyan Jahanyar
No ratings yet
물실1 03분반 2조 실험2
Document3 pages
물실1 03분반 2조 실험2
danso0429
No ratings yet
Sample Lab Report.1
Document5 pages
Sample Lab Report.1
Maooz Baloch
No ratings yet
WAFO - A MATLAB Toolbox For Random Waves and Loads: Sofia Aberg
Document21 pages
WAFO - A MATLAB Toolbox For Random Waves and Loads: Sofia Aberg
esttif02
No ratings yet
Ps Lab Expt 2
Document6 pages
Ps Lab Expt 2
Kaja Srinivas
No ratings yet
SPC Reinf.
Document59 pages
SPC Reinf.
niraj
No ratings yet
M (KG) LF (M) X (M) F (N) M: M (KG) t1(s) t2(s) t3(s) Tprom(s) T(S) T 2 (S 2)
Document2 pages
M (KG) LF (M) X (M) F (N) M: M (KG) t1(s) t2(s) t3(s) Tprom(s) T(S) T 2 (S 2)
DANIEL BERNAL GOMEZ
No ratings yet
Nash 1972
Document10 pages
Nash 1972
Dario
No ratings yet
Ao0c02088 Si 001
Document5 pages
Ao0c02088 Si 001
Lucas Palmeira
No ratings yet
Determination of Composition of Complexes Using Jobs Method (1) No
Document10 pages
Determination of Composition of Complexes Using Jobs Method (1) No
Ch Safdar Farukh
No ratings yet
Stat 890 Design of Computer Experiments
Document24 pages
Stat 890 Design of Computer Experiments
João Miguel
No ratings yet
Q1-Wave Equation Program
Document5 pages
Q1-Wave Equation Program
Ahmed Hwaidi
No ratings yet
Kaliti Roof EGA and LTZ Manual
Document17 pages
Kaliti Roof EGA and LTZ Manual
mihiretu Tefera
No ratings yet
Graphs and Tables of the Mathieu Functions and Their First Derivatives
From Everand
Graphs and Tables of the Mathieu Functions and Their First Derivatives
James C. Wiltse
No ratings yet
Summer Bridge Math, Grades 1 - 2
From Everand
Summer Bridge Math, Grades 1 - 2
Summer Bridge Activities
No ratings yet
Application To Meassuring Proximity in Graphs
Document8 pages
Application To Meassuring Proximity in Graphs
justin courtois
No ratings yet
How We Really Compute Pagerank
Document6 pages
How We Really Compute Pagerank
justin courtois
No ratings yet
Computing Pagerank On Big Graphs
Document9 pages
Computing Pagerank On Big Graphs
justin courtois
No ratings yet
Why Teleports Solve The Problem
Document8 pages
Why Teleports Solve The Problem
justin courtois
No ratings yet
Pagerank Power Iteration
Document6 pages
Pagerank Power Iteration
justin courtois
No ratings yet
Pagerank The Google Formulation
Document9 pages
Pagerank The Google Formulation
justin courtois
No ratings yet
Pagerank The Flow Formulation
Document6 pages
Pagerank The Flow Formulation
justin courtois
No ratings yet
Pagerank The Matrix Formulation
Document4 pages
Pagerank The Matrix Formulation
justin courtois
No ratings yet
Link Analysis and Pagerank
Document14 pages
Link Analysis and Pagerank
justin courtois
No ratings yet
IBM Endpoint Manager For Remote Control Installation Guide
Document144 pages
IBM Endpoint Manager For Remote Control Installation Guide
priteshj
No ratings yet
Ptstudyguide
Document1 page
Ptstudyguide
ANUJA
No ratings yet
30 Days To Niche Success
Document60 pages
30 Days To Niche Success
Prasanjeet Sikder
No ratings yet
UiPath Certified Professional - Automation Developer Associate Exam Description
Document9 pages
UiPath Certified Professional - Automation Developer Associate Exam Description
slavdani
No ratings yet
TNTR - IFHRMS - EMPLOYEES - PAYSLIP - USER MANUAL - ENGLISH - VERSION 1.1
Document15 pages
TNTR - IFHRMS - EMPLOYEES - PAYSLIP - USER MANUAL - ENGLISH - VERSION 1.1
uvaaraj
No ratings yet
W3schools: HTML5 Introduction
Document7 pages
W3schools: HTML5 Introduction
anon_723287062
No ratings yet
CSE1004 - Lab (G1+TG1 Slot) Assignment - 1: Submitted By: Submitted To: Abhishek Kandel (19BCE2629) Dr. Santhi H
Document13 pages
CSE1004 - Lab (G1+TG1 Slot) Assignment - 1: Submitted By: Submitted To: Abhishek Kandel (19BCE2629) Dr. Santhi H
theonlygod
No ratings yet
Intrusion Detection Method Based On SMOTE Transformation For Smart Grid Cybersecurity
Document6 pages
Intrusion Detection Method Based On SMOTE Transformation For Smart Grid Cybersecurity
Awishka Thuduwage
No ratings yet
3270-Article Text-11695-1-10-20210125
Document9 pages
3270-Article Text-11695-1-10-20210125
rido rido
No ratings yet
Log Mine
Document10 pages
Log Mine
Nischitha Chinnari
No ratings yet
Create 3D Boxes in Corel Draw PDF
Document37 pages
Create 3D Boxes in Corel Draw PDF
Rivai Ahmad
100% (1)
RAIDIX 4.7.2-5.0.1.3 Update Manual
Document11 pages
RAIDIX 4.7.2-5.0.1.3 Update Manual
Rufat Ibragimov
No ratings yet
Lecture 4-Network Topologies
Document17 pages
Lecture 4-Network Topologies
Nikhil Gaddam
No ratings yet
Soft Util - Nirsoft-Net
Document5 pages
Soft Util - Nirsoft-Net
N.E.G.
No ratings yet
3 Vivares v. St. Theresa's College
Document15 pages
3 Vivares v. St. Theresa's College
Francis Leo Tianero
No ratings yet
Tagging An Existing PDF in Adobe Acrobat DC
Document10 pages
Tagging An Existing PDF in Adobe Acrobat DC
Demian
No ratings yet
Caf-Sbo Action Plan
Document2 pages
Caf-Sbo Action Plan
Kimberly Rose Narciso
No ratings yet
Onenote 2010 Tutorial: Getting Started Guide
Document9 pages
Onenote 2010 Tutorial: Getting Started Guide
Neyer Leonel Vargas Padilla
No ratings yet
DBMS Assignment 4
Document13 pages
DBMS Assignment 4
Rohit Chavan
No ratings yet
iDS-7200HUHI-M2/S SERIES Turbo Acusense DVR: Key Feature
Document5 pages
iDS-7200HUHI-M2/S SERIES Turbo Acusense DVR: Key Feature
Firmansyah Cahaya Timoer
No ratings yet
Next Generation: Content Analytics Machine Learning Intelligent Capture
Document52 pages
Next Generation: Content Analytics Machine Learning Intelligent Capture
Adrian Pelivan
No ratings yet
Learn With AngularJS - Bootstrap - and ColdFusion - Jeffry Houser 155 PDF
Document155 pages
Learn With AngularJS - Bootstrap - and ColdFusion - Jeffry Houser 155 PDF
TrurlScribd
No ratings yet
NetNumen CORBA Interface Performance Indices
Document25 pages
NetNumen CORBA Interface Performance Indices
kamalgaihre
No ratings yet
Mos Word 2016 Objective
Document12 pages
Mos Word 2016 Objective
Đình Toàn
No ratings yet
Product Spec DH 725 Jan.2019
Document11 pages
Product Spec DH 725 Jan.2019
jun amaro
No ratings yet
Nat Practice Lab
Document1 page
Nat Practice Lab
aennek
No ratings yet
Explain How Dedicated Lines Communication Systems Are Used To Support The Internet - Daksha
Document6 pages
Explain How Dedicated Lines Communication Systems Are Used To Support The Internet - Daksha
Nishidanee Kalloo Faugoo
No ratings yet
Social Media Presentation-Nextdoor App-Edt180 1
Document14 pages
Social Media Presentation-Nextdoor App-Edt180 1
api-506651646
No ratings yet