PageRank_2021

The document discusses PageRank, a link analysis algorithm used to rank web pages based on their importance determined by link structures. It covers various aspects of graph data, challenges in web search, and the mathematical formulation of PageRank, including its eigenvector formulation and the power iteration method for computation. The document emphasizes the significance of links as votes and the recursive nature of determining page importance.

Uploaded by

pvnkhanh.sdh242

PageRank

Thoai Nam
High Performance Computing Lab (HPC Lab)
Faculty of Computer Science and Technology
HCMC University of Technology
HPC Lab-CSE-HCMUT 1
PageRank
§ Applications
§ PageRank formulation
§ Google PageRank

Graph Data: Social Networks

Facebook social graph

4-degrees of separation [Backstrom-Boldi-Rosa-Ugander-Vigna, 2011]

Graph Data: Media Networks

Connections between political blogs

Polarization of the network [Adamic-Glance, 2005]

Graph Data: Information Nets

Citation networks and Maps of science

[Börner et al., 2012]

Graph Data: Communication Nets

[Figure: the Internet as a graph, with routers connecting domain1, domain2, and domain3]

Graph Data: Technological Networks

Seven Bridges of Königsberg

[Euler, 1735]
Return to the starting point by traveling each link of the graph once and only once.

Web as a Directed Graph

Broad Question

§ How to organize the Web?

§ First try: Human-curated Web directories
o Yahoo, DMOZ, LookSmart

§ Second try: Web Search
o Information Retrieval investigates: find relevant docs in a small and trusted set
• Newspaper articles, patents, etc.
o But: the Web is huge, full of untrusted documents, random things, web spam, etc.

Web Search: 2 Challenges

2 challenges of web search:

(1) The Web contains many sources of information: who to “trust”?
• Trick: Trustworthy pages may point to each other!

(2) What is the “best” answer to the query “newspaper”?
• No single right answer
• Trick: Pages that actually know about newspapers might all be pointing to many newspapers

Ranking Nodes on the Graph

§ Not all web pages are equally “important”
o www.new-page.com vs. www.stanford.edu
§ There is large diversity in the web-graph node connectivity
§ Let’s rank the pages by the link structure!

Link Analysis Algorithms
§ We will cover the following Link Analysis approaches for computing the importance of nodes in a graph:
o PageRank
o Topic-Specific (Personalized) PageRank
o Web Spam Detection Algorithms

Links as Votes
§ Idea: Links as votes
o A page is more important if it has more links
• In-coming links? Out-going links?

§ Think of in-links as votes:
o www.stanford.edu has 23,400 in-links
o www.new-page.com has 1 in-link

§ Are all in-links equal?
o Links from important pages count more
o Recursive question!

Intuition (1)
§ Web pages are important if people visit them a lot
§ But we can’t watch everybody using the Web
§ A good surrogate for visiting pages is to assume people follow links randomly
§ Leads to the random surfer model:
o Start at a random page and follow random out-links repeatedly, from whatever page you are at
o PageRank = limiting probability of being at a page

[Figure: four-page graph with nodes a, b, c, d. Sample walk: a → b → d → a → c → d → ... Visit counts so far: a = 2, b = 1, c = 1, d = 2]

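The random surfer model can be tried out with a short simulation. The sketch below is only an illustration: the edge set is an assumption chosen to be consistent with the sample walk a → b → d → a → c → d (the actual figure may contain other links), and `random_surfer` is a hypothetical helper name.

```python
import random
from collections import Counter

# Hypothetical 4-page graph, consistent with the sample walk in the slide;
# the real figure may contain additional links.
out_links = {"a": ["b", "c"], "b": ["d"], "c": ["d"], "d": ["a"]}

def random_surfer(out_links, steps, seed=0):
    """Follow random out-links for `steps` steps and return visit frequencies."""
    rng = random.Random(seed)
    page = rng.choice(sorted(out_links))    # start at a random page
    visits = Counter()
    for _ in range(steps):
        visits[page] += 1
        page = rng.choice(out_links[page])  # follow a random out-link
    return {p: visits[p] / steps for p in sorted(out_links)}

freq = random_surfer(out_links, steps=30_000)
for page, share in freq.items():
    print(page, round(share, 3))
```

On this assumed graph the long-run visit frequencies approach 1/3 for a and d and 1/6 for b and c, which is the limiting probability that the slide calls PageRank.
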
Intuition (2)
§ Solve the recursive equation: “importance of a page = its share of the importance of each of its predecessor pages”
o Equivalent to the random-surfer definition of PageRank
§ Technically, importance = the principal eigenvector of the transition matrix of the Web
o A few fix-ups needed

NOTE: $x$ is an eigenvector with the corresponding eigenvalue $\lambda$ if $A x = \lambda x$

Example: PageRank Scores

[Figure: example graph with PageRank scores. A: 3.3, B: 38.4, C: 34.3, D: 3.9, E: 8.1, F: 3.9; five unlabeled nodes with 1.6 each]

Simple Recursive Formulation
§ Each link’s vote is proportional to the importance of its source page
§ If page j with importance rj has n out-links, each link gets rj / n votes
§ Page j’s own importance is the sum of the votes on its in-links

[Figure: page i (3 out-links) and page k (4 out-links) both link to page j; page j has 3 out-links, each carrying rj/3]
rj = ri/3 + rk/4

PageRank: The “Flow” Model
§ A “vote” from an important page is worth more
§ A page is important if it is pointed to by other important pages
§ Define a “rank” rj for page j:
$r_j = \sum_{i \to j} \frac{r_i}{d_i}$, where $d_i$ is the out-degree of node $i$

[Figure: three-page graph with nodes y, a, m. y links to itself and to a (y/2 each), a links to y and to m (a/2 each), m links to a (m)]

“Flow” equations:
ry = ry/2 + ra/2
ra = ry/2 + rm
rm = ra/2

Solving the Flow Equations
Flow equations:
ry = ry/2 + ra/2
ra = ry/2 + rm
rm = ra/2

§ 3 equations, 3 unknowns, no constants
o No unique solution
o All solutions are equivalent modulo the scale factor
§ An additional constraint forces uniqueness:
o ry + ra + rm = 1
o Solution: ry = 2/5, ra = 2/5, rm = 1/5
§ Gaussian elimination works for small examples, but we need a better method for large web-size graphs
§ We need a new formulation!

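For a graph this small, the flow equations can be solved exactly by Gaussian elimination. The sketch below is one way to do it, not the lecture's own code: `solve` is a hypothetical helper, and replacing the third (redundant) flow equation with the normalization constraint is a choice made here for illustration.

```python
from fractions import Fraction as F

def solve(a, b):
    """Solve the square system a·x = b by Gauss-Jordan elimination (exact)."""
    n = len(a)
    rows = [list(map(F, row)) + [F(rhs)] for row, rhs in zip(a, b)]
    for col in range(n):
        # Find a row with a nonzero pivot and move it into place.
        pivot = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        rows[col] = [v / rows[col][col] for v in rows[col]]
        for r in range(n):
            if r != col:
                factor = rows[r][col]
                rows[r] = [v - factor * p for v, p in zip(rows[r], rows[col])]
    return [row[n] for row in rows]

half = F(1, 2)
# Unknowns (r_y, r_a, r_m). The three flow equations are linearly dependent,
# so the third one is replaced by the constraint r_y + r_a + r_m = 1.
a = [
    [half, -half, 0],   # r_y - (r_y/2 + r_a/2) = 0
    [-half, 1, -1],     # r_a - (r_y/2 + r_m)   = 0
    [1, 1, 1],          # r_y + r_a + r_m       = 1
]
r_y, r_a, r_m = solve(a, [0, 0, 1])
print(r_y, r_a, r_m)    # 2/5 2/5 1/5
```

The dropped equation rm = ra/2 is satisfied by the result as well, which confirms that it carried no extra information.
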
PageRank: Matrix formulation
§ Stochastic adjacency matrix $M$
o Let page $i$ have $d_i$ out-links
o If $i \to j$, then $M_{ji} = 1/d_i$, else $M_{ji} = 0$
o $M$ is a column stochastic matrix: columns sum to 1
§ Rank vector $r$: a vector with one entry per page
o $r_i$ is the importance score of page $i$
o $\sum_i r_i = 1$
§ The flow equations can be written
$r = M \cdot r$

[Figure: row j of M (the in-coming links of page j) multiplied by r gives r_j; column i of M holds the out-going links of page i]

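The definition of M translates directly into code. The following is a minimal sketch, assuming the graph is given as a dictionary of out-link lists without duplicate links; `stochastic_matrix` is a hypothetical helper, and the y/a/m edges are read off the flow equations.

```python
def stochastic_matrix(out_links, pages):
    """Return M as a list of rows, with M[j][i] = 1/d_i if page i links to page j."""
    index = {p: k for k, p in enumerate(pages)}
    n = len(pages)
    m = [[0.0] * n for _ in range(n)]
    for src, dests in out_links.items():
        for dst in dests:
            m[index[dst]][index[src]] = 1.0 / len(dests)
    return m

# The y/a/m graph: y -> {y, a}, a -> {y, m}, m -> {a}.
pages = ["y", "a", "m"]
out_links = {"y": ["y", "a"], "a": ["y", "m"], "m": ["a"]}
M = stochastic_matrix(out_links, pages)

# Column i sums the shares that page i hands out, so every column sums to 1.
column_sums = [sum(M[j][i] for j in range(len(pages))) for i in range(len(pages))]
print(M)
print(column_sums)
```

Storing the out-links of page i in column i (not row i) is what makes M column stochastic, so that r = M·r reproduces the flow equations row by row.
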
Eigenvector Formulation
• The flow equations can be written $r = M \cdot r$
• So the rank vector r is an eigenvector of the stochastic web matrix M
• In fact, it is its first or principal eigenvector, with corresponding eigenvalue 1
• The largest eigenvalue of M is 1 since M is column stochastic (with non-negative entries)
• We know r is unit length (in the L1 norm) and each column of M sums to one, so $\|M \cdot r\|_1 \le 1$
• We can now efficiently solve for r! The method is called Power iteration

Example: Flow Equations & M

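[Figure: graph with pages y, a, m; y links to y and a, a links to y and m, m links to a]

M (column = source page, row = destination page):
     y    a    m
y    ½    ½    0
a    ½    0    1
m    0    ½    0

Flow equations:
ry = ry/2 + ra/2
ra = ry/2 + rm
rm = ra/2

In matrix form, r = M·r:
$$\begin{bmatrix} r_y \\ r_a \\ r_m \end{bmatrix} = \begin{bmatrix} 1/2 & 1/2 & 0 \\ 1/2 & 0 & 1 \\ 0 & 1/2 & 0 \end{bmatrix} \begin{bmatrix} r_y \\ r_a \\ r_m \end{bmatrix}$$
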
Eigenvector formulation
§ The flow equations can be written $r = M \cdot r$
§ So the rank vector r is an eigenvector of the stochastic web matrix M
o Starting from any stochastic vector u, the limit M(M(... M(M u))) is the long-term distribution of the surfers
o The math: limiting distribution = principal eigenvector of M = PageRank
o Note: If r is the limit of MM...Mu, then r satisfies the equation r = M·r, so r is an eigenvector of M with eigenvalue 1
§ We can now efficiently solve for r! The method is called Power iteration

Power iteration method
§ Given a web graph with N nodes, where the nodes are pages and the edges are hyperlinks
§ Power iteration: a simple iterative scheme
• Initialize: $r^{(0)} = [1/N, \ldots, 1/N]^T$
• Iterate: $r^{(t+1)} = M \cdot r^{(t)}$, that is, $r_j^{(t+1)} = \sum_{i \to j} r_i^{(t)} / d_i$, where $d_i$ is the out-degree of node $i$
• Stop when $|r^{(t+1)} - r^{(t)}|_1 < \varepsilon$
§ $|x|_1 = \sum_{1 \le i \le N} |x_i|$ is the L1 norm; any other vector norm, e.g. Euclidean, can be used
§ About 50 iterations are sufficient to estimate the limiting solution

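The scheme above fits in a few lines. This is a sketch on a dense list-of-rows matrix, which is only practical for tiny graphs; the function name, the tolerance, and the iteration cap are illustrative choices, not values from the lecture.

```python
def power_iteration(m, eps=1e-10, max_iter=1000):
    """Iterate r <- M·r from the uniform vector until the L1 change is below eps."""
    n = len(m)
    r = [1.0 / n] * n
    for t in range(1, max_iter + 1):
        r_next = [sum(m[j][i] * r[i] for i in range(n)) for j in range(n)]
        if sum(abs(x - y) for x, y in zip(r_next, r)) < eps:
            return r_next, t
        r = r_next
    return r, max_iter

# Column-stochastic matrix of the y/a/m example (order y, a, m).
M = [[0.5, 0.5, 0.0],
     [0.5, 0.0, 1.0],
     [0.0, 0.5, 0.0]]
r, iterations = power_iteration(M)
print([round(x, 4) for x in r], iterations)
```

On this example the iterates approach (2/5, 2/5, 1/5), the same solution that the flow equations give. The tight tolerance used here needs more iterations than a looser, practical threshold would.
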
PageRank: How to solve?

The Stationary Distribution (1)

The Stationary Distribution (2)

Existence and Uniqueness
§ A central result from the theory of random walks (a.k.a. Markov processes):
For graphs that satisfy certain conditions, the stationary distribution is unique and will eventually be reached no matter what the initial probability distribution at time t = 0 is.

PageRank: The Google Formulation

PageRank: Three Questions

$r_j^{(t+1)} = \sum_{i \to j} \frac{r_i^{(t)}}{d_i}$, or equivalently $r = M r$

• Does this converge?
• Does it converge to what we want?
• Are results reasonable?

Does this converge?

[Figure: two pages a and b, with a → b and b → a]
$r_j^{(t+1)} = \sum_{i \to j} \frac{r_i^{(t)}}{d_i}$

• Example:
Iteration   0   1   2   3   …
ra          1   0   1   0
rb          0   1   0   1

Does it converge to what we want?

[Figure: two pages a and b, with a → b; b has no out-links]
$r_j^{(t+1)} = \sum_{i \to j} \frac{r_i^{(t)}}{d_i}$

• Example:
Iteration   0   1   2   3   …
ra          1   0   0   0
rb          0   1   0   0

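Both two-page examples can be replayed numerically. The sketch below assumes the edge sets implied by the iteration tables (a ↔ b for the oscillating case, and a → b with b a dead end for the leaking case); `step` and `trajectory` are hypothetical helper names.

```python
def step(m, r):
    """One power-iteration step r <- M·r."""
    n = len(r)
    return [sum(m[j][i] * r[i] for i in range(n)) for j in range(n)]

def trajectory(m, r, steps):
    """Return [r(0), r(1), ..., r(steps)]."""
    history = [r]
    for _ in range(steps):
        history.append(step(m, history[-1]))
    return history

# Rows and columns are ordered (a, b); column i holds the out-links of page i.
cycle = [[0.0, 1.0],      # a -> b and b -> a
         [1.0, 0.0]]
dead_end = [[0.0, 0.0],   # a -> b, and b has no out-links
            [1.0, 0.0]]

oscillating = trajectory(cycle, [1.0, 0.0], 3)
leaking = trajectory(dead_end, [1.0, 0.0], 3)
print(oscillating)   # the score keeps swapping between a and b
print(leaking)       # the score disappears after two steps
```

In the first case the iteration never settles; in the second it converges, but to the all-zero vector, because the dead-end column of the matrix sums to 0 instead of 1.
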
PageRank problems
(1) Dead ends: Some pages have no out-links
o Random walk has “nowhere” to go to
o Such pages cause importance to “leak out”
(2) Spider traps: groups of pages whose out-links all stay within the group
o Random walk gets “stuck” in a trap
o And eventually spider traps absorb all importance

[Figure: example graph with a dead end and a spider trap marked]

Problem: Spider Traps

Solution: Teleports
§ The Google solution for spider traps: At each time step, the random surfer
has two options
• With prob. β, follow a link at random
• With prob. 1-β, jump to some random page
• β is typically in the range 0.8 to 0.9
§ Surfer will teleport out of spider trap within a few time steps

Problem: Dead Ends

Solution: Always Teleport!
§ Teleports: Follow random teleport links with probability 1.0 from dead-ends
• Adjust matrix accordingly

[Figure: the y/a/m graph with m as a dead end, before and after adding teleport links out of m]

Before:
     y    a    m
y    ½    ½    0
a    ½    0    0
m    0    ½    0

After:
     y    a    m
y    ½    ½    ⅓
a    ½    0    ⅓
m    0    ½    ⅓

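The matrix adjustment amounts to replacing every all-zero column with a uniform column. A minimal sketch on a dense matrix follows; `fix_dead_ends` is a hypothetical name, and a web-scale implementation would not materialize these columns.

```python
def fix_dead_ends(m):
    """Replace every all-zero column of M with a uniform 1/N column."""
    n = len(m)
    fixed = [row[:] for row in m]
    for i in range(n):
        if all(m[j][i] == 0 for j in range(n)):   # page i has no out-links
            for j in range(n):
                fixed[j][i] = 1.0 / n
    return fixed

# Order (y, a, m); here m is a dead end, so its column is all zeros.
M = [[0.5, 0.5, 0.0],
     [0.5, 0.0, 0.0],
     [0.0, 0.5, 0.0]]
M_fixed = fix_dead_ends(M)
print(M_fixed)
```

After the fix every column sums to 1 again, so the matrix is column stochastic and the earlier assumptions hold.
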
Why Teleports Solve the Problem?
Why are dead-ends and spider traps a problem
and why do teleports solve the problem?
§ Spider-traps are not a problem, but with traps PageRank scores
are not what we want
o Solution: Never get stuck in a spider trap by teleporting out of it in a finite
number of steps
§ Dead-ends are a problem
o The matrix is not column stochastic so our initial assumptions are not met
o Solution: Make matrix column stochastic by always teleporting when there
is nowhere else to go

Solution: Random Teleports

The Google matrix
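§ PageRank equation [Brin-Page, ‘98]:
$r_j = \sum_{i \to j} \beta \frac{r_i}{d_i} + (1 - \beta) \frac{1}{N}$
§ The Google Matrix A:
$A = \beta M + (1 - \beta) \left[ \frac{1}{N} \right]_{N \times N}$
where $[1/N]_{N \times N}$ is the N by N matrix in which all entries are 1/N
§ We have a recursive problem: $r = A \cdot r$, and the Power method still works!
§ What is β?
o In practice β = 0.8 or 0.9 (with β = 0.8, a jump every 5 steps on average)
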
Random Teleports (β = 0.8)
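[Figure: the y/a/m graph with m as a spider trap (m links only to itself); edges are labeled with the entries of A]

A = 0.8·M + 0.2·[1/N]NxN:
$$0.8 \begin{bmatrix} 1/2 & 1/2 & 0 \\ 1/2 & 0 & 0 \\ 0 & 1/2 & 1 \end{bmatrix} + 0.2 \begin{bmatrix} 1/3 & 1/3 & 1/3 \\ 1/3 & 1/3 & 1/3 \\ 1/3 & 1/3 & 1/3 \end{bmatrix} = \begin{bmatrix} 7/15 & 7/15 & 1/15 \\ 7/15 & 1/15 & 1/15 \\ 1/15 & 7/15 & 13/15 \end{bmatrix}$$

Power iteration on A:
Iteration   0      1      2      3      …   limit
y           1/3    0.33   0.28   0.26       7/33
a           1/3    0.20   0.20   0.18       5/33
m           1/3    0.47   0.52   0.56       21/33
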
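The worked example can be checked with exact fractions. This is a sketch for the 3-page case only; `google_matrix` and `iterate` are hypothetical helper names, and 60 iterations is an arbitrary count that is more than enough for this small example.

```python
from fractions import Fraction as F

def google_matrix(m, beta):
    """A = beta * M + (1 - beta) * [1/N], computed entry by entry."""
    n = len(m)
    return [[beta * m[j][i] + (1 - beta) / n for i in range(n)] for j in range(n)]

def iterate(a, r, steps):
    """Apply r <- A·r the given number of times."""
    for _ in range(steps):
        r = [sum(a[j][i] * r[i] for i in range(len(r))) for j in range(len(r))]
    return r

half = F(1, 2)
# Order (y, a, m); m links only to itself, i.e. it is a spider trap.
M = [[half, half, 0],
     [half, 0,    0],
     [0,    half, 1]]
A = google_matrix(M, F(4, 5))

r1 = iterate(A, [F(1, 3)] * 3, 1)
r_limit = [float(x) for x in iterate(A, [F(1, 3)] * 3, 60)]
print(A[0], A[1], A[2])
print(r1)
print([round(x, 4) for x in r_limit])
```

The first step gives (1/3, 1/5, 7/15), and the iterates approach (7/33, 5/33, 21/33): the spider trap m still collects the largest score, but no longer all of it.
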
How do we actually compute the PageRank?

Computing Page Rank
§ Key step is matrix-vector multiplication
• rnew = A · rold
§ Easy if we have enough main memory to hold A, rold, rnew
§ Say N = 1 billion pages
• We need 4 bytes for each entry (say)
• 2 billion entries for vectors, approx 8GB
• Matrix A has N² entries
• 10^18 is a large number!

A = β·M + (1-β)·[1/N]NxN, for example with β = 0.8:
$$A = 0.8 \begin{bmatrix} 1/2 & 1/2 & 0 \\ 1/2 & 0 & 0 \\ 0 & 1/2 & 1 \end{bmatrix} + 0.2 \begin{bmatrix} 1/3 & 1/3 & 1/3 \\ 1/3 & 1/3 & 1/3 \\ 1/3 & 1/3 & 1/3 \end{bmatrix} = \begin{bmatrix} 7/15 & 7/15 & 1/15 \\ 7/15 & 1/15 & 1/15 \\ 1/15 & 7/15 & 13/15 \end{bmatrix}$$

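The memory estimate is simple arithmetic, spelled out below. The 4-byte entry size is the slide's own assumption; using decimal units (1 GB = 10^9 bytes) is an assumption made here.

```python
N = 10**9                 # number of pages, as in the slide
BYTES_PER_ENTRY = 4       # assumed size of one stored value

vector_bytes = 2 * N * BYTES_PER_ENTRY      # r_old and r_new together
dense_entries = N ** 2                      # entries in the dense matrix A
dense_bytes = dense_entries * BYTES_PER_ENTRY

print(f"two rank vectors: {vector_bytes / 10**9:.0f} GB")
print(f"dense A: {dense_entries:.0e} entries, {dense_bytes / 10**18:.0f} EB")
```

Since the teleport term makes every entry of A nonzero, A cannot be stored as is; the sparse matrix M has to be used instead.
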
Rearranging the Equation

Sparse Matrix Formulation

PageRank: The Complete Algorithm

Sparse Matrix Encoding
§ Encode sparse matrix using only nonzero entries
• Space proportional roughly to number of links
• Say 10N, or 4*10*1 billion = 40GB
• Still won’t fit in memory, but will fit on disk

source node | degree | destination nodes
0           | 3      | 1, 5, 7
1           | 5      | 17, 64, 113, 117, 245
2           | 2      | 13, 23

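The encoding in the table can be mirrored by a list of records. The sketch below assumes an in-memory list of tuples standing in for the on-disk format; `entry` is a hypothetical lookup helper that recovers a single matrix entry without building the dense matrix.

```python
# Each record is (source node, out-degree, destination nodes), mirroring the
# three rows shown in the table.
sparse_m = [
    (0, 3, [1, 5, 7]),
    (1, 5, [17, 64, 113, 117, 245]),
    (2, 2, [13, 23]),
]

def entry(sparse_m, j, i):
    """Return M[j][i] = 1/d_i if i links to j, else 0."""
    for src, degree, dests in sparse_m:
        if src == i:
            return 1.0 / degree if j in dests else 0.0
    return 0.0   # no record for i: no out-links stored

# Two numbers (source, degree) plus one number per link for every record.
stored_values = sum(2 + len(dests) for _, _, dests in sparse_m)
print(entry(sparse_m, 5, 0), entry(sparse_m, 2, 0), stored_values)
```

The number of stored values grows with the number of links, not with N², which is the point of the encoding.
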
Basic Algorithm: Update Step
§ Assume enough RAM to fit rnew into memory
o Store rold and matrix M on disk
§ 1 step of power-iteration is:
Initialize all entries of rnew = (1-β) / N
For each page i (of out-degree di):
    Read into memory: i, di, dest1, …, destdi, rold(i)
    For j = 1…di:
        rnew(destj) += β rold(i) / di

[Figure: rnew and rold are vectors indexed 0 to 6; M is stored as the table below]
source | degree | destination
0      | 3      | 1, 5, 6
1      | 4      | 17, 64, 113, 117
2      | 2      | 13, 23

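The update step maps almost line by line onto Python. The sketch below keeps everything in memory for simplicity (the slide stores rold and M on disk) and assumes a graph without dead ends; the small test graph is the y/a/m spider-trap example from the β = 0.8 slide, with y = 0, a = 1, m = 2.

```python
def update_step(sparse_m, r_old, beta):
    """One power-iteration step: r_new = beta * M * r_old + (1 - beta) / N."""
    n = len(r_old)
    r_new = [(1.0 - beta) / n] * n            # teleport share for every page
    for src, degree, dests in sparse_m:       # one sequential pass over M
        share = beta * r_old[src] / degree
        for dst in dests:
            r_new[dst] += share
    return r_new

# (source, degree, destinations) records for y -> {y, a}, a -> {y, m}, m -> {m}.
sparse_m = [(0, 2, [0, 1]), (1, 2, [0, 2]), (2, 1, [2])]
r = [1 / 3] * 3
for _ in range(60):
    r = update_step(sparse_m, r, beta=0.8)
print([round(x, 4) for x in r])
```

The result approaches (7/33, 5/33, 21/33), the same limit as the dense Google matrix, although the dense matrix A is never formed.
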
Analysis
§ Assume enough RAM to fit rnew into memory
• Store rold and matrix M on disk
§ In each iteration, we have to:
• Read rold and M
• Write rnew back to disk
• Cost per iteration of Power method:
= 2|r| + |M|
§ Question:
• What if we could not even fit rnew in memory?

Block-based Update Algorithm

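[Figure: rnew (entries 0 to 5) is split into blocks; rold (entries 0 to 5) and M are scanned once per block. M is stored as the table below]
src | degree | destination
0   | 4      | 0, 1, 3, 5
1   | 2      | 0, 5
2   | 2      | 3, 4

§ Break rnew into k blocks that fit in memory
§ Scan M and rold once for each block
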
Analysis of Block Update
§ Similar to nested-loop join in databases
• Break rnew into k blocks that fit in memory
• Scan M and rold once for each block
§ Total cost:
• k scans of M and rold
• Cost per iteration of Power method:
k(|M| + |r|) + |r| = k|M| + (k+1)|r|
§ Can we do better?
• Hint: M is much bigger than r (approx 10-20x), so we must avoid
reading it k times per iteration

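The block-based update can be sketched as follows. Lists in memory stand in for the on-disk M and rold, `block_update` is a hypothetical name, and the records are the ones from the block-based update table (pages 3 to 5 have no records there, so the scores in this fragment do not sum to 1).

```python
def block_update(sparse_m, r_old, beta, k):
    """Compute r_new in at most k blocks; M and r_old are scanned once per block."""
    n = len(r_old)
    block_size = -(-n // k)                    # ceiling division
    r_new, scans = [], 0
    for start in range(0, n, block_size):
        end = min(start + block_size, n)
        block = [(1.0 - beta) / n] * (end - start)   # only this block is "in RAM"
        for src, degree, dests in sparse_m:          # full scan of M and r_old
            for dst in dests:
                if start <= dst < end:
                    block[dst - start] += beta * r_old[src] / degree
        scans += 1
        r_new.extend(block)                          # stands in for writing to disk
    return r_new, scans

sparse_m = [(0, 4, [0, 1, 3, 5]), (1, 2, [0, 5]), (2, 2, [3, 4])]
r_old = [1 / 6] * 6
r_blocked, scans = block_update(sparse_m, r_old, beta=0.8, k=3)
r_single, _ = block_update(sparse_m, r_old, beta=0.8, k=1)
print(scans, [round(x, 4) for x in r_blocked])
```

The blocked result equals the single-pass result; the price is `scans` full reads of M, which is the k|M| term in the cost formula.
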
Block-Stripe Update Algorithm
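[Figure: rnew is split into blocks {0, 1}, {2, 3}, {4, 5}; rold has entries 0 to 5; M is split into one stripe per block]

Stripe for rnew block {0, 1}:
src | degree | destination
0   | 4      | 0, 1
1   | 3      | 0
2   | 2      | 1

Stripe for rnew block {2, 3}:
src | degree | destination
0   | 4      | 3
2   | 2      | 3

Stripe for rnew block {4, 5}:
src | degree | destination
0   | 4      | 5
1   | 3      | 5
2   | 2      | 4

Break M into stripes! Each stripe contains only destination nodes in the corresponding block of rnew
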
Block-Stripe Analysis
§ Break M into stripes
• Each stripe contains only destination nodes in the corresponding block of rnew
§ Some additional overhead per stripe
• But it is usually worth it
§ Cost per iteration of Power method: |M|(1+ε) + (k+1)|r|

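Striping can be sketched as a preprocessing pass followed by a per-stripe update. The code below is illustrative only: `make_stripes` and `stripe_update` are hypothetical names, lists in memory stand in for files on disk, and the records are taken from the block-based update table (degrees 4, 2, 2).

```python
def make_stripes(sparse_m, n, block_size):
    """Split M into stripes; stripe b keeps only destinations in block b of r_new."""
    k = -(-n // block_size)                      # number of blocks (ceiling)
    stripes = [[] for _ in range(k)]
    for src, degree, dests in sparse_m:
        for b in range(k):
            lo, hi = b * block_size, (b + 1) * block_size
            kept = [d for d in dests if lo <= d < hi]
            if kept:
                # Keep the full out-degree: the vote is still r_old[src] / degree.
                stripes[b].append((src, degree, kept))
    return stripes

def stripe_update(stripes, r_old, beta, block_size):
    """One power-iteration step that reads each stripe of M exactly once."""
    n = len(r_old)
    r_new = []
    for b, stripe in enumerate(stripes):
        lo = b * block_size
        block = [(1.0 - beta) / n] * (min(lo + block_size, n) - lo)
        for src, degree, dests in stripe:
            for dst in dests:
                block[dst - lo] += beta * r_old[src] / degree
        r_new.extend(block)
    return r_new

sparse_m = [(0, 4, [0, 1, 3, 5]), (1, 2, [0, 5]), (2, 2, [3, 4])]
stripes = make_stripes(sparse_m, n=6, block_size=2)
r_new = stripe_update(stripes, [1 / 6] * 6, beta=0.8, block_size=2)
print(stripes)
```

A source node appears in several stripes, each time with its source id and degree repeated; that repetition is the extra ε in the |M|(1+ε) term.
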
Some Problems with Page Rank
§ Measures generic popularity of a page
• Biased against topic-specific authorities
• Solution: Topic-Specific PageRank (next)
§ Uses a single measure of importance
• Other models of importance
• Solution: Hubs-and-Authorities
§ Susceptible to Link spam
• Artificial link topographies created in order to boost page rank
• Solution: TrustRank
