SlideShare a Scribd company logo
Motivation When searching for information on the WWW, user perform a query to a search engine. The engine return, as the query’s result, a list of Web sites which usually is a huge set. So the ranking of these web sites is very important. Because much information is contained in the link-structure of the WWW, information such as which pages are linked to others can be used to augment search algorithms.
The Stochastic Approach for Link-Structure Analysis (SALSA) and the TKC Effect The PageRank Citation Ranking: Bringing Order to the Web
Paper 1----SALSA authorities: web pages that have many outlinks hubs: web pages that point to many authoritative sites Hubs and authorities form  communities , the most  prominent community  is called the  principal community .
SALSA----Idea SALSA is based upon the theory of Markov chains,  and relies on the stochastic properties of random walks  performed on our collection of sites. The input to our scheme consists of a collection of sites  C  which is built around a topic  t . Intuition  suggests that authoritative sites on topic  t  should be visible from many sites in the subgraph induced by  C .  Thus, a random walk on this subgraph will visit t -authorities with high probability.
SALSA----Idea Combine the theory of random walks with the notion  of the two distinct types of Web sites, hubs and  authorities, and actually analyze two different Markov  chains: A chain of hubs and a chain of authorities.  Analyzing both chains allows our approach to give each Web site two distinct scores, a hub score and an  authority score.
SALSA----Computing Now define two stochastic matrices, which are the  transition matrices of the two Markov chains at interest: The hub-matrix   H :  The authority-matrix   Ã   :
SALSA the principal community of authorities(hubs) found by the SALSA will be composed of the sites whose entries in the principal eigenvector of  A  ( H ) are the highest.
SALSA----Conclusion SALSA is a new stochastic approach for link structure analysis, which examines random walks on graphs derived from the link structure.  The principal community of authorities(hubs) corresponds to the sites that are most frequently visited by the random walk defined by the authority(hub) Markov chain.
The PageRank Citation Ranking: Bringing Order to the Web Larry Page etc. Stanford University
PageRank----Idea Every page has some number of forward links(outedges) and backlinks(inedges)
PageRank----Idea Two cases PageRank is interesting: Web pages vary greatly in terms of the number of backlinks they have. For example, the Netscape home page has 62,804 backlinks compared to most pages which have just a few backlinks. Generally, highly linked pages are more “important” than pages with few links.
PageRank----Idea Backlinks coming from important pages convey more importance to a page. For example, if a web page has a link off the yahoo home page, it may be just one link but it is a very important one. A page has high rank if the sum of the ranks of its backlinks is high. This covers both the case when a page has many backlinks and when a page has a few highly ranked backlinks.
PageRank----Definition u: a web page F u :  set of pages u points to  B u :  set of pages that point to u N u =|F u |:  the number of links from u  c: a factor used for normalization The equation is recursive, but it may be computed by starting with any set of ranks and iterating the computation until it converges.
PageRank----definition A problem with above definition:  rank sink If two web pages point to each other but to no other page, during the iteration, this loop will accumulate rank but  never distribute any rank.
PageRank----definition Definition modified: E(u) is some vector over the web pages(for example uniform, favorite page etc.) that corresponds to a source of rank.  E(u) is a user designed parameter.
PageRank----Random Surfer Model The definition corresponds to the probability distribution of a random walk on the web graphs.  E(u) can be thought as the random surfer gets bored periodically and jumps to a different page and not kept in a loop forever.
PageRank----Conclution PageRank is a global ranking based on the web's graph structure PageRank use backlinks information to bring order to the web PageRank can be thought of as random surfer model.
Compare----SALSA and PageRank Both ranking web page by link structure information. Both are based on the graph of the web. Both use random walk idea.

More Related Content

PPT
Pagerank (1)
marlon
 
PPTX
Course 100 introduction to performance canvas
Jennifer Howell
 
PPT
Pagerank (1)
marlon
 
PDF
Embedding BI
Jennifer Howell
 
PPT
Mantenimiento
Cindhy Celi
 
PDF
datos 1
axelerik
 
PPTX
Taller base de datos 1
yesebeth
 
Pagerank (1)
marlon
 
Course 100 introduction to performance canvas
Jennifer Howell
 
Pagerank (1)
marlon
 
Embedding BI
Jennifer Howell
 
Mantenimiento
Cindhy Celi
 
datos 1
axelerik
 
Taller base de datos 1
yesebeth
 

Viewers also liked (7)

PPTX
Mapeo objeto relacional
IsabelAlisson
 
PDF
paper shop
Xiva Sandoval
 
PDF
Tutorial de instalacion de xampp
Nany Galvez Vasquez
 
PDF
Achegamento a historia da fotografia en ourense
Biblioteca Museo Etnolóxico. Ribadavia
 
PDF
Tratado practico de fotografia industrial
Biblioteca Museo Etnolóxico. Ribadavia
 
PDF
Programas de aplicación
Yannina Moreno
 
Mapeo objeto relacional
IsabelAlisson
 
paper shop
Xiva Sandoval
 
Tutorial de instalacion de xampp
Nany Galvez Vasquez
 
Achegamento a historia da fotografia en ourense
Biblioteca Museo Etnolóxico. Ribadavia
 
Tratado practico de fotografia industrial
Biblioteca Museo Etnolóxico. Ribadavia
 
Programas de aplicación
Yannina Moreno
 
Ad

Similar to Pagerank (1) (20)

PPT
Pagerank
Byron Zavala
 
PPT
Pagerank
Jimmy Gonzàlez
 
PPT
Pagerank
espol
 
PPT
Pagerank
Alex
 
PPT
Pagerank
César García
 
PPT
Pagerank
Felix
 
PPT
Power Point
Espol
 
PPT
Pagerank
Rodrigo Amaya
 
PPT
Pagerank (1)
Bismark
 
PPT
Pagerank
Juan Carlos Marchan
 
PPT
Pagerank (1)
Gisselle Arcentales
 
PPT
Pagerank
guest0f43f1
 
PPT
Pagerank
Ronald
 
PPT
Pagerank (1)
Carlos Fernando
 
PPT
Pagerank (1)
Fabricio
 
PPT
Pagerank
espol
 
PPT
Pagerank
Diego
 
PPT
prueba
Jorge Baquero
 
PPT
Pagerank
davidmurrieta
 
Pagerank
Byron Zavala
 
Pagerank
Jimmy Gonzàlez
 
Pagerank
espol
 
Pagerank
Alex
 
Pagerank
César García
 
Pagerank
Felix
 
Power Point
Espol
 
Pagerank
Rodrigo Amaya
 
Pagerank (1)
Bismark
 
Pagerank (1)
Gisselle Arcentales
 
Pagerank
guest0f43f1
 
Pagerank
Ronald
 
Pagerank (1)
Carlos Fernando
 
Pagerank (1)
Fabricio
 
Pagerank
espol
 
Pagerank
Diego
 
prueba
Jorge Baquero
 
Pagerank
davidmurrieta
 
Ad

Recently uploaded (20)

PDF
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PDF
The Future of Artificial Intelligence (AI)
Mukul
 
PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PDF
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PPTX
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
Brief History of Internet - Early Days of Internet
sutharharshit158
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PDF
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
The Future of Artificial Intelligence (AI)
Mukul
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Brief History of Internet - Early Days of Internet
sutharharshit158
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 

Pagerank (1)

  • 1. Motivation When searching for information on the WWW, user perform a query to a search engine. The engine return, as the query’s result, a list of Web sites which usually is a huge set. So the ranking of these web sites is very important. Because much information is contained in the link-structure of the WWW, information such as which pages are linked to others can be used to augment search algorithms.
  • 2. The Stochastic Approach for Link-Structure Analysis (SALSA) and the TKC Effect The PageRank Citation Ranking: Bringing Order to the Web
  • 3. Paper 1----SALSA authorities: web pages that have many outlinks hubs: web pages that point to many authoritative sites Hubs and authorities form communities , the most prominent community is called the principal community .
  • 4. SALSA----Idea SALSA is based upon the theory of Markov chains, and relies on the stochastic properties of random walks performed on our collection of sites. The input to our scheme consists of a collection of sites C which is built around a topic t . Intuition suggests that authoritative sites on topic t should be visible from many sites in the subgraph induced by C . Thus, a random walk on this subgraph will visit t -authorities with high probability.
  • 5. SALSA----Idea Combine the theory of random walks with the notion of the two distinct types of Web sites, hubs and authorities, and actually analyze two different Markov chains: A chain of hubs and a chain of authorities. Analyzing both chains allows our approach to give each Web site two distinct scores, a hub score and an authority score.
  • 6. SALSA----Computing Now define two stochastic matrices, which are the transition matrices of the two Markov chains at interest: The hub-matrix H : The authority-matrix à   :
  • 7. SALSA the principal community of authorities(hubs) found by the SALSA will be composed of the sites whose entries in the principal eigenvector of A ( H ) are the highest.
  • 8. SALSA----Conclusion SALSA is a new stochastic approach for link structure analysis, which examines random walks on graphs derived from the link structure. The principal community of authorities(hubs) corresponds to the sites that are most frequently visited by the random walk defined by the authority(hub) Markov chain.
  • 9. The PageRank Citation Ranking: Bringing Order to the Web Larry Page etc. Stanford University
  • 10. PageRank----Idea Every page has some number of forward links(outedges) and backlinks(inedges)
  • 11. PageRank----Idea Two cases PageRank is interesting: Web pages vary greatly in terms of the number of backlinks they have. For example, the Netscape home page has 62,804 backlinks compared to most pages which have just a few backlinks. Generally, highly linked pages are more “important” than pages with few links.
  • 12. PageRank----Idea Backlinks coming from important pages convey more importance to a page. For example, if a web page has a link off the yahoo home page, it may be just one link but it is a very important one. A page has high rank if the sum of the ranks of its backlinks is high. This covers both the case when a page has many backlinks and when a page has a few highly ranked backlinks.
  • 13. PageRank----Definition u: a web page F u : set of pages u points to B u : set of pages that point to u N u =|F u |: the number of links from u c: a factor used for normalization The equation is recursive, but it may be computed by starting with any set of ranks and iterating the computation until it converges.
  • 14. PageRank----definition A problem with above definition: rank sink If two web pages point to each other but to no other page, during the iteration, this loop will accumulate rank but never distribute any rank.
  • 15. PageRank----definition Definition modified: E(u) is some vector over the web pages(for example uniform, favorite page etc.) that corresponds to a source of rank. E(u) is a user designed parameter.
  • 16. PageRank----Random Surfer Model The definition corresponds to the probability distribution of a random walk on the web graphs. E(u) can be thought as the random surfer gets bored periodically and jumps to a different page and not kept in a loop forever.
  • 17. PageRank----Conclution PageRank is a global ranking based on the web's graph structure PageRank use backlinks information to bring order to the web PageRank can be thought of as random surfer model.
  • 18. Compare----SALSA and PageRank Both ranking web page by link structure information. Both are based on the graph of the web. Both use random walk idea.