Google's PageRank and Beyond
-
Amy N. Langville
and Carl D. Meyer
About this book
Why doesn't your home page appear on the first page of search results, even when you query your own name? How do other web pages always appear at the top? What creates these powerful rankings? And how? The first book ever about the science of web page rankings, Google's PageRank and Beyond supplies the answers to these and other questions and more.
The book serves two very different audiences: the curious science reader and the technical computational reader. The chapters build in mathematical sophistication, so that the first five are accessible to the general academic reader. While other chapters are much more mathematical in nature, each one contains something for both audiences. For example, the authors include entertaining asides such as how search engines make money and how the Great Firewall of China influences research.
The book includes an extensive background chapter designed to help readers learn more about the mathematics of search engines, and it contains several MATLAB codes and links to sample web data sets. The philosophy throughout is to encourage readers to experiment with the ideas and algorithms in the text.
Any business seriously interested in improving its rankings in the major search engines can benefit from the clear examples, sample code, and list of resources provided.
- Many illustrative examples and entertaining asides
- MATLAB code
- Accessible and informal style
- Complete and self-contained section for mathematics review
Author / Editor information
Reviews
"The book under review is excellently written, with a fresh and engaging style. The reader will particularly enjoy the 'Asides' interspersed throughout the text. They contain all kind of entertaining stories, practical tips, and amusing quotes. . . . The book also contains some useful resources for computation."---Pablo Fernández, Mathematical Intelligencer
"The book is very attractively and clearly written. The authors succeed to manage in an optimal way the presentation of both basic and more sophisticated concepts involved in the analysis of Google's PageRank, such that the book serves both audiences: the general and the technical scientific public."---Constantin Popa, Zentralblatt MATH
"This book is written for people who are curious about new science and technology as well as for those with more advanced background in matrix theory.... Much of the book can be easily followed by general readers, while understanding the remaining part requires only a good first course in linear algebra. It can be a reference book for people who want to know more about the ideas behind the currently popular search engines, and it provides an introductory text for beginning researchers in the area of information retrieval."---Jiu Ding, Mathemathical Reviews
"Langville and Meyer present the mathematics in all its detail. . . . But they vary the math with discussions of the many issues involved in building search engines, the 'wars' between search engine developers and those trying to artificially inflate the position of their pages, and the future of search-engine development. . . . Google's PageRank and Beyond makes good reading for anyone, student or professional, who wants to understand the details of search engines."---James Hendler, Physics Today
"If I were taking, or teaching, a course in linear algebra today, this book would be a godsend."---Ed Gerstner, Nature Physics
"Amy N. Langville and Carl D. Meyer examine the logic, mathematics, and sophistication behind Google's PageRank and other Internet search engine ranking programs. . . . It is an excellent work."---Ian D. Gordon, Library Journal
"This book should be at the top of anyone's list as a must-read for those interested in how search engines work and, more specifically how Google is to meet the needs of so many people in so many ways."---Michael W. Berry, SIAM Review
"This is a worthwhile book. It offers a comprehensive and erudite presentation of PageRank and related search-engine algorithms, and it is written in an approachable way, given the mathematical foundations involved."---Jonathan Bowen, Times Higher Education Supplement
"[F]or anyone who wants to delve deeply into just how Google's PageRank works, I recommend Google's PageRank and Beyond."---Stephen H. Wildstrom, BusinessWeek
"Honorable Mention for the 2006 Award for Best Professional/Scholarly Book in Computer & Information Science, Association of American Publishers"
"Comprehensive and engagingly written. This book should become an important resource for many audiences: applied mathematicians, search industry professionals, and anyone who wants to learn more about how search engines work."—Jon Kleinberg, Cornell University
"I don't think there are any competitive books in print with the same depth and breadth on the topic of search engine ranking. The content is well-organized and well-written."—Michael Berry, University of Tennessee
Topics
-
Download PDFPublicly Available
Frontmatter
i -
Download PDFPublicly Available
Contents
v -
Requires Authentication UnlicensedLicensed
Preface
ix -
Requires Authentication UnlicensedLicensed
Chapter One. Introduction to Web Search Engines
1 -
Requires Authentication UnlicensedLicensed
Chapter Two. Crawling, Indexing, and Query Processing
15 -
Requires Authentication UnlicensedLicensed
Chapter Three. Ranking Webpages by Popularity
25 -
Requires Authentication UnlicensedLicensed
Chapter Four. The Mathematics of Google’s PageRank
31 -
Requires Authentication UnlicensedLicensed
Chapter Five. Parameters in the PageRank Model
47 -
Requires Authentication UnlicensedLicensed
Chapter Six. The Sensitivity of PageRank
57 -
Requires Authentication UnlicensedLicensed
Chapter Seven. The PageRank Problem as a Linear System
71 -
Requires Authentication UnlicensedLicensed
Chapter Eight. Issues in Large-Scale Implementation of PageRank
75 -
Requires Authentication UnlicensedLicensed
Chapter Nine. Accelerating the Computation of PageRank
89 -
Requires Authentication UnlicensedLicensed
Chapter Ten. Updating the PageRank Vector
99 -
Requires Authentication UnlicensedLicensed
Chapter Eleven. The HITS Method for Ranking Webpages
115 -
Requires Authentication UnlicensedLicensed
Chapter Twelve. Other Link Methods for Ranking Webpages
131 -
Requires Authentication UnlicensedLicensed
Chapter Thirteen. The Future of Web Information Retrieval
139 -
Requires Authentication UnlicensedLicensed
Chapter Fourteen. Resources for Web Information Retrieval
149 -
Requires Authentication UnlicensedLicensed
Chapter Fifteen. The Mathematics Guide
153 -
Requires Authentication UnlicensedLicensed
Chapter Sixteen. Glossary
201 -
Requires Authentication UnlicensedLicensed
Bibliography
207 -
Requires Authentication UnlicensedLicensed
Index
219