An Introduction to the Use of Link
Analysis by Web Search Engines (Amy Langville, NCSU and CofC)
Abstract: The first-generation search engines struggled to
meet user's needs. Search engines had a tough job to do.
The Web was growing at an exponential rate. Often thousands of
webpages contained information on the query terms. And spammers
were purposefully trying to deceive the search engines.
Fortunately, in 1998, two algorithms came to the rescue. These
two algorithms, PageRank, which is used by Google, and HITS, which is
used by Teoma, exploited the enormous directed graph created by the
hyperlink structure of the Web to rank the thousands of relevant pages
by their importance. These rankings dramatically improved search,
so that nearly every engine today uses some link analysis
algorithm. In this brief survey talk, I'll present the
PageRank algorithm (and perhaps the HITS algorithm, if time permits).
This is an abstract of a talk to be presented at the
2004 SEAMS Workshop in Charleston, SC. For more information, visit
the workshop's homepage at math.cofc.edu/SEAMS.