Papers by Googlers
Google supplies a partial list of papers written by people now at Google.
The Nature of Meaning in the Age of Google
Terrence A. Brooks writes a paper about how search engines are changing the way we understand the world around us. (April, 2004)
The Google File System
By Ghemawat, Sanjay; Gobioff, Howard; and Leung, Shun-Tak. [PDF] (August, 2003)
Extrapolation Methods for Accelerating PageRank Computations
This paper by Sepandar Kamvar, Taher Haveliwala, Chris Manning, and Gene Golub, published in WWW13, presents an algorithm to speed up the computation of PageRank by making some initial approximations. [PDF] (May, 2003)
Adaptive Methods for the Computation of PageRank
This paper by Sepandar Kamvar, Taher Haveliwala, and Gene Golub describes an algorithm to speed up the computation of PageRank using the fact that pages converge at different rates. [PDF] (April, 2003)
Exploiting the Block Structure of the Web for Computing PageRank
This paper by Sepandar Kamvar, Taher Haveliwala, Chris Manning, and Gene Golub presents an algorithm to vastly speed up the computation of PageRank. [PDF] (March, 2003)
The Second Eigenvalue of the Google Matrix
This paper by Sepandar Kamvar and Taher Haveliwala proves analytically the second eigenvalue of the Google Matrix, which has implications for the PageRank algorithm. [PDF] (March, 2003)
WWW2003: Detecting near-replicas on the Web by content and hyperlink analysis
Paper by Ernesto Di Iorio, et. al. "In this paper we propose a technique for finding lists of similar documents, and in particular replicas and near-replicas, based on a pair of signatures which take into account both the document contents and the hyperlink structure. " (March, 2003)
United States Patent: 6,526,440
Ranking search results by reranking the results based on local inter-connectivity. Inventor Krishna Bharat; assignee Google. (February 25, 2003)
Topic-Sensitive PageRank
Taher H. Haveliwala's paper for the 11th International World Wide Web Conference explains that Google proposes to make PageRank reflect importance with respect to a particular topic. (May, 2002)
Building a Distributed Full-Text Index for the Web
Paper from WWW10 by Sergey Melnik, Sriram Raghavan, Beverly Yang, Hector Garcia-Molina from the Computer Science Department at Stanford University. (May, 2001)
A Case Study in Web Search using TREC Algorithms
Paper from WWW10 by Google employees Amit Singhal and Marcin Kaszkiel. (May, 2001)
Computing Iceberg Queries Efficiently
By Fang, Min; Shivakumar, Narayanan; Garcia-Molina, Hector; Motwani, Rajeev; Ullman, Jeffrey D. "In this paper we develop efficient execution strategies for an important class of queries that we call iceberg queries. An iceberg query performs an aggregate function over an attribute (or set of attributes) and then eliminates aggregate values that are below some specified threshold." [PDF] (November 11, 1999)
Dynamic Data Mining: Exploring Large Rule Spaces by Sampling
By Brin, Sergey; Page, Lawrence. Available in Postscript, PDF, and plain text formats. (November 11, 1999)
Extracting Patterns and Relations from the World Wide Web
By Brin, Sergey. [PDF] (November 11, 1999)
The PageRank Citation Ranking: Bringing Order to the Web
By Page, Lawrence; Brin, Sergey; Motwani, Rajeev; Winograd, Terry. Available in Postscript, PDF, and plain text formats. (November 11, 1999)
Efficient Computation of PageRank
By Haveliwala, T. [PDF] (September, 1999)
The Anatomy of a Large-Scale Hypertextual Web Search Engine
By Brin, Sergey; Page, Lawrence. (April 10, 1998)
Efficient Crawling Through URL Ordering
By Cho, Junghoo; Garcia-Molina, Hector; Page, Lawrence. Available in Postscript, PDF, and plain text formats. [PDF] (April, 1998)
Finding Near-replicas of Documents on the Web
By Shivakumar, N.; Garcia-Molina, H. Available in Postscript format. (March, 1998)
Results: 1 2