Sunday, November 1, 2009
Google began in January 1996 as a research project by Larry Page and Sergey Brin, a Ph.D. student at Stanford[1] working on the Stanford Digital Library Project (SDLP). The SDLP's goal was “to develop the enabling technologies for a single, integrated and universal digital library." and was funded through the National Science Foundation among other federal agencies.[2][3][4][5] In search for a dissertation theme, Page considered—among other things—exploring the mathematical properties of the World Wide Web, understanding its link structure as a huge graph.[6] His supervisor Terry Winograd encouraged him to pick this idea (which Page later recalled as "the best advice I ever got"[7]) and Page focused on the problem of finding out which web pages link to a given page, considering the number and nature of such backlinks to be valuable information about that page (with the role of citations in academic publishing in mind).[6] In his research project, nicknamed "BackRub", he was soon joined by Sergey Brin, a fellow Stanford Ph.D. student supported by a National Science Foundation Graduate Fellowship.[2] Brin was already a close friend, whom Page had first met in the summer of 1995 in a group of potential new students which Brin had volunteered to show around the campus.[6] Page's web crawler began exploring the web in March 1996, setting out from Page's own Stanford home page as its only starting point.[6] To convert the backlink data that it gathered into a measure of importance for a given web page, Brin and Page developed the PageRank algorithm.[6] Analyzing BackRub's output—which, for a given URL, consisted of a list of backlinks ranked by importance—it occurred to them that a search engine based on PageRank would produce better results than existing techniques (existing search engines at the time essentially ranked results according to how many times the search term appeared on a page).[6][8] A small search engine called RankDex was already exploring a similar strategy.[9]
Convinced that the pages with the most links to them from other highly relevant Web pages must be the most relevant pages associated with the search, Page and Brin tested their thesis as part of their studies, and laid the foundation for their search engine. By early 1997, the backrub page described the state as follows:[10]
Some Rough Statistics (from August 29th, 1996)Total indexable HTML urls: 75.2306 MillionTotal content downloaded: 207.022 gigabytes
Convinced that the pages with the most links to them from other highly relevant Web pages must be the most relevant pages associated with the search, Page and Brin tested their thesis as part of their studies, and laid the foundation for their search engine. By early 1997, the backrub page described the state as follows:[10]
Some Rough Statistics (from August 29th, 1996)Total indexable HTML urls: 75.2306 MillionTotal content downloaded: 207.022 gigabytes
Subscribe to:
Post Comments (Atom)
0 comments:
Post a Comment