With nearly 50% of the traffic generated by the whole of the search engines and directories (in France), Google must not be neglected. From a PhD research subject for two American academics (Larry Page and Sergey Brin) Google became a company on the international scene.
The success of this engine comes on the one hand from the algorithm worked out by the 2 founders, and on the other hand of the application of an elementary principle: the simplest things are sometimes most effective. In this case, Google chose a very stripped interface, without advertisement, by concentrating its services on the search for Web pages and nothing else. The engine also enjoys a very great speed in the interrogation of its data base.
In addition to results of research considered to be relevant by many users, Google succeeded to index a very great number of pages: its "index" is from now on one of the largest in the world (if it is not the first), with approximately 2 billion pages. Recently, new types of documents were indexed, in addition to the traditional HTML: Word, Excel, Acrobat, PowerPoint, WordPad, etc.
1) a precise analysis of the contents of the indexed pages (keywords, occurrences, positions in the document, type of HTML tag, etc.)
2) a classification of the pages according to their popularity (PageRank), calculated from the topology of
the Web (i.e. the whole structure of the documents and the links between them).
No comments:
Post a Comment