The solutions should be ready for inspection by Thursday 22.11.2001 (midnight).
Google is a well-known search engine for the Web. Study the article: Sergey Brin and Lawrence Page The Anatomy of a Large-Scale Hypertextual Web Search Engine , which describes the implementation of Google as it still was an academic project (~1998).
Explain the high level architecture of Google as shown in Figure 1 (in the article):
Try to identify which parts of the architecture (and how) are needed in the searching process. That is: What happens when a user gives a few keywords and starts a search?
Compare text categorization and text summarization as learning problems (~ classification problems), for instance: