The basic idea is to build a Google-style
search engine that excels at finding pages (initially restricted to
those on the University of
Washington family of web sites).
Crucial to this will be the spidering strategy, index structures, snippet summary extraction, and ranking algorithms (specifically those based on hypertext analysis techniques). There are several possible extensions, including, to list just two: 1) an efficient implementation of PageRank, and 2) focusing the crawler so that it finds and classifies webcams and lets users search for them by geographic query. The project will be broken into parts, each with its own deadline and turn-in deliverables. In the first part, students will work alone, but we will form groups of two or three for the subsequent parts.
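As a rough illustration of the hypertext-based ranking mentioned above, the following is a minimal sketch of PageRank computed by power iteration over a small in-memory link graph. It is not the project's required design: the function name `pagerank`, the dictionary-of-lists graph format, the damping factor of 0.85, the fixed iteration count, and the toy graph are all assumptions made for this example, and an efficient implementation for a real crawl would need sparse, disk-friendly data structures.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Return a dict of page -> score for a small link graph.

    links maps each page to the list of pages it links to
    (a hypothetical format chosen for this sketch).
    """
    # Collect every page that appears as a source or as a link target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)

    # Start from a uniform distribution over all pages.
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Score held by pages with no outgoing links is spread evenly,
        # so the scores keep summing to 1.
        dangling = sum(rank[page] for page in pages if not links.get(page))
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {page: base for page in pages}

        # Each page passes an equal share of its score along each outgoing link.
        for page, targets in links.items():
            if not targets:
                continue
            share = damping * rank[page] / len(targets)
            for target in targets:
                new_rank[target] += share

        rank = new_rank

    return rank


# Toy graph (made up for illustration): "c" is linked to by three pages,
# while "d" has no incoming links.
toy_links = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(toy_links)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

In this toy graph the page with the most incoming links ends up ranked first and the page with no incoming links ranked last. A fixed iteration count keeps the sketch short; a real implementation would more likely stop once the scores change by less than some tolerance.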
Department of Computer Science & Engineering, University of Washington, Box 352350, Seattle, WA 98195-2350. (206) 543-1695 voice, (206) 543-2969 FAX