Steam-powered Turing Machine University of Washington Department of Computer Science & Engineering
 CSE454 Course Overview
  CSE Home   About Us    Search    Contact Info 

Administrivia
 Home
 Using course email
 Email archive
 Policies
Content
 Overview
 Resources
 Lecture slides
Assignments
 Reading
 Project
    The following outline is a tentative list of the topics we hope to cover.
  • Introduction 
    • Foundational protocols: HTTP, HTML, browser archiecture
    • Server basics, cookies, log files, dynamic page generation
    • Website management, N-tier architecture, scalability
  • Information Retrieval 
    • Traditional approaches
      • Ranking, TF/IDF, precision / recall, stemming, stop words
      • Latent Semantic Indexing
    • Web-oriented techniques
      • Hypertext analysis (page rank, hubs and authorities, anchor text)
      • Spamming: keyword stuffing, doorway/jump pages, cloaking, font tricks
      • Spider search strategy, macro structure of the Web
    • Implementation and scale-up issues
      • Index structures, stemming
      • Boolean processing
    • Summarization and snippets
  • Classification and Clustering on the Web 
    • Learning, classification, and datamining
    • Clustering of search engine results
    • Collaborative filtering, user modeling, adaptive websites
    • Topic-specific crawling
  • Information Extraction 
    • Question answering
    • KnowItAll
  • Special Topics 
    • Meta-search, query routing.
    • The Semantic Web, Semantic e-mail
    • Cryptography, security, privacy, P3P
    • Micropayments, digital cash, server-side wallets, and e-commerce

     


CSE logo Department of Computer Science & Engineering
University of Washington
Box 352350
Seattle, WA  98195-2350
(206) 543-1695 voice, (206) 543-2969 FAX