Steam-powered Turing Machine University of Washington Department of Computer Science & Engineering
 CSE 546 - Data Mining - Autumn 2003
  CSE Home  About Us    Search    Contact Info 

Instructor: pedrod at cs.washington.edu
Office: Allen 648
Office hours: Wednesdays 2:00-2:50 and by appointment
TA: mattr at cs.washington.edu
Office: Allen 220
Office hours: Mondays 4:30-5:20 and by appointment

Class meets:
Mondays and Wednesdays from 3:00 to 4:20 in MEB 242

Readings

Week 1: Chapter 1 of Hand and Behind-the-scenes data mining
Week 2: Chapter 3 of Mitchell
Week 3: Chapter 10 of Mitchell; review first-order logic
Week 4: Mining high-speed data streams, Mining complex models from arbitrarily large databases in constant time
Week 6: Chapter 6 of Mitchell; review probability and statistics
Week 7: Chapter 4 of Mitchell; review calculus
Week 8: Section 2 of Machine-learning research: Four current directions
Week 9: Chapter 7 of Mitchell, A unified bias-variance decomposition, A tutorial on support vector machines
Week 10: Chapter 9 of Hand

Lecture Notes

Week 1: Introduction, inductive learning
Week 2: Decision trees
Week 3: Rule induction
Week 4: Scalability
Week 5: Instance-based learning
Week 6: Bayesian learning
Week 7: Neural networks
Week 8: Model ensembles
Week 9: Learning theory and SVMs
Week 10: Clustering

Topics

The topics covered will have a non-null intersection with the following list:

Project

Class evaluation will be by means of a project. Projects can be proposed by the students - for example, applying data mining techniques to your area of interest - or chosen from this list: Projects can be carried out in groups of two or individually; we encourage working in groups. In addition to a written report, students will give a short oral presentation of their work.

Schedule of project presentations

Textbooks

Papers

Software

VFML is a set of tools developed at UW that you may find useful for your project. VFML is still in beta; if you're planning to use it, please contact ghulten at cs.washington.edu. Pointers to various pieces of data mining software can be found at KDnuggets.

Anonymous Feedback

Comments can be sent to the instructor or TAs using this anonymous feedback form.

Course Mailing List

To subscribe to the course mailing list, visit the mailing list home page. Alternatively, you can use the email interface to subscribe; send email to cse546-request@cs with the word "help" in the subject to receive a list of email command options.

Mailing List Archive


CSE logo Department of Computer Science & Engineering
University of Washington
Box 352350
Seattle, WA  98195-2350
(206) 543-1695 voice, (206) 543-2969 FAX
[comments to Pedro Domingos]