FW: highly recommended talk today (sorry if you are getting this twice)

From: Anna Karlin (karlin@cs.washington.edu)
Date: Mon May 24 2004 - 11:40:38 PDT

Next message: Kelli McGee \(Kelly Services Inc\): "06/04/2004 - Algorithmic Construction of Sets for k-Restrictions; Dana Moshkovitz, Tel-Aviv University"

Previous message: Anna Karlin: "theory seminar this friday (ee1 045 11:30am)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]

Speaker: Yoav Freund
Columbia University

Title:Boosting algorithms and predictive modeling

Date: Monday, May 24, 2004

Time: 3:30 p.m.

Place: Gowen 301

Abstract:
Modern machine Learning algorithms such as support vector machines and boosting are rapidly becoming standard tools in the analysis of large and complex data-sets. Examples include the analysis of transaction records in business data mining, the analysis of gene regulatory networks in computational biology, event classification in experimental high-energy physics and object detection in computer vision.

The novelty of these methods is both statistical and computational. From the statistical point of view, these new methods perform surprisingly well in situations where the dimension of the data is far larger than the number of available data-points. This phenomenon can be explained in terms of the prediction "margins" and their relationship to prediction confidence.

>From the computational point of view, the main novelty of boosting is the discovery of an efficient way in which many weakly correlated features can be combined into a single accurate predictor.

There is an underlying paradigm shift in statistical analysis. Traditionally, the main goal of the statistician is to accurately estimate the parameters of a stochastic model of the system under investigation and thereby add directly to scientific knowledge. However, in an increasing number of applications, the goal is to construct a prediction function, implemented as a computer algorithm, which can predict some important aspect of the system. An intuitive interpretation of the parameters of this prediction function is of secondary importance. This brings machine learning methodology into direct conflict with the prevailing scientific research paradigm.

In this talk I will describe boosting algorithms, their theoretical analysis, some of their more interesting applications, and some of the directions of improvement. I will then describe some recent work on using boosting as a way of improving the efficiency of data collection. I'll conclude with some observations regarding the increasing need to interpret prediction functions and incorporate them into humanly understandable scientific knowledge.

_______________________________________________
Theory-group mailing list
Theory-group@cs.washington.edu
http://mailman.cs.washington.edu/mailman/listinfo/theory-group

Next message: Kelli McGee \(Kelly Services Inc\): "06/04/2004 - Algorithmic Construction of Sets for k-Restrictions; Dana Moshkovitz, Tel-Aviv University"

Previous message: Anna Karlin: "theory seminar this friday (ee1 045 11:30am)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]

This archive was generated by hypermail 2.1.6 : Mon May 24 2004 - 11:40:54 PDT