Outline
Logistics
Defining a Learning Problem
Concept Learning
Evaluating Attributes
Resulting Tree
Summary: Learning = Search
Correspondence: A hypothesis = set of instances
Version Space: Compact Representation
Training Example 3
Comparison
Two kinds of bias
PAC Learning
Ensembles of Classifiers
Constructing Ensembles
Review: Learning
Softbot Perception Problem
Strategy: Wrappers
Scaling issues
Wrapper Induction
Example
LR wrappers: The basic idea
Country/Code LR wrapper
Observation
Ubiquity!
Inductive (example-driven) learning
Wrapper induction algorithm
Step 3: Finding an LR wrapper
LR: Finding r1
LR: Finding l1, l2 and r2
Finding an LR wrapper: Algorithm
A problem with LR wrappers
The complication
A solution: HLRT wrappers
Country/Code HLRT wrapper
“Generic” HLRT wrapper
Step 3: Finding an HLRT wrapper
HLRT: Finding r1, l2 and r2
HLRT: Finding h, t, and l1
Finding an HLRT wrapper: Algorithm
Step 1. Termination condition
PAC model
PAC model for HLRT
PAC model: Interpretation
Step 2. WIEN: Manual page labeling
Automatic page labeling
Recognizers
Corroboration of Imperfect Recognizers
Corroboration: Example
Summary of results
Q: Is wrapper induction practical?
A: Yes
Kushmerick Contributions
MDP Model of Agency
Trajectory
MDP Model (continued)
Good News and Bad News
Properties of the Model
Computing Optimal Policies
Policy Construction and Dynamic Programming
Value Iteration and Its Variants
Policy Iteration
Summary of MDP Solution Techniques
Reinforcement Learning
Q Learning
Q Learning (cont.)
Convergence of Q update
Summary of General MDP Model
Summary of Reinforcement Learning
Simple Backup
Email: weld@cs.washington.edu
Other information: CSE 592, Lecture 9