Backpropagation of errors

Use a multilayer feedforward network
- Outputs are differentiable functions of the inputs

Result: Can propagate blame back to internal nodes
- Chain rule (calculus) gives Dwij for internal nodes
- Use a gradient-descent learning rule

Use a nonlinear, differentiable activation function