Backpropagation of errors
Use a nonlinear, differentiable activation function
Use a multilayer feedforward network
- Outputs are differentiable functions of the inputs
Result: Can propagate blame for the output error back to internal (hidden) nodes
- Chain rule (calculus) gives Δw_ij for internal nodes
- Use a gradient-descent learning rule
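
The points above can be sketched in a few lines of plain Python. This is a minimal illustration, not a reference implementation. It assumes a sigmoid activation, a squared-error loss E = 0.5 * (y - t)^2, one hidden layer, and a single output. All names (sigmoid, forward, backprop, train_step) and all numeric values are hypothetical, chosen only for the example.

```python
import math

def sigmoid(x):
    # Nonlinear, differentiable activation; its derivative is s * (1 - s).
    return 1.0 / (1.0 + math.exp(-x))

def forward(w_hidden, w_out, x):
    # Feedforward pass: inputs -> hidden layer -> single output.
    h = [sigmoid(sum(w * xi for w, xi in zip(row, x))) for row in w_hidden]
    y = sigmoid(sum(w * hj for w, hj in zip(w_out, h)))
    return h, y

def loss(w_hidden, w_out, x, target):
    # Assumed loss: squared error on one training example.
    _, y = forward(w_hidden, w_out, x)
    return 0.5 * (y - target) ** 2

def backprop(w_hidden, w_out, x, target):
    # Returns dE/dw for every weight, computed with the chain rule.
    h, y = forward(w_hidden, w_out, x)
    # Blame at the output node: dE/dnet_out = (y - t) * y * (1 - y).
    delta_out = (y - target) * y * (1.0 - y)
    grad_out = [delta_out * hj for hj in h]
    # Blame at each hidden node: the output blame passed back through w_out,
    # scaled by the hidden node's own sigmoid derivative.
    delta_hidden = [delta_out * w_out[j] * h[j] * (1.0 - h[j])
                    for j in range(len(h))]
    grad_hidden = [[dj * xi for xi in x] for dj in delta_hidden]
    return grad_hidden, grad_out

def train_step(w_hidden, w_out, x, target, lr=0.5):
    # Gradient-descent rule: delta_w = -lr * dE/dw.
    grad_hidden, grad_out = backprop(w_hidden, w_out, x, target)
    new_hidden = [[w - lr * g for w, g in zip(w_row, g_row)]
                  for w_row, g_row in zip(w_hidden, grad_hidden)]
    new_out = [w - lr * g for w, g in zip(w_out, grad_out)]
    return new_hidden, new_out

# Hypothetical example: 3 inputs (the first acts as a bias), 2 hidden nodes.
w_hidden = [[0.1, -0.2, 0.3], [0.4, 0.1, -0.1]]
w_out = [0.2, -0.3]
x = [1.0, 0.5, -1.0]
target = 1.0

before = loss(w_hidden, w_out, x, target)
for _ in range(100):
    w_hidden, w_out = train_step(w_hidden, w_out, x, target)
after = loss(w_hidden, w_out, x, target)
print(before, after)
```

Here delta_hidden is the "blame" assigned to each internal node. It depends only on the output node's blame and the connecting weight, which is why the same rule can be repeated layer by layer in deeper networks.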