•Matrix
Multiplication -- easiest solution
•A
systolic solution
•A
row/column solution + improvements
–More
work per “step”
–More
communication at a time
–Improve
locality
–Overlap
communication with computation
–Reorient
computation
•Discover
“best practical” MM
•Review
Role of Computation Model