Use Processors More Effectively
•Assign t rows and the corresponding t columns to each processor so it can compute a t ´ t subarray
A
B
C
Each element requires n iterations
Modern pipelined processors benefit from large block of work