Animated 3 by 3 weight-stationary systolic array computing C = X times W one clock cycle at a time.

Inputs X stream right →

Weights W resident · sums fall ↓

Result C = X·W

vector 0 → row 0 of C · vector 1 → row 1 of C · vector 2 → row 2 of C
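
The dataflow the animation steps through can be sketched in a few lines of Python. This is a minimal model, not the code behind the figure. It assumes PE (k, j) holds W[k][j], that element k of input vector i enters row k from the left at cycle i + k (a diagonal skew), and that each PE latches its input and its partial sum once per cycle. The function name `systolic_matmul` and the register names are hypothetical.

```python
def systolic_matmul(X, W):
    """Cycle-by-cycle model of a weight-stationary systolic array.

    Assumed layout: PE (k, j) keeps W[k][j] resident, x values move one PE
    to the right per cycle, partial sums move one PE down per cycle.
    Returns (C, cycles) with C = X * W.
    """
    M, K, N = len(X), len(W), len(W[0])
    x_reg = [[0] * N for _ in range(K)]  # x value latched in each PE
    p_reg = [[0] * N for _ in range(K)]  # partial sum latched in each PE
    C = [[0] * N for _ in range(M)]
    cycles = M + K + N - 2               # last result leaves at cycle M+K+N-3

    for t in range(cycles):
        new_x = [[0] * N for _ in range(K)]
        new_p = [[0] * N for _ in range(K)]
        for k in range(K):
            for j in range(N):
                if j == 0:
                    # Left edge: skewed feed, vector i reaches row k at cycle i + k.
                    i = t - k
                    x = X[i][k] if 0 <= i < M else 0
                else:
                    # Interior: take the value the left neighbour held last cycle.
                    x = x_reg[k][j - 1]
                # Partial sum from the PE above (zero at the top row).
                above = p_reg[k - 1][j] if k > 0 else 0
                new_x[k][j] = x
                new_p[k][j] = above + x * W[k][j]
        x_reg, p_reg = new_x, new_p

        # Bottom row: column j emits the finished sum for vector i this cycle.
        for j in range(N):
            i = t - (K - 1) - j
            if 0 <= i < M:
                C[i][j] = p_reg[K - 1][j]
    return C, cycles


# Example 3 by 3 operands (illustrative values, not taken from the figure).
X = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
W = [[1, 0, 2], [0, 1, 0], [3, 0, 1]]
C, cycles = systolic_matmul(X, W)
```

Under these assumptions, row i of C is complete once the bottom-right PE has emitted its sum for vector i, and the 3 by 3 case drains in 3 + 3 + 3 - 2 = 7 cycles.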