More On Tera
Since there is a 16-instruction minimum issue interval per thread, it takes 16 threads just to keep the processor fully utilized, before any latency is hidden
Each processor has 128 fully replicated contexts
Even synchronization latency can be covered
When everything works, the Tera should approximate a PRAM
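
The thread-count arithmetic above can be sketched with a small idealized model. This is an illustration only, not a description of the actual Tera issue logic: it assumes the processor can issue one instruction per cycle from any ready thread, and that a thread is unavailable after each issue for the longer of the 16-cycle issue interval and the latency of its operation. The function name `utilization` and its parameters are hypothetical.

```python
def utilization(threads, issue_interval=16, latency=0, contexts=128):
    """Fraction of cycles in which an instruction issues (idealized model).

    Assumptions (not taken from the Tera documentation):
      - one instruction can issue per cycle, from any ready thread
      - after issuing, a thread is unavailable for max(issue_interval, latency) cycles
      - at most `contexts` threads are resident on the processor at once
    """
    resident = min(threads, contexts)
    # Cycles a thread must wait between two of its own instructions.
    wait = max(issue_interval, latency)
    # Each resident thread can fill 1/wait of the issue slots, capped at 100%.
    return min(1.0, resident / wait)


if __name__ == "__main__":
    for n in (8, 16, 64, 128):
        print(n, "threads, no extra latency:", utilization(n))
    for n in (16, 64, 128):
        print(n, "threads, 128-cycle latency:", utilization(n, latency=128))
```

Under these assumptions, 16 threads are enough to saturate the processor when no operation takes longer than the issue interval, and the 128 contexts are what allow latencies of up to 128 cycles to be hidden completely. Longer latencies would leave issue slots idle in this model.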