T3D
•Shmem get and put eliminate synchronization for the processor, though the communication subsystem must
–Asymmetric
•There is a short sequence of instructions to initiate a transfer and then ~100 cycles
•A separate network implements global synchronization operations such as eureka
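
The one-sided behaviour described in the bullets above can be sketched with a toy model. This is an illustration only, not the Cray SHMEM API: the `PE` class, the `put`, `get`, and `eureka` functions, and the `instructions_run` counter are hypothetical names invented here, and the model ignores latency and the communication hardware entirely. It assumes that a put or get is issued by one processor and completes without the target processor executing anything, and that eureka behaves as a global OR across processors.

```python
# Toy model of one-sided put/get. All names are hypothetical,
# not the Cray SHMEM API; timing and hardware are not modelled.

class PE:
    """One processing element with a small local memory."""
    def __init__(self, rank, size=8):
        self.rank = rank
        self.mem = [0] * size
        self.instructions_run = 0  # work done by this PE's own processor

def put(src, dst, dst_off, values):
    """src writes values into dst's memory; only src's processor does work."""
    src.instructions_run += 1
    dst.mem[dst_off:dst_off + len(values)] = values

def get(src, dst, dst_off, n):
    """src reads n words from dst's memory; dst's processor is not involved."""
    src.instructions_run += 1
    return dst.mem[dst_off:dst_off + n]

def eureka(flags):
    """Global OR (assumed semantics): true once any PE has raised its flag."""
    return any(flags)

pes = [PE(r) for r in range(4)]
put(pes[0], pes[2], 3, [7, 9])        # PE 0 writes into PE 2's memory
fetched = get(pes[1], pes[2], 3, 2)   # PE 1 reads the same words back
found = eureka([False, False, True, False])
```

In this sketch PE 2 never runs an instruction, yet its memory is both written and read, which is the sense in which the processor needs no synchronization for the transfer itself.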