According to this:
http://developer.amd.com/gpu_assets/Shader Model 5-0 and Compute Shader.pps
The UAV is part of gather/scatter so not about GPU-System memory transfers.
Looking through the ATI/AMD 5800 architecture and review slides, nobody has mentioned anything on this. Why am I asking this?
In processing where there's not enough memory to pre-cache al graphics textures and 3D points for all the physics etc, then the system will have to mode data in/out. Without being able todo system-gpu and gpu-gpu transfers simultaneously this results is far far slower performance. In theory, the idea GPU should be able to DMA in & out to system memory why the stream processors are processing other independent data from memory..
CPU -> GPU physics data transfer
GPU -> GPU processing of physics data
GPU -> CPU physics data transfer
CPU -> GPU graphics data (textures/vertexes)
GPU -> GPU graphics rendering
repeat from first...
So because of this the GPU needs loads of memory to keep both the hi-res graphics AND the physics in the GPU memory:
CPU -> GPU physics data transfer/fragment shader programs
CPU -> GPU graphics data (textures/vertexes, shaders)
...
GPU -> GPU processing of physics data
GPU -> GPU process graphics data with physics data
GPU -> GPU graphics rendering
.. <at some point the game code has to process the entire scene to update with game AI move/user moves etc!>
So you end up doing everything in the GPU but you require a massive amount of GPU memory to effectively work.
In future I can see the move to push the entire game into the GPU, thus all the parallel processing (even full raytraced rendering) would be run within the GPU. CPU would result to coordination.
The problem is.. AMD have failed to spot this logical bottleneck.. nV's Fermi does provide 2 DMAs simultaneously (according to the slide set). This makes nV's performance for coupled physics/graphics {potentially} higher.
DX11's API allows the programmer to specify processing groups - however the AMD hardware cannot work efficiently with this at the moment.. perhaps the 5900 series
