Fiji performance has nothing to do with HBM.
512GB/s of bandwidth is 512GB/s of bandwidth no matter where it comes from or how it is achieved.
Fiji is Shader bottlenecked.
To some degree but things like latency and the nature of operation batching, etc. can have some impact also.