Well, you've opened a can of worms. That goes all the way back to 3DMark Time Spy's use of asynchronous compute. Async compute was originally intended for executing instructions in parallel.
Nvidia's hardware (Maxwell/Pascal) can do it, but at a much slower rate, or it chokes, which can cause crashes in the OS. Many thought Nvidia couldn't execute instructions in parallel, but it always could. So they came up with a DX11 way of doing it: concurrent execution. Wow, you've got me going back in time to that huge debate. Some of it can be found here:
https://steamcommunity.com/app/223850/discussions/0/366298942110944664/
When the Time Spy code was scrutinized, popular opinion suggested that 3DMark was favoring Nvidia hardware and forcing AMD hardware to use concurrent execution (which it was not intended for). 3DMark eventually posted their own article about its use. Mind you, this debate went on in several other forums, not just on Steam.
What does that mean now? Very good question indeed. IMO Turing should be able to execute instructions in parallel with a level of efficiency greater than or equal to Vega's. So keep your eye on games/benchmarks like Ashes of the Singularity.