The main problem with our average Joe user's way of seeing things, is that we're not that good at it.
1. we lack the required tools of seeing deep intro the code;
2. we lack the required knowledge to read it and interpret it adequately.
GPU Z, MSI Afterburner or other monitoring apps, may show 99% usage on the GPU, but it's quite clear that the video card isn't working that hard by the amount of heat it puts out or just by knowing that it's running an old or not that demanding code: dx 9 or 8 vs, newer versions or even on the same - DX11.
At the same time, the card may indeed work full speed, with everything on, BUT can work inefficiently.
Just checked the situation where I've got the most gains in FPS in BF 4 (roughly 50%) and although the GPU stays at 99% and the difference in CPU usage is under 4% (between Mantle and DX11.1), performance wise, the gap is HUGE. Another fact, vRAM without Mantle was around 1,6GB, while with Mantle, around 2,8GB. 1,8 vs. 2,1 GB RAM usage (w/o vs with mantle).
Coming back to Star Swarm, using Follow, Extreme and Timed Run settings, Mantle gives 99% usage on GPU, about 2,5GB vRAM used, while dx goes between 50-70% GPU usage and about 1,5GB vRAM. Judging by the fact they are really good with d3d, I'd say it speaks for itself.
An API like this is required and we ALL benefit from it. I pretty much would like to keep and use my hardware as close as possible to it's full potential. Name it Mantle, DX12, OpenGL 5 or whatever, doesn't really matter. Even AMD said that. But a change is required and not for the sake of changing things.