that data would prove if more l3 cache hinders performance as the op has stated. my suspicions are that the cpu with more cache would out perform the one with less cache in everything tested, clock for clock at same power targets.
The OP is talking about a single core 32bit chip running 20+ year old code.
I think if everything was even, the performance would be margin of error because AMD seem to have sized its caches pretty well for most tasks. Once the cache is full the difference would be the round trip journey to RAM. So massive.
What is needed is a CPU with 32gb of L1 running at the Thz range.