Trinity V's Piledriver, whats the diff?

mark2410 · 21 Oct 2013 at 12:51

okay so i got that trininy cpu have the onboard gpu side to them. i also get that trininy cpus range form deal to quad core and piledirvers go from quad to octo.

so, given i think im only interested in quad as i cant see what benefit id get form any more cores is there any real difference between them? i mean are there any performance differences between them. i feel like im not quite getting something here.

Martini1991 · 21 Oct 2013 at 13:04

Trinity also lacks L3 cache, is on another socket (FM2) which is dead, although works on the new FM2+ (Which isn't dead)

Piledriver is on AM3+ (Which may's well be dead) and has L3 cache.

steelwolf · 21 Oct 2013 at 20:50

Martini1991 said:
Piledriver is on AM3+ (Which may's well be dead) and has L3 cache.

I thought AMD were using AM3+ as there standardized socket from now on.

Martini1991 · 21 Oct 2013 at 20:52

AM3+ is from 2011.
FM2 was 2012 (Trinity) and FM2+ is 2013, and brand new (For Kaveri ; Steamroller APU's)

It's possible AM3+ will be killed off, with a new socket for 8 core CPU's.
I'm perhaps overplaying it, as it's far from official that AM3+ will be killed off, but it's looking likely.

steelwolf · 21 Oct 2013 at 20:56

Martini1991 said:
AM3+ is from 2011.
FM2 was 2012 (Trinity) and FM2+ is 2013, and brand new (For Kaveri ; Steamroller APU's)

It's likely AM3+ will be killed off, with a new socket for 8 core CPU's.

You may be right, its just what i read although it was written back in april.

http://www.tomshardware.com/news/FM1-FM2-AM3-AM3-AMD,21912.html

Martini1991 · 21 Oct 2013 at 20:57

That report doesn't make much sense really.

Avenged7Fold · 21 Oct 2013 at 21:52

AM3 and FM1 have been outdated sockets for a while now. Its like the first paragraph and title was written several years ago.

CAT-THE-FIFTH · 21 Oct 2013 at 23:31

Piledriver is the core used in Trinity and Vishera.

ALXAndy · 21 Oct 2013 at 23:49

CAT-THE-FIFTH said:
Piledriver is the core used in Trinity and Vishera.

Hmm you sure? I thought Piledriver was a controller refresh and the cores were the same Bulldozer cores?

Edit. That's a question btw as I'm not sure.

d_brennen · 21 Oct 2013 at 23:55

Too many code names. Thread title should be Trinity vs Vishera. Piledriver is the name of the core, which is common to both, minus L3 cache in APU and Athlon guise.

The lack of L3 makes a slight difference, mostly in productivity software.

CAT-THE-FIFTH · 22 Oct 2013 at 00:05

ALXAndy said:
Hmm you sure? I thought Piledriver was a controller refresh and the cores were the same Bulldozer cores?

Edit. That's a question btw as I'm not sure.

Piledriver had some tweaks as a core when compared to Bulldozer:

http://www.anandtech.com/show/5831/amd-trinity-review-a10-4600m-a-new-hope

Screen%20Shot%202012-05-14%20at%2010.38.42%20PM_575px.png

Anandtech said:
Fundamental to Piledriver is a significant switch in the type of flip-flops used throughout the design. Flip-flops, or flops as they are commonly called, are simple pieces of logic that store some form of data or state. In a microprocessor they can be found in many places, including the start and end of a pipeline stage. Work is done prior to a flop and committed at the flop or array of flops. The output of these flops becomes the input to the next array of logic. Normally flops are hard edge elements—data is latched at the rising edge of the clock.

In very high frequency designs however, there can be a considerable amount of variability or jitter in the clock. You either have to spend a lot of time ensuring that your design can account for this jitter, or you can incorporate logic that's more tolerant of jitter. The former requires more effort, while the latter burns more power. Bulldozer opted for the latter.

In order to get Bulldozer to market as quickly as possible, after far too many delays, AMD opted to use soft edge flops quite often in the design. Soft edge flops are the opposite of their harder counterparts; they are designed to allow the clock signal to spill over the clock edge while still functioning. Piledriver on the other hand was the result of a systematic effort to swap in smaller, hard edge flops where there was timing margin in the design. The result is a tangible reduction in power consumption. Across the board there's a 10% reduction in dynamic power consumption compared to Bulldozer, and some workloads are apparently even pushing a 20% reduction in active power. Given Piledriver's role in Trinity, as a mostly mobile-focused product, this power reduction was well worth the effort.

At the front end, AMD put in additional work to improve IPC. The schedulers are now more aggressive about freeing up tokens. Similar to the soft vs. hard flip flop debate, it's always easier to be conservative when you retire an instruction from a queue. It eases verification as you don't have to be as concerned about conditions where you might accidentally overwrite an instruction too early. With the major effort of getting a brand new architecture off of the ground behind them, Piledriver's engineers could focus on greater refinement in the schedulers. The structures didn't get any bigger; AMD just now makes better use of them.

The execution units are also a bit beefier in Piledriver, but not by much. AMD claims significant improvements in floating point and integer divides, calls and returns. For client workloads these gains show minimal (sub 1%) improvements.

Prefetching and branch prediction are both significantly improved with Piledriver. Bulldozer did a simple sequential prefetch, while Piledriver can prefetch variable lengths of data and across page boundaries in the L1 (mainly a server workload benefit). In Bulldozer, if prefetched data wasn't used (incorrectly prefetched) it would clog up the cache as it would come in as the most recently accessed data. However if prefetched data isn't immediately used, it's likely it will never be used. Piledriver now immediately tags unused prefetched data as least-recently-used, allowing the cache controller to quickly evict it if the prefetch was incorrect.

Another change is that Piledriver includes a perceptron branch predictor that supplements the primary branch predictor in Bulldozer. The perceptron algorithm is a history based predictor that's better suited for predicting certain branches. It works in parallel with the old predictor and simply tags branches that it is known to be good at predicting. If the old predictor and the perceptron predictor disagree on a tagged branch, the perceptron's path is taken. Improving branch prediction accuracy is a challenge, but it's necessary in highly pipelined designs. These sorts of secondary predictors are a must as there's no one-size-fits-all when it comes to branch prediction.

Finally, Piledriver also adds new instructions to better align its ISA with Haswell: FMA3 and F16C.

cheesyboy · 22 Oct 2013 at 09:17

Martini1991 said:
That report doesn't make much sense really.

Think CPU, not Motherboard. They're clearing down stock of FM1 and AM3 CPUs - we've already seen Phenom 965 (finally) disappear a couple of months ago, just got to get rid of those Semprons and Athlon X3s you can still buy, and the remaining Llano FM1 APUs.

Phixsator · 22 Oct 2013 at 10:20

cheesyboy said:
Think CPU, not Motherboard. They're clearing down stock of FM1 and AM3 CPUs - we've already seen Phenom 965 (finally) disappear a couple of months ago, just got to get rid of those Semprons and Athlon X3s you can still buy, and the remaining Llano FM1 APUs.

...Donate it all to 3rd world school efforts. Should be a win win for all. Good PR for AMD, new products for us, and who knows what a ton load of PCs can do for the 3rd world but i would guess its nothing short of huge.

Havana_UK · 22 Oct 2013 at 11:16

Phixsator said:
...Donate it all to 3rd world school efforts. Should be a win win for all. Good PR for AMD, new products for us, and who knows what a ton load of PCs can do for the 3rd world but i would guess its nothing short of huge.

Just imagine all the new spam we'll get about our long-lost uncle's fortunes!

mmj_uk · 22 Oct 2013 at 11:46

Does Trinity use the resonant clock mesh technology that reduces power consumption? I remember reading that it was left out of Piledriver because it hinders overclocking?

d_brennen · 22 Oct 2013 at 12:19

iirc Trinity and Richland have the kind of resonant clock mesh tech that AMD was working on. Doesn't hinder 6800Ks hitting ~5GHz

mark2410 · 22 Oct 2013 at 13:00

lol, so im not sure im feeling any more enlightened and clarified than i was before.

why does sa amd have so many differeing cores out anyway, llano, trinity, richland, etc etc its like they dont want it to be clear which chip sits where in their line up.

d_brennen · 22 Oct 2013 at 13:07

Rubbish inventory control.

dfour · 22 Oct 2013 at 13:33

Ignoring the socket differences, you have to remember that AMD have two core setups with its APU's and non-apu's and they sell them as 2 seperate lines where intel just have everything under the one line.

d_brennen · 22 Oct 2013 at 13:36

Just looking at OCUKs stock, there's socket 1155, 1150 and 2011 CPUs and even still 775 motherboards