Intel's Core i7 870 & i5 750, Lynnfield: Harder, Better, Faster Stronger
by Anand Lal Shimpi on September 8, 2009 12:00 AM EST- Posted in
- CPUs
Lynnfield's Turbo Mode: Up to 17% More Performance
Turbo on Bloomfield (the first Core i7) wasn't all that impressive. If you look back at our Core i7 article from last year you'll see that it's responsible for a 2 - 5% increase in performance depending on the application. All Bloomfield desktop CPUs had 130W TDPs, so each individual core had a bit more breathing room for how fast it could run. Lynnfield brings the TDP down around 27%, meaning each core gets less TDP to work with (the lower the TDP, the greater potential there is for turbo). That combined with almost a full year of improving yields on Nehalem means that Intel can be much more aggressive with Turbo on Lynnfield.
SYSMark 2007: Overall | Dawn of War II | Sacred 2 | World of Warcraft | |
Intel Core i7 870 Turbo Disabled | 206 | 74.3 fps | 84.8 fps | 60.6 fps |
Intel Core i7 870 Turbo Enabled | 233 | 81.0 fps | 97.4 fps | 70.7 fps |
% Increase from Turbo | 13.1% | 9.0% | 14.9% | 16.7% |
Turbo on Lynnfield can yield up to an extra 17% performance depending on the application. The biggest gains will be when running one or two threads as you can see from the table below:
Max Speed | Stock | 4 Cores Active | 3 Cores Active | 2 Cores Active | 1 Core Active |
Intel Core i7 870 | 2.93GHz | 3.20GHz | 3.20GHz | 3.46GHz | 3.60GHz |
Intel Core i7 860 | 2.80GHz | 2.93GHz | 2.93GHz | 3.33GHz | 3.46GHz |
Intel Core i5 750 | 2.66GHz | 2.80GHz | 2.80GHz | 3.20GHz | 3.20GHz |
If Intel had Turbo mode back when dual-cores first started shipping we would've never had the whole single vs. dual core debate. If you're running a single thread, this 774M transistor beast will turn off three of its cores and run its single active core at up to 3.6GHz. That's faster than the fastest Core 2 Duo on the market today.
WoW doesn't stress more than 2 cores, Turbo mode helps ensure the i7 870 is faster than Intel's fastest dual-core CPU
It's more than just individual application performance however, Lynnfield's turbo modes can kick in when just interacting with the OS or an application. Single threads, regardless of nature, can now execute at 3.6GHz instead of 2.93GHz. It's the epitomy of Intel's hurry up and get idle philosophy.
The ultimate goal is to always deliver the best performance regardless of how threaded (or not) the workload is. Buying more cores shouldn't get you lower clock speeds, just more flexibility. The top end Lynnfield is like buying a 3.46GHz dual-core processor that can also run well threaded code at 2.93GHz.
Take this one step further and imagine what happens when you have a CPU/GPU on the same package or better yet, on the same die. Need more GPU power? Underclock the CPU cores, need more CPU power? Turn off half the GPU cores. It's always availble, real-time-configurable processing power. That's the goal and Lynnfield is the first real step in that direction.
Speed Limits: Things That Will Keep Turbo Mode from Working
As awesome as it is, Turbo doesn't work 100% of the time, its usefulness varies on a number of factors including the instruction mix of active threads and processor cooling.
The actual instructions being executed by each core will determine the amount of current drawn and total TDP of the processor. For example, video encoding uses a lot of SSE instructions which in turn keep the SSE units busy on the chip; the front end remains idle and is clock gated, so power is saved there. The resulting power savings are translated into higher clock frequency. Intel tells us that video encoding should see the maximum improvement of two bins with all four cores active.
Floating point code stresses both the front end and back end of the pipe, here we should expect to see only a 133MHz increase from turbo mode if any at all. In short, you can't simply look at whether an app uses one, two or more threads. It's what the app does that matters.
There's also the issue of background threads running in the OS. Although your foreground app may only use a single thread, there are usually dozens (if not hundreds) of active threads on your system at any time. Just a few of those being scheduled on sleeping cores will wake them up and limit your max turbo frequency (Windows 7 is allegedly better at not doing this).
You can't really control the instruction mix of the apps you run or how well they're threaded, but this last point you can control: cooling. The sort-of trump all feature that you have to respect is Intel's thermal throttling. If the CPU ever gets too hot, it will automatically reduce its clock speed in order to avoid damaging the processor; this includes a clock speed increase due to turbo mode.
Lynnfield and its retail cooler
The retail cooler that ships with the Core i7 is tiny and while it's able to remove heat well enough to allow the chip to turbo up, we've seen instances where it doesn't turbo as well due to cooling issues. Just like we recommended in the Bloomfield days, an aftermarket cooler may suit you well.
Lynnfield: Made for Windows 7 (or vice versa)
Core Parking is a feature included in Windows 7 and enabled on any multi-socket machine or any system with Hyper Threading enabled (e.g. Pentium 4, Atom, Core i7). The feature looks at the performance penalty from migrating a thread from one core to another; if the fall looks too dangerous, Windows 7 won't jump - the thread will stay parked on that core.
What this fixes are a number of the situations where enabling Hyper Threading will reduce performance thanks to Windows moving a thread from a physical core to a logical core. This also helps multi-socket systems where moving a thread from one core to the next might mean moving it (and all of its data) from one memory controller to another one on an adjacent socket.
Core Parking can't help an application that manually assigns affinity to a core. We've still seen situations where HT reduces performance under Windows 7 for example with AutoCAD 2010 and World of Warcraft.
With support in the OS however, developers should have no reason to assign affinity in software - the OS is now smart enough to properly handle multi-socket and HT enabled machines.
343 Comments
View All Comments
lordmetroid - Tuesday, September 8, 2009 - link
I am using Linux!andrenb91 - Wednesday, September 9, 2009 - link
c'mon probably u still running windows for somethings...wine doesn't work owith every thin...i run liux on dual boot for years and still trying to make wine run fligh simulator x..which is the only game I play...remember, these benchmarkes are only for win bases pcs, in linux the history is diferent, see it at phoronix.com...james jwb - Tuesday, September 8, 2009 - link
is turbo boost on for the benchmarks?snakeoil - Tuesday, September 8, 2009 - link
yes they benchmarked with turbo boost, that is cheating because thats overclocking the processor at least 600 mhz and presenting the results as it were at stock speeds.that's abusing the reader's trust.
maxxcool - Tuesday, September 8, 2009 - link
Hahaha, you are just as much of a idiot here as on techreport snake! ... did you come here and claim to have proof that i5 will not run xp-mode to?hahahaha, your just sad that Amd did not come up with this feature 1st.
Jarp Habib - Tuesday, September 8, 2009 - link
"yes they benchmarked with turbo boost, that is cheating because thats overclocking the processor at least 600 mhz and presenting the results as it were at stock speeds.that's abusing the reader's trust. "
This statement is a load of bullcrap. Anand's intent is to present the benchmarks in a way reflective of the chip's standard performance in normal use- hence not manually overclocking for maximized performance. The processor's very design revolves on itself automatically shutting down inactive cores and boosting the speed of active cores, *regardless* of what the end user does to the chip in BIOS or what apps he's running. Since all you need to do to use Turbo Boost is just *install the CPU in your system* then benchmarks should be run with it enabled.
If you want to COMPLETELY level the playing field, then TurboBoost should be shut down, for both Bloomfield i7 chips and Lynnfield i5 AND Lynnfield i7, as well as future i3 and i9. Also, HyperThreading must be disabled from all chips, 3DNow!, SpeedStep, Cool N' Quiet, MMX and the entire SSE instruction sets. After all, each different type of CPU executes those standard instruction sets differently. And since the SpeedStep and Cool N Quiet instructions force the chip to underclock and shut off cores while at idle, they must be eliminated from testing as well, or they'll throw off your idle power consumption benchmarks.
Since you will be normalizing the clock frequencies as well, you can save time by only needing to test just one chip from each product line. I'm not sure just how you will normalize the clock frequencies of your test units *without overclocking or underclocking* some of them though. Perhaps you'll let me know?
Meanwhile, back in the real world...
Voo - Tuesday, September 8, 2009 - link
The difference is, that turbo mode impairs the possible benefit of overclocking the chip, while most things you enumerated do not.If you want to get the maximum out of the 860 you've got to disable turbo mode as we see in the review, so for everyone who'd want to overclock their CPU the most interesting test would be a comparison between the two chips both at their maximum stable performance. Which at the moment means disabling turbo mode as we can see.
erple2 - Tuesday, September 8, 2009 - link
A-HA! So really, you're just interested in the benchmark "What does the maximum overclock do", not "How does the CPU perform at normal operations". BTW, does disabling HT does improve overclocking a little bit, so should that also be disabled? Cool-n-Quiet plus SpeedStep may also affect overclocking capabilities. Should those be disabled? I fail to see the difference between what the GP said and your justifications.MadMan007 - Tuesday, September 8, 2009 - link
I'm not bothered by enabled Turboboost in a 'stock speed' review either but I would really like to see more sites run their benchmark suite with 3.6-4.0GHz (or higher) C2D, C2Q and Phenom II versus overclocked but non-Turboboost i5/i7. The reason is that this type of comparison would be most directly useful for the site's enthusiast readerships to know what the actual difference between *their rig* and an i5/i7 would be.Kaleid - Thursday, September 10, 2009 - link
Seconded.