原帖由 qcmadness 於 2011-8-16 09:43 發表
Core i7-3820: 4c8t, 3.6GHz/3.9GHz, 10MB L3 cache, 130W TDP, $294
Core i7-3930K: 6c12t, 3.2GHz/3.8GHz, 12MB L3 cache, 130W TDP, $583
Core i7-3960X: 6c12t, 3.3GHz/3.9GHz, 15MB L3 cache, 130W TDP, $999
...
原帖由 ccw 於 2011-8-16 09:48 發表
$294 very ok la.
It will out-perform 2600k in software like Photoshop CS5 with its quad channel ram.
原帖由 ccw 於 2011-8-16 10:06 發表
Sure if you take OC as an adv of 2600k, while personally I insist to stick with stock freq.
And is it really freq dependent on tasks like Photoshop CS5?
I have heard that a i7 950 can beat 2600 in C ...
原帖由 qcmadness 於 2011-8-16 10:15 發表
yes, freq dependent
http://www.anandtech.com/bench/CPU/25?i=287.288.363.192.100.47
原帖由 ccw 於 2011-8-16 10:43 發表
I see, I did heard someone claiming that the no. of ram channels made a difference for CS5(not 4),
while I cannot remember where I read that.
It was once forwarded by me to this forum.
原帖由 ccw 於 2011-8-16 10:50 發表
That's what makes me think that SBE 4C8T w/ quad channel rams can be favoring some specific tasks that i7 2600 may show its weakness at.
For it being priced the same as i7 2600, no doubt it is a new ...
原帖由 Henry 於 2011-8-16 15:45 發表
Video editing.
Why CUDA can do much faster than CPU, one of the reason is RAM bandwidth.
原帖由 qcmadness 於 2011-8-16 16:14 發表
of course not...
CUDA / Quicksync do faster because they are not flexible
原帖由 qcmadness 於 2011-8-16 16:14 發表
of course not...
CUDA / Quicksync do faster because they are not flexible
原帖由 Henry 於 2011-8-16 17:13 發表
simple maths, but many ops which is not much predictable and few can be cached in L2, so bandwidth to RAM is important. (need big fast pool)
QS not to comment, as mechanism is different.
CUDA, if low ...
原帖由 qcmadness 於 2011-8-17 00:29 發表
SIMD data唔駛太大的memory bandwidth
predict到, prefetch到既, 根本唔需要大bandwidth
graphics要咁大bandwidth有時係因為predict唔到
原帖由 Henry 於 2011-8-17 01:22 發表
The matrices to be processed in film ripping is similar to those in 3D games, often not so predictable. SIMD and AVX do help some due to data batching and perform vector or matrix operations each tim ...
原帖由 qcmadness 於 2011-8-17 01:46 發表
if you know the latency of a SIMD / AVX instruction, you will know why i say that the bandwidth is not important
and do you remember why p4 > c2d has doubled the SSEx performance while the bandwidth ...
原帖由 qcmadness 於 2011-8-17 01:55 發表
http://www.anandtech.com/bench/Product/100?vs=108
i7 950: 3.06GHz / 3.33GHz Turbo / 32.0 GB/s
i7 860: 2.80GHz / 3.46GHz Turbo / 21.3 GB/s
The max of ~10% gain is nowhere significant
原帖由 Henry 於 2011-8-17 02:59 發表
Increase in bandwidth needs the help of architecture in order to have improvements.
860 and 950 basically is the same chip.
If as you said, bandwidth is not so important, I think CUDA cards in datac ...
原帖由 Henry 於 2011-8-17 02:59 發表
Increase in bandwidth needs the help of architecture in order to have improvements.
860 and 950 basically is the same chip.
If as you said, bandwidth is not so important, I think CUDA cards in datac ...
I think we confirmed what we pretty much knew all along: Sandy Bridge's improved memory controller has all but eliminated the need for extreme memory bandwidth, at least for this architecture. It's only when you get down to DDR3-1333 that you see a minor performance penalty. The sweet spot appears to be at DDR3-1600, where you will see a minor performance increase over DDR3-1333 with only a slight increase in cost. The performance increase gained by going up to DDR3-1866 or DDR3-2133 isn't nearly as pronounced.
原帖由 qcmadness 於 2011-8-17 03:52 發表
http://www.anandtech.com/show/45 ... ing-the-best-ddr3/8
SB nowhere requires another 20GB/s memory bandwidth when ~20GB/s is enough.
原帖由 Henry 於 2011-8-17 04:48 發表
I have already read the review long time ago.
As I said, depends on arch, OPs performance, cores, also the tasks.
If CPU cannot feed the RAM bandwidth, CPU is the bottleneck.
If CPU need to wait for ...
原帖由 qcmadness 於 2011-8-17 05:01 發表
1. you still have not shown any tasks which scales well with memory bandwidth
2. SB-E for desktop only 4 and 6 oores. 8 cores is your IMAGINARY product
3. http://www.anandtech.com/bench/Product/313? ...
原帖由 Henry 於 2011-8-17 05:39 發表
1. I am sorry I cannot show this moment, but does not scale well =/= does not scale. (Something similar should be the first test in your link 3)
2. 8 core is still available in LGA2011 form, and I ca ...
原帖由 qcmadness 於 2011-8-17 11:58 發表
1. of course it scales, but is the scaling meaningful is another question
2. i don't think so, server socket for 4 qpi connections is different
3. decompression / compression is particularly bandwid ...
歡迎光臨 HKSpot (https://bbs.hk-spot.com/) | Powered by Discuz! 6.0 Lite |