A few secrets about GTX 970
UPDATE 1/28/15 @ 10:25am ET: NVIDIA has posted in its official GeForce.com forums that they are working on a driver update to help alleviate memory performance issues in the GTX 970 and that they will "help out" those users looking to get a refund or exchange.
Yes, that last 0.5GB of memory on your GeForce GTX 970 does run slower than the first 3.5GB. More interesting than that fact is the reason why it does, and why the result is better than you might have otherwise expected. Last night we got a chance to talk with NVIDIA’s Senior VP of GPU Engineering, Jonah Alben on this specific concern and got a detailed explanation to why gamers are seeing what they are seeing along with new disclosures on the architecture of the GM204 version of Maxwell.
NVIDIA's Jonah Alben, SVP of GPU Engineering
For those looking for a little background, you should read over my story from this weekend that looks at NVIDIA's first response to the claims that the GeForce GTX 970 cards currently selling were only properly utilizing 3.5GB of the 4GB frame buffer. While it definitely helped answer some questions it raised plenty more which is whey we requested a talk with Alben, even on a Sunday.
Let’s start with a new diagram drawn by Alben specifically for this discussion.
GTX 970 Memory System
Believe it or not, every issue discussed in any forum about the GTX 970 memory issue is going to be explained by this diagram. Along the top you will see 13 enabled SMMs, each with 128 CUDA cores for the total of 1664 as expected. (Three grayed out SMMs represent those disabled from a full GM204 / GTX 980.) The most important part here is the memory system though, connected to the SMMs through a crossbar interface. That interface has 8 total ports to connect to collections of L2 cache and memory controllers, all of which are utilized in a GTX 980. With a GTX 970 though, only 7 of those ports are enabled, taking one of the combination L2 cache / ROP units along with it. However, the 32-bit memory controller segment remains.
You should take two things away from that simple description. First, despite initial reviews and information from NVIDIA, the GTX 970 actually has fewer ROPs and less L2 cache than the GTX 980. NVIDIA says this was an error in the reviewer’s guide and a misunderstanding between the engineering team and the technical PR team on how the architecture itself functioned. That means the GTX 970 has 56 ROPs and 1792 KB of L2 cache compared to 64 ROPs and 2048 KB of L2 cache for the GTX 980. Before people complain about the ROP count difference as a performance bottleneck, keep in mind that the 13 SMMs in the GTX 970 can only output 52 pixels/clock and the seven segments of 8 ROPs each (56 total) can handle 56 pixels/clock. The SMMs are the bottleneck, not the ROPs.
Subject: Graphics Cards | January 23, 2015 - 11:09 PM | Sebastian Peak
Tagged: nvidia, gtx 960, graphics drivers, graphics cards, GeForce 347.25, geforce, game ready, dying light
With the release of GTX 960 yesterday NVIDIA also introduced a new version of the GeForce graphics driver, 347.25 - WHQL.
NVIDIA states that the new driver adds "performance optimizations, SLI profiles, expanded Multi-Frame Sampled Anti-Aliasing support, and support for the new GeForce GTX 960".
While support for the newly released GPU goes without saying, the expanded MFAA support will help provide better anti-aliasing performance to many existing games, as “MFAA support is extended to nearly every DX10 and DX11 title”. In the release notes three games are listed that do not benefit from the MFAA support, as “Dead Rising 3, Dragon Age 2, and Max Payne 3 are incompatible with MFAA”.
347.25 also brings additional SLI profiles to add support for five new games, and a DirectX 11 SLI profile for one more:
SLI profiles added
- Black Desert
- Lara Croft and the Temple of Osiris
- Zhu Xian Shi Jie
- The Talos Principle
DirectX 11 SLI profile added
- Final Fantasy XIV: A Realm Reborn
The update is also the Game Ready Driver for Dying Light, a zombie action/survival game set to debut on January 27.
Much more information is available under the release notes on the driver download page, and be sure to check out Ryan’s chat with Tom Peterson from the live stream for a lot more information about this driver and the new GTX 960 graphics card.
Subject: General Tech, Graphics Cards | January 22, 2015 - 06:44 PM | Ryan Shrout
Tagged: video, tom petersen, nvidia, maxwell, live, gtx 960, gtx, GM206, geforce
UPDATE 2: If you missed the live stream you missed the prizes! But you can still watch the replay to get all the information and Q&A that went along with it as we discuss the GTX 960 and many more topics from the NVIDIA universe.
UPDATE (1/22): Well, the secret is out. Today's discussion will be about the new GeForce GTX 960, a $199 graphics card that takes power efficiency to a previously un-seen level! If you haven't read my review of the card yet, you should do so first, but then be sure you are ready for today's live stream and giveaway - details below! And don't forget: if you have questions, please leave them in the comments!
Get yourself ready, it’s time for another GeForce GTX live stream hosted by PC Perspective’s Ryan Shrout and NVIDIA’s Tom Petersen. Though we can’t dive into the exact details of what topics are going to be covered, intelligent readers that keep an eye on the rumors on our site will likely be able to guess what is happening on January 22nd.
On hand to talk about the products, answer questions about technologies in the GeForce family including GPUs, G-Sync, GameWorks, GeForce Experience and more will be Tom Petersen, well known on the LAN party and events circuit. To spice things up as well Tom has worked with graphics card partners to bring along a sizeable swag pack to give away LIVE during the event, including new GTX graphics cards. LOTS of graphics cards.
NVIDIA GeForce GTX 960 Live Stream and Giveaway
10am PT / 1pm ET - January 22nd
Need a reminder? Join our live mailing list!
Here are some of the prizes we have lined up for those of you that join us for the live stream:
- 3 x MSI GeForce GTX 960 Graphics Cards
- 4 x EVGA GeForce GTX 960 Graphics Cards
- 3 x ASUS GeForce GTX 960 Graphics Cards
Thanks to ASUS, EVGA and MSI for supporting the stream!
The event will take place Thursday, January 22nd at 1pm ET / 10am PT at http://www.pcper.com/live. There you’ll be able to catch the live video stream as well as use our chat room to interact with the audience, asking questions for me and Tom to answer live. To win the prizes you will have to be watching the live stream, with exact details of the methodology for handing out the goods coming at the time of the event.
Tom has a history of being both informative and entertaining and these live streaming events are always full of fun and technical information that you can get literally nowhere else. Previous streams have produced news as well – including statements on support for Adaptive Sync, release dates for displays and first-ever demos of triple display G-Sync functionality. You never know what’s going to happen or what will be said!
If you have questions, please leave them in the comments below and we'll look through them just before the start of the live stream. Of course you'll be able to tweet us questions @pcper and we'll be keeping an eye on the IRC chat as well for more inquiries. What do you want to know and hear from Tom or I?
So join us! Set your calendar for this coming Thursday at 1pm ET / 10am PT and be here at PC Perspective to catch it. If you are a forgetful type of person, sign up for the PC Perspective Live mailing list that we use exclusively to notify users of upcoming live streaming events including these types of specials and our regular live podcast. I promise, no spam will be had!
Subject: Graphics Cards | January 22, 2015 - 01:44 PM | Jeremy Hellstrom
Tagged: video, nvidia, msi gaming 2g, maxwell, gtx 960, GM206, geforce
Did Ryan somehow miss a benchmark that is important to you? Perhaps [H]ard|OCP's coverage of the MSI GeForce GTX 960 GAMING 2G will capture that certain something. MSI runs their 960 at a base of 1216MHz with the boost clock hitting 1279MHz, slightly slower than the ASUS STRIX at 1291 MHz and 1317 MHz. At the time this was posted the cards were available on Amazon for $210, that is obviously going to change so keep an eye out. As [H] states in their conclusions, it is a good value but not the great value which the GTX 970 offered at release, check out their full review here or one of the many down below.
"NVIDIA is today launching a GPU aimed at the "sweet spot" of the video card market. With an unexpectedly low MSRP, we find out if the new GeForce GTX 960 has what it takes to compete with the competition. The MSI GTX 960 GAMING reviewed here today is a retail card you will be able to purchase. No reference card in this review."
Here are some more Graphics Card articles from around the web:
- Nvidia's GeForce GTX 960 @ The Tech Report
- Zotac GTX 960 AMP!-edition @ Bjorn3d
- NVIDIA GeForce GTX 960: A Great $200 GPU For Linux Gamers @ Phoronix
- Palit GTX 960 Super JetStream 2 GB @ techPowerUp
- Gigabyte GTX 960 G1 Gaming 2GB @ Modders-Inc
- NVIDIA, MSI, EVGA GTX 960 Review @ OCC
- NVIDIA GeForce GTX 960 SLI @ techPowerUp
- EVGA GTX 960 Super Superclocked Video Card Review @ Hardware Asylum
- ASUS STRIX GTX 960 Review @ Neoseeker
- MSI GTX 960 Gaming OC 2 GB @ techPowerUp
- GTX 960 @ HardwareHeaven
- Gigabyte GTX960 G1 Gaming SOC @ Kitguru
- EVGA GTX 960 SSC 2 GB @ techPowerUp
- ASUS GTX 960 STRIX OC 2 GB @ techPowerUp
- Asus GTX960 Strix OC Edition @ Kitguru
- ASUS Strix Edition GeForce GTX 960 Graphics Card Review @ Techgage
- Palit GeForce GTX 960 JetStream @ Legion Hardware
- The NVIDIA GTX 960 Performance Review @ Hardware Canucks
- EVGA GeForce GTX 970 SSC ACX 2.0 @ HardwareOverlock
- NVIDIA GeForce GTX 970/980: Windows vs. Ubuntu Linux Performance @ Phoronix
- 22-Way AMD+NVIDIA Graphics Card Tests With Metro Redux On Steam For Linux @ Phoronix
A new GPU, a familiar problem
Editor's Note: Don't forget to join us today for a live streaming event featuring Ryan Shrout and NVIDIA's Tom Petersen to discuss the new GeForce GTX 960. It will be live at 1pm ET / 10am PT and will include ten (10!) GTX 960 prizes for participants! You can find it all at http://www.pcper.com/live
There are no secrets anymore. Calling today's release of the NVIDIA GeForce GTX 960 a surprise would be like calling another Avenger's movie unexpected. If you didn't just assume it was coming chances are the dozens of leaks of slides and performance would get your attention. So here it is, today's the day, NVIDIA finally upgrades the mainstream segment that was being fed by the GTX 760 for more than a year and half. But does the brand new GTX 960 based on Maxwell move the needle?
But as you'll soon see, the GeForce GTX 960 is a bit of an odd duck in terms of new GPU releases. As we have seen several times in the last year or two with a stagnant process technology landscape, the new cards aren't going be wildly better performing than the current cards from either NVIDIA for AMD. In fact, there are some interesting comparisons to make that may surprise fans of both parties.
The good news is that Maxwell and the GM206 GPU will price out starting at $199 including overclocked models at that level. But to understand what makes it different than the GM204 part we first need to dive a bit into the GM206 GPU and how it matches up with NVIDIA's "small" GPU strategy of the past few years.
The GM206 GPU - Generational Complexity
First and foremost, the GTX 960 is based on the exact same Maxwell architecture as the GTX 970 and GTX 980. The power efficiency, the improved memory bus compression and new features all make their way into the smaller version of Maxwell selling for $199 as of today. If you missed the discussion on those new features including MFAA, Dynamic Super Resolution, VXGI you should read that page of our original GTX 980 and GTX 970 story from last September for a bit of context; these are important aspects of Maxwell and the new GM206.
NVIDIA's GM206 is essentially half of the full GM204 GPU that you find on the GTX 980. That includes 1024 CUDA cores, 64 texture units and 32 ROPs for processing, a 128-bit memory bus and 2GB of graphics memory. This results in half of the memory bandwidth at 112 GB/s and half of the peak compute capability at 2.30 TFLOPS.
Subject: Graphics Cards | January 14, 2015 - 10:49 AM | Sebastian Peak
Tagged: rumors, NVIDA, leak, gtx 960, gpu, geforce
The GPU news and rumor site VideoCardz.com had yet another post about the GTX 960 yesterday, and this time the site claims they have most of the details about this unreleased GPU with new leaked photos from a forum on the Chinese site PCEVA.
The card is reportedly based on Maxwell GM206, a 1024 CUDA core part recently announced with the introduction of the GTX 965M. Clock speed was not listed but alleged screenshots indicate the sample had a 1228 MHz core and 1291 MHz Boost clock. The site is calling this an overclock, but it's still likely that the core would have a faster clock speed than the GTX 970 and 980.
The card will reportedly feature 2GB of 128-bit GDDR5 memory, though doubtless 4GB variants would likely be available after launch from the various vendors (an important option considering the possibility of the new card natively supporting triple DisplayPort monitors). Performance will clearly be a step down from the initial GTX 900-series offerings as NVIDIA has led with their more performant parts, but the 960 should still be a solid choice for 1080p gaming if these screenshots are real.
The specs as listed on the page at VideoCardz.com are follows (they do not list clock speed):
- 28nm GM206-300 GPU
- 1024 CUDA cores
- 64(?) TMUs
- 32 ROPs
- 1753 MHz memory
- 128-bit memory bus
- 2GB memory size
- 112 GB/s memory bandwidth
- DirectX 11.3/12
- 120W TDP
- 1x 6-pin power connector
- 1x DVI-I, 1x HDMI 2.0, 3x DP
We await official word on pricing and availability for this unreleased GPU.
Subject: Graphics Cards | January 13, 2015 - 02:28 PM | Sebastian Peak
Tagged: rumors, nvidia, multi monitor, mini-ITX GPU, leak, HDMI 2.0, gtx 960, gpu, geforce, DisplayPort
The crew at VideoCardz.com have been reporting some GTX 960 sightings lately, and today they've added no less than three new cards from KFA2, the "European premium brand" of Galaxy.
The reported reference design GTX 960 (VideoCardz.com)
Such reports are becoming more common, with the site posting photos that appear to be other vendors' versions of the new GPU here, here, and here. Of note with these new alleged photos on what appears to be a reference design board: no less than three DisplayPort outputs, as well as HDMI 2.0 and DVI:
Reported GTX 960 outputs (VideoCardz.com)
This would be big news for multi-monitor users as it would provide potential support three high-resolution DisplayPort monitors from a single card in a strictly non-gaming environment (unless you happen to enjoy the frame-rates of an oil painting).
The reported mini-ITX GTX 960 (VideoCardz.com)
The other designs shown in the post include a mini-ITX form-factor design still sporting the triple DisplayPorts, HDMI and DVI, and a larger EXOC edition built on a custom PCB.
Reported EXOC GTX 960 (VideoCardz.com)
The EXOC edition apparently drops the multi-DisplayPort option in favor of a second DVI output, leaving just one DisplayPort along with the lone HDMI 2.0 output.
With the GTX 960 leaks coming in daily now it seems likely that we would be hearing something official soon.
Subject: Graphics Cards | January 6, 2015 - 09:44 AM | Sebastian Peak
Tagged: rumor, nvidia, leak, gtx 960, GM206, geforce
VideoCardz.com is reporting that they not only know the upcoming GTX 960 core will be the GM206, but they reportedly have a photo of the unreleased chip.
Why are reported leaks always slightly out of focus? (Credit: VideoCardz.com)
The chip pictured appears to be a GM206-300, which the site claims will be the exact variant in the GTX 960 when it is released. The post speculates that based on the die size we can expect between 8 - 10 SMM's, or 1080 - 1280 CUDA cores. They further claim that the GTX 960 will have a 128-bit memory bus and reference cards will have a 2GB frame buffer (though naturally we can expect models with 4GB of memory after launch).
The post goes on to show what appears to be a search result for an ASUS GTX 960 on their site, but if this existed it has since been taken down. More than likely a GTX 960 is in fact close at hand, and the reported specs (and now multiple claimed listings for the card) are not hard to fathom.
We will keep you updated on this alleged new GPU if more details emerge.
Big Power, Small Size
Though the mindset that a small PC is a slow PC is fading, there are still quite a few readers out there that believe the size of your components will indicate how well they perform. That couldn't be further from the case, and this week we decided to build a small, but not tiny, PC to showcase that small can be beautiful too!
Below you will find a complete list of parts and components used in our build - but let me say right off the bat, to help alleviate as much vitriol in the comments as possible, there are quite a few ways you could build this system to either get a lower price, or higher performance, or quieter design, etc. Our selections were based on a balance of both with a nod towards expansion in a few cases.
Take a look:
|MicroATX Gaming Build|
|Processor||Intel Core i7-4790K - $334
Corsair Hydro Series H80i - $87
|Motherboard||Gigabyte Z97MX-Gaming 5 - $127|
|Memory||G.Skill Ripjaws X 8GB DDR3-2133 - $88|
|Graphics Card||EVGA GeForce GTX 970 FTW - $399|
|Storage||Samsung 250GB 850 EVO - $139
Western Digital 2TB Green - $79
|Case||Corsair Carbide Series Air 240 - $89|
|Power Supply||Seasonic Platinum 860 watt PSU - $174|
|OS||Windows 8.1 x64 - $92|
|Total Price||$1602 - Amazon Full Cart|
The starting point for this system is the Intel Core i7-4790K, the top-end Haswell processor for the Z97 chipset. In fact, the Core i7-4790K is a Devil's Canyon part, created by Intel to appease the enthusiast looking for an overclockable and high clocked quad-core part. This CPU will only lag behind the likes of the Haswell-E LGA2011 processors, but at just $340 or so, is significantly less expensive. Cooling the 4790K is Corsair's Hydro Series H80i double-thickness self contained water cooler.
For the motherboard I selected the Gigabyte Z97MX-Gaming 5, a MicroATX motherboard that combines performance and features in a mATX form factor, perfect for our build. This board includes support for SLI and CrossFire, has audio OP-AMP support, USB ports dedicated for DACs, M.2 storage support, Killer networking and more.
Subject: General Tech, Graphics Cards | December 28, 2014 - 09:47 PM | Scott Michaud
Tagged: radeon, nvidia, gtx, geforce, amd
According to an anonymous source of WCCFTech, AMD is preparing a 20nm-based graphics architecture that is expected to release in April or May. Originally, they predicted that the graphics devices, which they call R9 300 series, would be available in February or March. The reason for this “delay” is a massive demand for 20nm production.
The source also claims that NVIDIA will skip 20nm entirely and instead opt for 16nm when that becomes available (which is said to be mid or late 2016). The expectation is that NVIDIA will answer AMD's new graphics devices with a higher-end Maxwell device that is still at 28nm. Earlier rumors, based on a leaked SiSoftware entry, claim 3072 CUDA cores that are clocked between 1.1 GHz and 1.39 GHz. If true, this would give it between 6.75 and 8.54 TeraFLOPs of performance, the higher of which is right around the advertised performance of a GeForce Titan Z (only in a single compute device that does not require distribution of work like what SLI was created to automate).
Will this strategy work in NVIDIA's favor? I don't know. 28nm is a fairly stable process at this point, which will probably allow them to get chips that can be bigger and more aggressively clocked. On the other hand, they pretty much need to rely upon chips that are bigger and more aggressively clocked to be competitive with AMD's slightly more design architecture. Previous rumors also hint that AMD is looking at water-cooling for their reference card, which might place yet another handicap against NVIDIA, although cooling is not an area that NVIDIA struggles in.