Looking Towards the Professionals
This is a multi-part story for the NVIDIA Titan V:
Earlier this week we dove into the new NVIDIA Titan V graphics card and looked at its performacne from a gaming perspective. Our conclusions were more or less what we expected - the card was on average ~20% faster than the Titan Xp and about ~80% faster than the GeForce GTX 1080. But with that $3000 price tag, the Titan V isn't going to win any enthusiasts over.
What the Titan V is meant for in reality is the compute space. Developers, coders, engineers, and professionals that use GPU hardware for research, for profit, or for both. In that case, $2999 for the Titan V is simply an investment that needs to show value in select workloads. And though $3000 is still a lot of money, keep in mind that the NVIDIA Quadro GP100, the most recent part with full-performance double precision compute from the Pascal chip, is still selling for well over $6000 today.
The Volta GV100 GPU offers 1:2 double precision performance, equating to 2560 FP64 cores. That is a HUGE leap over the GP102 GPU used on the Titan Xp that uses a 1:32 ratio, giving us just 120 FP64 cores equivalent.
|Titan V||Titan Xp||GTX 1080 Ti||GTX 1080||GTX 1070 Ti||GTX 1070||RX Vega 64 Liquid||Vega Frontier Edition|
|Base Clock||1200 MHz||1480 MHz||1480 MHz||1607 MHz||1607 MHz||1506 MHz||1406 MHz||1382 MHz|
|Boost Clock||1455 MHz||1582 MHz||1582 MHz||1733 MHz||1683 MHz||1683 MHz||1677 MHz||1600 MHz|
|Memory Clock||1700 MHz MHz||11400 MHz||11000 MHz||10000 MHz||8000 MHz||8000 MHz||1890 MHz||1890 MHz|
|384-bit G5X||352-bit G5X||256-bit G5X||256-bit||256-bit||2048-bit HBM2||2048-bit HBM2|
|Memory Bandwidth||653 GB/s||547 GB/s||484 GB/s||320 GB/s||256 GB/s||256 GB/s||484 GB/s||484 GB/s|
|TDP||250 watts||250 watts||250 watts||180 watts||180 watts||150 watts||345 watts||300 watts|
|Peak Compute||12.2 (base) TFLOPS
14.9 (boost) TFLOPS
|12.1 TFLOPS||11.3 TFLOPS||8.2 TFLOPS||7.8 TFLOPS||5.7 TFLOPS||13.7 TFLOPS||13.1 TFLOPS|
|Peak DP Compute||6.1 (base) TFLOPS
7.45 (boost) TFLOPS
|0.37 TFLOPS||0.35 TFLOPS||0.25 TFLOPS||0.24 TFLOPS||0.17 TFLOPS||0.85 TFLOPS||0.81 TFLOPS|
The current AMD Radeon RX Vega 64, and the Vega Frontier Edition, all ship with a 1:16 FP64 ratio, giving us the equivalent of 256 DP cores per card.
Test Setup and Benchmarks
Our testing setup remains the same from our gaming tests, but obviously the software stack is quite different.
|PC Perspective GPU Testbed|
|Processor||Intel Core i7-5960X Haswell-E|
|Motherboard||ASUS Rampage V Extreme X99|
|Memory||G.Skill Ripjaws 16GB DDR4-3200|
|Storage||OCZ Agility 4 256GB (OS)
Adata SP610 500GB (games)
|Power Supply||Corsair AX1500i 1500 watt|
|OS||Windows 10 x64|
Applications in use include:
- Cinebench R15
- Sisoft Sandra GPU Compute
- SPECviewperf 12.1
Let's not drag this along - I know you are hungry for results! (Thanks to Ken for running most of these tests for us!!)
Subject: Graphics Cards | December 7, 2017 - 01:31 AM | Tim Verry
Tagged: pascal, GTX 1070Ti, GP104, gigabyte, aorus
Gigabyte is jumping into the custom GTX 1070 Ti fray with the Aorus branded GeForce GTX 1070 Ti Aorus. The new custom graphics card measures 280 x 111 x 38mm and features a WindForce 3 cooler with backplate and a custom 6+2 power phase.
Backlit by RGB Fusion LEDs, the Aorus logo sits on the side of the card and can be configured to the color of your choice. The shroud is black with orange accents and has sharp stealth angles that is minimal by comparison with other cards. Gigabyte is using a fairly beefy heatsink with this card. Specifically, three 80mm fans push air over a beefy heatsink that consists of three fin stacks connected by four composite heatpipes. Further, the cooler uses direct contact for the heatpipes above the GPU and a metal plate with thermal pads to cover the GDDR5 memory chips. The rightmost fin stack cools the MOSFETs. Additionally, the full cover backplate adds rigidity to the card and has a copper plate to draw excess heat from the underside of the GPU.
The Aorus graphics card is powered by a single 8-pin PCI-E power connector that feeds a 6+2 power phase. External video outputs include one DVI, one HDMI 2.0b, and three DisplayPort 1.4 ports.
The Pascal-based GP104-300 GPU (2432 CUDA cores, 152 TMUs, and 64 ROPs) is clocked at 1607 MHz base and 1683 MHz boost which is the maximum vendors can clock the cards out of the box. Gigabyte does offer 1-click overclocking using its Aorus Graphics Engine software and guarantees at least 88 MHz overclocks to 1683 MHz base and 1771 MHz boost. Users can, of course, use other software like MSI Afterburner or EVGA Precision X if they wish but will need to use the Gigabyte tool if they want the single click automatic overclock. The 8GB of GDDR5 memory is stock clocked at 8008 MHz and sits on a 256-bit bus.
Looking online, the Aorus GTX 1070 Ti Aorus doesn’t appear to be available for sale quite yet, but it should be coming soon. With the Gigabyte GTX 1070 Ti Gaming card coming in at $469, I’m betting the Aorus card with guaranteed overclock will have a MSRP around $500.
Subject: Graphics Cards | December 6, 2017 - 06:50 PM | Tim Verry
Tagged: evga, ftw, gtx 1070 ti, pascal, overclocking
EVGA is launching a new Pascal-based graphics card with a thicker 2.5 slot cooler in the form of the GeForce GTX 1070 Ti FTW Ultra Silent. The new graphics card has a sleek gray and black shroud with two large black fans in standard ACX 3.0 cooler styling, but with a much thicker cooler that EVGA claims enables more overclocking headroom or a nearly silent fan profile on stock settings.
The GTX 1070 Ti FTW Ultra Silent is powered by two 8-pin PCI-E power connectors that feed a 10+2 power phase and enables the cards 235W TDP (reference TDP is 180 watts). The 2432 Pascal GPU cores are clocked at 1607 MHz base and 1683 MHz boost which aligns with NVIDIA's reference specifications. While there are no guaranteed factory overclock here, EVGA is bundling the dual BIOS card with its EVGA Precision XOC and Precision OC Scanner X software for one-click overclocking that dynamically pushes the clocks up to find the optimal overclock for that specific card. The 8GB of GDDR5 memory is also stock clocked at 8008 MHz. Other features include a backplate, white LEDs, and 2-way SLI support.
Display outputs include one HDMI 2.0b, three DisplayPort 1.4, and one DVI port.
The new FTW series graphics card is available now from the EVGA website for $499.99 and comes with a three year warranty.
The graphics card appears to be rather tall, and I am curious how well the beefier heatsink performs and just how "ultra silent" those fans are! Hopefully we can get one in for testing! The $499.99 MSRP is interesting though because it lines up with the MSRP of the GTX 1080, but with the state of the GPU market as it is the price is not bad and actually comes in about in the middle of where other GTX 1070 Ti cards are at. My guess is they will be snatched up pretty quickly so it's hard to say if it will stay at that price especially on third party sites.
Subject: Graphics Cards | November 7, 2017 - 03:21 PM | Jeremy Hellstrom
Tagged: pascal, nvidia, gtx 1070 ti, geforce, msi
NVIDIA chose to limit the release of their GTX 1070 Ti to reference cards, all sporting the same clocks regardless of the model. That does not mean that the manufacturers skimped on the features which help you overclock successfully. As a perfect example, the MSI GTX 1070 Ti GAMING TITANIUM was built with Hi-C CAPs, Super Ferrite Chokes, and Japanese Solid Caps and 10-phase PWM. This resulted in an impressive overclock of 2050MHz on the GPU and a memory frequency of 9GHz once [H]ard|OCP boosted the power delivered to the card. That boost is enough to meet or even exceed the performance of a stock GTX 1080 or Vega 64 in most of the games they tested.
"NVIDIA is launching the GeForce GTX 1070 Ti today, and we’ve got a custom retail MSI GeForce GTX 1070 Ti GAMING TITANIUM video card to test and overclock, yes overclock, to the max. We’ll make comparisons against GTX 1080/1070, AMD Radeon RX Vega 64 and 56 for a complete review."
Here are some more Graphics Card articles from around the web:
- Nvidia's GeForce GTX 1070 Ti @ The Tech Report
- NVIDIA GeForce GTX 1070 Ti, Takes On The Radeon RX Vega 64 Under Linux @ Phoronix
- MSI GeForce GTX 1070 Ti Titanium 8G @ Guru of 3D
- NVIDIA GeForce GTX 1070 Ti Founders Edition Review @ OCC
- ASUS GTX 1070 Ti STRIX 8 GB @ TechPowerUp
- Colorful iGame GTX 1070 Ti Vulcan X TOP 8 GB @ TechPowerUp
- 34-Way Graphics Card Comparison On Ubuntu 17.10 @ Phoronix
Subject: Graphics Cards | November 2, 2017 - 03:03 PM | Jeremy Hellstrom
Tagged: pascal, nvidia, gtx 1070 ti, geforce
It should come as no surprise to anyone how the GTX 1070 Ti performs, better than a GTX 1070 but not quite as fast as a GTX 1080 ... unless you overclock. With the push of two buttons Ryan was able to hit 1987 MHz which surpasses your average GTX 1080 by a fair margin. Hardware Canucks saw 2088MHz when they overclocked as well as memory of 8.9Gbps which pushed the performance past the reference GTX 1080 in many games. Their benchmark suite encompasses a few different games so you should check to see if your favourites are there.
The real hope of this launch was that prices would change, not so much the actual prices you pay but the MSRP of cards both AMD and NVIDIA. For now that has not happened but perhaps soon it will, though Bitcoin hitting $7000 does not help.
"NVIDIA’s launch of their new GTX 1070 Ti is both senseless and completely sensible depending on which way you tend to look at things. The emotional among you are going to wonder why NVIDIA is even bothering to introduce a new product into a lineup that’s more than a year old."
Here are some more Graphics Card articles from around the web:
- GeForce GTX 1070 Ti Founder Edition @ Guru of 3D
- GeForce GTX 1070 Ti 2-way FCAT SLI @ Guru of 3D
- MSI GeForce GTX 1070 Ti Gaming @ Guru of 3D
- Palit GeForce GTX 1070 Ti Super Jetstream @ Guru of 3D
- Nvidia GTX 1070 Ti review: A fine graphics card—but price remains high @ Ars Technica
- GTX 1070 Ti Review- 35 Games benchmarked @ BabelTechReviews
- MSI GTX 1070 Ti Gaming 8 GB @ TechPowerUp
- NVIDIA GeForce GTX 1070 Ti Founders Edition 8 GB @ TechPowerUp
- A Quick Look At NVIDIA’s GeForce GTX 1070 Ti @ Techgage
- MSI GeForce GTX 1070 Ti Gaming
- MSI GTX 1070 Ti Gaming 8G @ Kitguru
- Palit GTX 1070 Ti Super JetStream 8 GB @ TechPowerUp
- Palit GTX 1070 Ti Super JetStream @ Kitguru
- The NVIDIA GeForce GTX 1080 Ti Founders Edition @ TechARP
- MSI GeForce GTX 1080 Ti GAMING X TRIO @ [H]ard|OCP
- Sapphire RX VEGA 64 Limited Edition @ Modders-Inc
- The AMD Radeon RX Vega 64 @ TechARP
Here comes a new challenger
The release of the GeForce GTX 1070 Ti has been an odd adventure. Launched into a narrow window of a product stack between the GTX 1070 and the GTX 1080, the GTX 1070 Ti is a result of the competition from the AMD RX Vega product line. Sure, NVIDIA might have speced out and prepared an in-between product for some time, but it was the release of competitive high-end graphics cards from AMD (for the first time in forever it seems) that pushed NVIDIA to launch what you see before us today.
With MSRPs of $399 and $499 for the GTX 1070 and GTX 1080 respectively, a new product that fits between them performance wise has very little room to stretch its legs. Because of that, there are some interesting peculiarities involved with the release cycle surrounding overclocks, partner cards, and more.
But before we get into that concoction, let’s first look at the specifications of this new GPU option from NVIDIA as well as the reference Founders Edition and EVGA SC Black Edition cards that made it to our offices!
GeForce GTX 1070 Ti Specifications
We start with our classic table of details.
|RX Vega 64 Liquid||RX Vega 64 Air||RX Vega 56||Vega Frontier Edition||GTX 1080 Ti||GTX 1080||GTX 1070 Ti||GTX 1070|
|Base Clock||1406 MHz||1247 MHz||1156 MHz||1382 MHz||1480 MHz||1607 MHz||1607 MHz||1506 MHz|
|Boost Clock||1677 MHz||1546 MHz||1471 MHz||1600 MHz||1582 MHz||1733 MHz||1683 MHz||1683 MHz|
|Memory Clock||1890 MHz||1890 MHz||1600 MHz||1890 MHz||11000 MHz||10000 MHz||8000 MHz||8000 MHz|
|Memory Interface||2048-bit HBM2||2048-bit HBM2||2048-bit HBM2||2048-bit HBM2||352-bit G5X||256-bit G5X||256-bit||256-bit|
|Memory Bandwidth||484 GB/s||484 GB/s||410 GB/s||484 GB/s||484 GB/s||320 GB/s||256 GB/s||256 GB/s|
|TDP||345 watts||295 watts||210 watts||300 watts||250 watts||180 watts||180 watts||150 watts|
|Peak Compute||13.7 TFLOPS||12.6 TFLOPS||10.5 TFLOPS||13.1 TFLOPS||11.3 TFLOPS||8.2 TFLOPS||7.8 TFLOPS||5.7 TFLOPS|
If you have followed the leaks and stories over the last month or so, the information here isn’t going to be a surprise. The CUDA core count of the GTX 1070 Ti is 2432, only one SM unit less than the GTX 1080. Base and boost clock speeds are the same as the GTX 1080. The memory system includes 8GB of GDDR5 running at 8 GHz, matching the performance of the GTX 1070 in this case. The TDP gets a bump up to 180 watts, in line with the GTX 1080 and slightly higher than the GTX 1070.
Subject: Graphics Cards | October 31, 2017 - 09:58 PM | Scott Michaud
Tagged: nvidia, amazon, google, pascal, Volta, gv100, tesla v100
Remember last month? Remember when I said that Google’s introduction of Tesla P100s would be good leverage over Amazon, as the latter is still back in the Kepler days (because Maxwell was 32-bit focused)?
To compare the two parts, the Tesla P100 has 3584 CUDA cores, yielding just under 10 TFLOPs of single-precision performance. The Tesla V100, with its ridiculous die size, pushes that up over 14 TFLOPs. Same as Pascal, they also support full 1:2:4 FP64:FP32:FP16 performance scaling. It also has access to NVIDIA’s tensor cores, which are specialized for 16-bit, 4x4 multiply-add matrix operations that are apparently common in neural networks, both training and inferencing.
Amazon allows up to eight of them at once (with their P3.16xlarge instances).
So that’s cool. While Google has again been quickly leapfrogged by Amazon, it’s good to see NVIDIA getting wins in multiple cloud providers. This keeps money rolling in that will fund new chip designs for all the other segments.
Subject: General Tech | August 17, 2017 - 12:48 PM | Jeremy Hellstrom
Tagged: nvidia, pascal, grid, tesla, Quadro vDWS
NVIDIA have updated their GRID virtual PC architecture to allow up to 24 virtual desktops, each with a 1GB desktop, doubling the previous capacity of their virtual machine tool. Along with this increase comes a new service called Quadro vDWS which allows you to power those virtual desktops with one of their HPC cards like their Pascal-based line of Tesla GPU accelerators. For workflows which incorporate things such as VR or photorealism this will offer a significant increase in performance; unfortunately Minesweeper will not see any improvements. NVIDIA accompanied this launch with a new blade server, the Tesla P6 which has 16GB of memory which can be split down to 16 1GB virtual desktops. Drop by The Inquirer for more information including on where to get this new software.
"NVIDIA has announced a new software suite which will allow users to virtualise an operating system to turn the company's ridiculously powerful Tesla GPU servers into powerful workstations."
Here is some more Tech News from around the web:
- Nokia 8 vs Galaxy S8 specs comparison @ The Inquirer
- Roku Gets Tough On Pirate Channels, Warns Users @ Slashdot
- Toshiba must allow Western Digital access to joint-venture assets @ The Register
- OCUK’s Andrew Gibson clears up RX Vega64 pricing disaster @ Kitguru
- How to build your own DIY makeshift levitation machine at home @ The Register
Subject: Graphics Cards | June 26, 2017 - 11:29 PM | Tim Verry
Tagged: pascal, nvidia, nicehash, mining, gp106-100, gp104-100, cryptocurrency
In addion to the AMD-based mining graphics cards based on the RX 470 Polaris silicon that have appeared online, NVIDIA and its partners are launching cryptocurrency mining cards based on GP106 and GP104 GPUs. Devoid of any GeForce or GTX branding, these cost controlled cards focused on mining lack the usual array of display outputs and have much shorter warranties (rumors point at a 3 month warranty restriction imposed by NVIDIA). So far Asus, Colorful, EVGA, Inno3D, MSI, and Zotac "P106-100" cards based on GP106 (GTX 1060 equivalent) silicon have been spotted online with Manli and Palit reportedly also working on cards. Many of these manufacturers are also also planning "P104-100" cards based on GP104 or the GTX 1070 though much less information is available at the moment. Pricing is still up in the air but pre-orders are starting to pop up overseas so release dates and prices will hopefully become official soon.
These mining oriented cards appear to be equipped with heatsinks similar to their gaming oriented siblings, but have fans rated for 24/7 operation. Further, while the cards can be overclocked they are clocked out of the box at reference clock speeds and allegedly have bolstered power delivery hardware to keep the cards mining smoothly under 24/7 operation. The majority of cards from NVIDIA partners lack any display outputs (the Colorful card has a single DVI out) which helps a bit with ventilation by leaving both slots vented. These cards are intended to be run in headless system or with systems that also have graphics integrated into the CPU (miners not wanting to waste a PCI-E slot!).
|Base Clock||Boost Clock||Memory (Type)||Pricing|
|ASUS MINING-P106-6G||1506 MHz||1708 MHz||6 GB (GDDR5) @ 8 GHz||$226|
|Colorful P106-100 WK1/WK2||1506 MHz||1708 MHz||6GB (GDDR5) @ 8 GHz||?|
|EVGA GTX1060 6G P106||1506 MHz||1708 MHz||6GB (GDDR5) @ 8 GHz||$284?|
|Inno3D P106-100 Compact||1506 Mhz||1708 MHz||6GB (GDDR5) @ 8 GHz||?|
|Inno3D P106-100 Twin||1506 MHz||1708 MHz||6GB (GDDR5) @ 8 GHz||?|
|MSI P106-100 MINER||1506 MHz||1708 MHz||6GB (GDDR5) @ 8 GHz||$224|
|MSI P104-100 MINER||TDB||TBD||6GB (GDDR5X) @ ?||?|
|ZOTAC P106-100||1506 MHz||1708 MHz||6GB (GDDR5) @ 8 GHz||?|
Looking at the Nicehash Profitability Calculator, the GTX 1060 and GTX 1070 are rated at 20.13 MH/s and 28.69 MH/s at DaggerHashimoto (Etherium) mining respectively with many users able to get a good bit higher hash rates with a bit of overclocking (and in the case of AMD undervolting to optimize power efficiency). NVIDIA cards tend to be good for other algorithms as well such as ZCash and Libry and Equihash (at least those were the majority of coins my 750 Ti mined likely due to it not having the memory to attempt ETH mining heh). The calculator estimates these GPUs at 0.00098942 BTC per day and 0.00145567 BTC per day respectivey. If difficulty and exchange rate were to remains constant that amounts to an income of $1197.95 per year for a GP106 and $1791.73 per year for a GP104 GPU and ROI in under 3 months. Of course cryptocurrency to USD exchange rates will not remain constant, there are transactions and mining fees, and mining difficulty will rise as more hardware is added to the network as miners so these estimated numbers will be lower in reality. Also, these numbers are before electricity, maintainence time, and failed hardware costs, but currently mining alt coins is still very much profitable using graphics cards.
AMD and NVIDIA (and their AIB partners) are hoping to get in on this action with cards binned and tuned for mining and at their rumored prices placing them cheaper than their gaming focused RX and GTX variants miners are sure to scoop these cards up in huge batches (some of the above cards are only availabe in large orders). Hopefully this will alleviate the strain on the gaming graphics card market and bring prices back down closer to their original MSRPs for gamers!
- Mining specific cards are real - ASUS and Sapphire GP106 and RX 470 show up
- First look at Pascal-based GPU cryptocurrency mining station @ Videocardz
- ASUS, COLORFUL and MSI showcase their mining graphics cards @ Videocardz
- Riding the Crypto wave @ TechPowerUP Forums (links/info on mining cards collected here)
- Donate to the PC Perspective Mining Pool! A NiceHash How-to
- Let's Talk About Mining - Cryptocurrency Revisited
- Computex 2017: ASRock Launching H110 Pro BTC+ Motherboard With 13 PCI-E Slots
What are your thoughts on all this GPU mining and cryptocurrency / blockchain technology stuff?
Subject: Graphics Cards | May 23, 2017 - 03:58 PM | Jeremy Hellstrom
Tagged: ek cooling, pascal, nvidia, waterblock, GTX FE
The current series of EK Cooling waterblocks for Pascal based GPUs, up to and including the new Titan X are being replaced with a new family of coolers. The new GTX FE water blocks will be compatible with the previous generation of backplates, so you can do a partial upgrade or keep an eye out for discounts on the previous generation.
These new coolers will fit on any Founders Edition reference card, from GTX 1060's through to the Titan X, currently that count stands at 106 unique graphics cards so your card is likely to be compatible. You can choose between four models, a plain design, one with acetal, one with nickel and one with both acetal and nickel, whichever one you choose it will still run you 109.95€/$125USD
Full PR is below.
EK Water Blocks, the Slovenia-based premium computer liquid cooling gear manufacturer, is releasing several new EK-FC GeForce GTX FE water blocks that are compatible with multiple reference design Founders Edition NVIDIA® GeForce GTX 1060, 1070, 1080, 1080 Ti, Titan X Pascal and Titan Xp based graphics cards. All the water blocks feature recently introduced aesthetic terminal cover as well! FE blocks come as a replacement to current GeForce GTX 10x0 / TITAN X Series of water blocks.
All current GeForce GTX 10x0 / TITAN X Series of water blocks are going to be discontinued after the stock runs out and FE blocks come as a complete replacement. FE blocks are designed to fit all reference design Founders Edition NVIDIA GeForce GTX 1060, 1070, 1080, 1080 Ti, Titan X Pascal and Titan Xp based graphics cards. The current compatibility list rounds up a total of 106 graphics cards that are on the market, but as always, we recommend that you refer to the EK Cooling Configurator for a precise compatibility match.
The new EK-FC GeForce GTX FE water blocks are also backward compatible with all EK-FC1080 GTX Backplates, EK-FC1080 GTX Ti Backplates, and EK-FC Titan X Pascal Backplates.
Availability and pricing
These water blocks are made in Slovenia, Europe and are available for purchase through EK Webshop and Partner Reseller Network. In the table below you can see manufacturer suggested retail price (MSRP) with VAT included.