How deep is your learning?

Recently, we've had some hands-on time with NVIDIA's new TITAN V graphics card. Equipped with the GV100 GPU, the TITAN V has shown us some impressive results in both gaming and GPGPU compute workloads.

However, one of the most interesting areas that NVIDIA has been touting for GV100 has been deep learning. With a 1.33x increase in single-precision FP32 compute over the Titan Xp, and the addition of specialized Tensor Cores for deep learning, the TITAN V is well positioned for deep learning workflows.

In mathematics, a tensor is a multi-dimensional array of numerical values with respect to a given basis. While we won't go deep into the math behind it, Tensors are a crucial data structure for deep learning applications.

07.jpg

NVIDIA's Tensor Cores aim to accelerate Tensor-based math by utilizing half-precision FP16 math in order to process both dimensions of a Tensor at the same time. The GV100 GPU contains 640 of these Tensor Cores to accelerate FP16 neural network training.

It's worth noting that these are not the first Tensor operation-specific hardware, with others such as Google developing hardware for these specific functions.

Test Setup

  PC Perspective Deep Learning Testbed
Processor AMD Ryzen Threadripper 1920X
Motherboard GIGABYTE X399 AORUS Gaming 7
Memory 64GB Corsair Vengeance RGB DDR4-3000 
Storage Samsung SSD 960 Pro 2TB
Power Supply Corsair AX1500i 1500 watt
OS Ubuntu 16.04.3 LTS
Drivers AMD: AMD GPU Pro 17.50
NVIDIA: 387.34

For our NVIDIA testing, we used the NVIDIA GPU Cloud 17.12 Docker containers for both TensorFlow and Caffe2 inside of our Ubuntu 16.04.3 host operating system.

AMD testing was done using the hiptensorflow port from the AMD ROCm GitHub repositories.

For all tests, we are using the ImageNet Large Scale Visual Recognition Challenge 2012 (ILSVRC2012) data set.

Continue reading our look at deep learning performance with the NVIDIA Titan V!!

Author:
Manufacturer: NVIDIA

Looking Towards the Professionals

This is a multi-part story for the NVIDIA Titan V:

Earlier this week we dove into the new NVIDIA Titan V graphics card and looked at its performacne from a gaming perspective. Our conclusions were more or less what we expected - the card was on average ~20% faster than the Titan Xp and about ~80% faster than the GeForce GTX 1080. But with that $3000 price tag, the Titan V isn't going to win any enthusiasts over.

What the Titan V is meant for in reality is the compute space. Developers, coders, engineers, and professionals that use GPU hardware for research, for profit, or for both. In that case, $2999 for the Titan V is simply an investment that needs to show value in select workloads. And though $3000 is still a lot of money, keep in mind that the NVIDIA Quadro GP100, the most recent part with full-performance double precision compute from the Pascal chip, is still selling for well over $6000 today. 

IMG_5009.JPG

The Volta GV100 GPU offers 1:2 double precision performance, equating to 2560 FP64 cores. That is a HUGE leap over the GP102 GPU used on the Titan Xp that uses a 1:32 ratio, giving us just 120 FP64 cores equivalent.

  Titan V Titan Xp GTX 1080 Ti GTX 1080 GTX 1070 Ti GTX 1070 RX Vega 64 Liquid Vega Frontier Edition
GPU Cores 5120 3840 3584 2560 2432 1920 4096 4096
FP64 Cores 2560 120 112 80 76 60 256 256
Base Clock 1200 MHz 1480 MHz 1480 MHz 1607 MHz 1607 MHz 1506 MHz 1406 MHz 1382 MHz
Boost Clock 1455 MHz 1582 MHz 1582 MHz 1733 MHz 1683 MHz 1683 MHz 1677 MHz 1600 MHz
Texture Units 320 240 224 160 152 120 256 256
ROP Units 96 96 88 64 64 64 64 64
Memory 12GB 12GB 11GB 8GB 8GB 8GB 8GB 16GB
Memory Clock 1700 MHz MHz 11400 MHz 11000 MHz 10000 MHz 8000 MHz 8000 MHz 1890 MHz 1890 MHz
Memory Interface 3072-bit
HBM2
384-bit G5X 352-bit G5X 256-bit G5X 256-bit 256-bit 2048-bit HBM2 2048-bit HBM2
Memory Bandwidth 653 GB/s 547 GB/s 484 GB/s 320 GB/s 256 GB/s 256 GB/s 484 GB/s 484 GB/s
TDP 250 watts 250 watts 250 watts 180 watts 180 watts 150 watts 345 watts 300 watts
Peak Compute 12.2 (base) TFLOPS
14.9 (boost) TFLOPS
12.1 TFLOPS 11.3 TFLOPS 8.2 TFLOPS 7.8 TFLOPS 5.7 TFLOPS 13.7 TFLOPS 13.1 TFLOPS
Peak DP Compute 6.1 (base) TFLOPS
7.45 (boost) TFLOPS
0.37 TFLOPS 0.35 TFLOPS 0.25 TFLOPS 0.24 TFLOPS 0.17 TFLOPS 0.85 TFLOPS 0.81 TFLOPS
MSRP (current) $2999 $1299 $699 $499 $449 $399 $699 $999

The current AMD Radeon RX Vega 64, and the Vega Frontier Edition, all ship with a 1:16 FP64 ratio, giving us the equivalent of 256 DP cores per card.

Test Setup and Benchmarks

Our testing setup remains the same from our gaming tests, but obviously the software stack is quite different. 

  PC Perspective GPU Testbed
Processor Intel Core i7-5960X Haswell-E
Motherboard ASUS Rampage V Extreme X99
Memory G.Skill Ripjaws 16GB DDR4-3200
Storage OCZ Agility 4 256GB (OS)
Adata SP610 500GB (games)
Power Supply Corsair AX1500i 1500 watt
OS Windows 10 x64
Drivers AMD: 17.10.2
NVIDIA: 388.59

Applications in use include:

  • Luxmark 
  • Cinebench R15
  • VRay
  • Sisoft Sandra GPU Compute
  • SPECviewperf 12.1
  • FAHBench

Let's not drag this along - I know you are hungry for results! (Thanks to Ken for running most of these tests for us!!)

Continue reading part 2 of our Titan V review on compute performance!!

Podcast #479 - NVIDIA Titan V, AMD Adrenalin, and more!

Subject: General Tech | December 14, 2017 - 12:09 PM |
Tagged: video, vesa, toshiba, titan v, synaptics, Silverstone, shazam, radeon, podcast, PBT, nvidia, nervana, keylogger, jonsbo, Intel, hp, hdr, corsair, Clear ID, apple, amd, Adrenalin, 14tb

PC Perspective Podcast #479 - 12/14/17

Join us for discussion on NVIDIA Titan V, AMD Adrenalin, and more!

You can subscribe to us through iTunes and you can still access it directly through the RSS page HERE.

The URL for the podcast is: http://pcper.com/podcast - Share with your friends!

Hosts: Ryan Shrout, Josh Walrath, Jeremy Hellstrom, Allyn Malventano,

Peanut Gallery: Ken Addison, Alex Lustenberg

Program length: 1:12:23

Podcast topics of discussion:
  1. Week in Review:
  2. 0:38:15 AD:  Hello Fresh
  3. News items of interest:
  4. Picks of the Week:
    1. 1:06:15 Allyn: Authy
  5. Closing/outro

Source:
Author:
Manufacturer: NVIDIA

A preview of potential Volta gaming hardware

This is a multi-part story for the NVIDIA Titan V:

As a surprise to most of us in the media community, NVIDIA launched a new graphics card to the world, the TITAN V. No longer sporting the GeForce brand, NVIDIA has returned the Titan line of cards to where it began – clearly targeted at the world of developers and general purpose compute. And if that branding switch isn’t enough to drive that home, I’m guessing the $2999 price tag will be.

Today’s article is going to look at the TITAN V from the angle that is likely most interesting to the majority of our readers, that also happens to be the angle that NVIDIA is least interested in us discussing. Though targeted at machine learning and the like, there is little doubt in my mind that some crazy people will want to take on the $3000 price to see what kind of gaming power this card can provide. After all, this marks the first time that a Volta-based GPU from NVIDIA has shipped in a place a consumer can get their hands on it, and the first time it has shipped with display outputs. (That’s kind of important to build a PC around it…)

IMG_4999.JPG

From a scientific standpoint, we wanted to look at the Titan V for the same reasons we tested the AMD Vega Frontier Edition cards upon their launch: using it to estimate how future consumer-class cards will perform in gaming. And, just as we had to do then, we purchased this Titan V from NVIDIA.com with our own money. (If anyone wants to buy this from me to recoup the costs, please let me know! Ha!)

  Titan V Titan Xp GTX 1080 Ti GTX 1080 GTX 1070 Ti GTX 1070 RX Vega 64 Liquid Vega Frontier Edition
GPU Cores 5120 3840 3584 2560 2432 1920 4096 4096
Base Clock 1200 MHz 1480 MHz 1480 MHz 1607 MHz 1607 MHz 1506 MHz 1406 MHz 1382 MHz
Boost Clock 1455 MHz 1582 MHz 1582 MHz 1733 MHz 1683 MHz 1683 MHz 1677 MHz 1600 MHz
Texture Units 320 240 224 160 152 120 256 256
ROP Units 96 96 88 64 64 64 64 64
Memory 12GB 12GB 11GB 8GB 8GB 8GB 8GB 16GB
Memory Clock 1700 MHz MHz 11400 MHz 11000 MHz 10000 MHz 8000 MHz 8000 MHz 1890 MHz 1890 MHz
Memory Interface 3072-bit
HBM2
384-bit G5X 352-bit G5X 256-bit G5X 256-bit 256-bit 2048-bit HBM2 2048-bit HBM2
Memory Bandwidth 653 GB/s 547 GB/s 484 GB/s 320 GB/s 256 GB/s 256 GB/s 484 GB/s 484 GB/s
TDP 250 watts 250 watts 250 watts 180 watts 180 watts 150 watts 345 watts 300 watts
Peak Compute 12.2 (base) TFLOPS
14.9 (boost) TFLOPS
12.1 TFLOPS 11.3 TFLOPS 8.2 TFLOPS 7.8 TFLOPS 5.7 TFLOPS 13.7 TFLOPS 13.1 TFLOPS
MSRP (current) $2999 $1299 $699 $499   $399 $699 $999

The Titan V is based on the GV100 GPU though with some tweaks that lower performance and capability slightly when compared to the Tesla-branded equivalent hardware. Though our add-in card iteration has the full 5120 CUDA cores enabled, the HBM2 memory bus is reduced from 4096-bit to 3072-bit and it has one of the four stacks on the package disabled. This also drops the memory capacity from 16GB to 12GB, and memory bandwidth to 652.8 GB/s.

Continue reading our gaming review of the NVIDIA Titan V!!

Video: What does a $3000 GPU look like? NVIDIA TITAN V Unboxing and Teardown!

Subject: Graphics Cards | December 12, 2017 - 07:51 PM |
Tagged: nvidia, titan, titan v, Volta, video, teardown, unboxing

NVIDIA launched the new Titan V graphics card last week, a $2999 part targeted not at gamers (thankfully) but instead at developers of machine learning applications. Based on the GV100 GPU and 12GB of HBM2 memory, the Titan V is an incredibly powerful graphics card. We have every intention of looking at the gaming performance of this card as a "preview" of potential consumer Volta cards that may come out next year. (This is identical to our stance of testing the Vega Frontier Edition cards.)

But for now, enjoy this unboxing and teardown video that takes apart the card to get a good glimpse of that GV100 GPU.

A couple of quick interesting notes:

  • This implementation has 25% of the memory and ROPs disabled, giving us 12GB of HBM2, a 3072-bit bus, and 96 ROPs.
  • Clock speeds in our testing look to be much higher than the base AND boost ratings.
  • So far, even though the price takes this out of the gaming segment completely, we are impressed with some of the gaming results we have found.
  • The cooler might LOOK the same, but it definitely is heavier than the cooler and build for the Titan Xp.
  • Champagne. It's champagne colored.
  • Double precision performance is insanely good, spanking the Titan Xp and Vega so far in many tests.
  • More soon!

gv100.png

Source: NVIDIA

NVIDIA Launches Titan V, the World's First Consumer Volta GPU with HBM2

Subject: Graphics Cards | December 7, 2017 - 11:44 PM |
Tagged: Volta, titan, nvidia, graphics card, gpu

NVIDIA made a surprising move late Thursday with the simultaneous announcement and launch of the Titan V, the first consumer/prosumer graphics card based on the Volta architecture.

NVIDIA_TITAN V_KV.jpeg

Like recent flagship Titan-branded cards, the Titan V will be available exclusively from NVIDIA for $2,999. Labeled "the most powerful graphics card ever created for the PC," Titan V sports 12GB of HBM2 memory, 5120 CUDA cores, and a 1455MHz boost clock, giving the card 110 teraflops of maximum compute performance. Check out the full specs below:

6 Graphics Processing Clusters
80 Streaming Multiprocessors
5120 CUDA Cores (single precision)
320 Texture Units
640 Tensor Cores
1200 MHz Base Clock (MHz)
1455 MHz Boost Clock (MHz)
850 MHz Memory Clock
1.7 Gbps Memory Data Rate
4608K L2 Cache Size
12288 MB HBM2 Total Video Memory
3072-bit Memory Interface
652.8 GB/s Total Memory Bandwidth
384 GigaTexels/sec Texture Rate (Bilinear)
12 nm Fabrication Process (TSMC 12nm FFN High Performance)
21.1 Billion Transistor Count
3 x DisplayPort, 1 x HDMI Connectors
Dual Slot Form Factor
One 6-pin, One 8-pin Power Connectors
600 Watts Recommended Power Supply
250 Watts Thermal Design Power (TDP)

The NVIDIA Titan V's 110 teraflops of compute performance compares to a maximum of about 12 teraflops on the Titan Xp, a greater than 9X increase in a single generation. Note that this is a very specific claim though, and references the AI compute capability of the Tensor cores rather than we traditionally measure for GPUs (single precision FLOPS). In that metric, the Titan V only truly offers a jump to 14 TFLOPS. The addition of expensive HBM2 memory also adds to the high price compared to its predecessor.

titan-v-stylized-photography-6.jpeg

The Titan V is available now from NVIDIA.com for $2,999, with a limit of 2 per customer. And hey, there's free shipping too.

Source: NVIDIA

Nvidia GeForce 388.43 WHQL driver

Subject: Graphics Cards | November 30, 2017 - 01:31 PM |
Tagged: GeForce 388.43, nvidia, whql, DOOM VFR, NV Tray

It might sound like an experimental Nazi weapon from WWII, the DOOM VFR has little to do with Wolfenstein and is instead a DOOM virtual reality game for the HTC Vive (pre-purchasing is bad, m'kay).   NVIDIA's new game ready driver, the GeForce 388.43 WHQL release is made to improve the performance of your GTX in this game.

vfr.PNG

This release also marks the return of the NVTray, much to the delight of the hoards of users who mourned the loss of the utility.  You can grab the drivers here, or through the GeForce Experience app if you have it installed.

 

Source: NVIDIA

An AIDA64 suite update with a surprise

Subject: General Tech | November 27, 2017 - 04:27 PM |
Tagged: aida64, Core i9-8950HK, Intel, leak, amd, nvidia

An update to AIDA's suite of tools was announced today, with several major update to the Extreme software as well as Engineer, Business and Network auditing products as well.  Some of the changes are minor, such as support for Win10 Creators Edition but there are also updates to support Threadripper, the new Tesla V100 series and all three new chipsets from AMD and Intel.  You may have noticed Scott's post below, which details the surprise contained in the release notes; the Core i9-8950HK, i7-8850H, i7-8750H, i5-8400H and i3-8300H are all new processors, based on Coffee Lake.

Capture.PNG

"FinalWire Ltd. today announced the immediate availability of AIDA64 Extreme 5.95 software, a streamlined diagnostic and benchmarking tool for home users; the immediate availability of AIDA64 Engineer 5.95 software, a professional diagnostic and benchmarking solution for corporate IT technicians and engineers; the immediate availability of AIDA64 Business 5.95 software, an essential network management solution for small and medium scale enterprises; and the immediate availability of AIDA64 Network Audit 5.95 software, a dedicated network audit toolset to collect and manage corporate network inventories."

Here is some more Tech News from around the web:

Tech Talk

Source: AIDA64

XSPC Razor Neo Waterblock is pretty, effective

Subject: Graphics Cards | November 23, 2017 - 01:30 PM |
Tagged: watercooler, gtx 1080 ti, nvidia, XSPC, Razer Neo

It seems a shame to hide the XSPC Razor Neo watercooler for the GTX 1080 Ti as you will not easily see the polished nickel plated copper waterblock and tempered glass window XSPC used.  [H]ard|OCP found the design to be very scratch resistant and it allows you to completely avoid the cracks which acrylic inevitably develops as it ages.  This waterblock is not just decorative, [H] found the card would hit and remain at 2100.5MHz in game, with temperatures never exceeding 33C, with or without the Frag Harder Disco Lights going.

1511290150fvx2g4emm9_1_2_l.jpg

"If you are thinking about delving in water cooling your high end NVIDIA GTX 1080 or 1080 Ti video card, the XSPC Razor Neo is certainly worthy of being on your short list. Outside of its incredibly good looks, Frag Harder Disco Lights, and easy install process, does it work well when it comes to overclocking and cooling your GTX 1080 Ti?"

Here are some more Graphics Card articles from around the web:

Graphics Cards

Source: [H]ard|OCP

Need for GPU Speed

Subject: General Tech | November 22, 2017 - 02:34 PM |
Tagged: gaming, need for speed payback, nvidia, amd

The new Need for Speed Payback uses the familiar Frostbite 3 game engine, so we have some general idea how various cards will perform.  There is a feature used in the game that changes how AMD cards perform however, this game makes use of the AMD GPU Services (AGS) library which should make their cards more effective.  [H]ard|OCP's testing did show a close race, apart from the unmatched GTX 1080Ti AMD's cards offer competitive performance and even offering taking the lead at some resolutions.  Drop by to take a look at the details.

1511081856yzfeha3mnu_1_1.png

"Need for Speed Payback is out, we’ll look at feature performance and video card performance comparisons in today’s latest video cards. We’ll find what’s playable, and examine graphics quality setting performance among eight video cards. We will also find out VRAM and CPU usage of this new game so you pick the right video card for gaming. "

Here is some more Tech News from around the web:

Gaming

Source: [H]ard|OCP