Phoronix on OpenCL Driver Optimization, NVIDIA vs. AMD

Subject: Editorial, General Tech, Graphics Cards | December 28, 2012 - 02:43 AM |
Tagged: opencl, nvidia, amd

The GPU is slowly becoming the parallel processing complement to your branching logic-adept CPU. Developers have been slow to adopt this new technology but that does not hinder the hardware manufacturers from putting on a kettle of tea for when guests arrive.

While the transition to GPGPU is slower than I am sure many would like, developers are rarely quick on the uptake of new technologies. The Xbox 360 was one of the first platforms where unified shaders became mandatory and early developers avoided them by offloading vertex code to the CPU. On that note: how much software still gets released without multicore support?

7-TuxGpu.png

Phoronix, practically the arbiter of all Linux news, decided to put several GPU drivers and their manufacturers to the test. AMD was up first and their results showed a pretty sizeable jump in performance at around October of this year through most of their tests. The article on NVIDIA arrived two days later and saw performance trended basically nowhere since February with the 295.20 release.

A key piece of information is that both benchmarks were performed with last generation GPUs: the GTX 460 on the NVIDIA side, with the 6950 holding AMD’s flag. You might note that 295.20 was the last tested driver to be released prior to the launch of Kepler.

These results seem to suggest that upon the launch of Kepler, NVIDIA did practically zero optimizations to their older "Fermi" architecture at least as far as these Linux OpenCL benchmarks are concerned. On the AMD side, it seems as though they are more willing to go back and advance the performance of their prior generation as they release new driver versions.

There are very few instances where AMD beats out NVIDIA in terms of driver support -- it is often a selling point for the jolly green giant -- but this appears to be a definite win for AMD.

Source: Phoronix

Podcast #231 - Intel NUC, AMD 8000M GPUs, Building a Hackintosh and more!

Subject: General Tech | December 20, 2012 - 03:16 PM |
Tagged: video, virtu, VIA, tegra 4, Samsung, radeon, podcast, nvidia, nvelo, nuc, lucid, Intel, hackintosh, gigabyte, Dataplex, arm, amd, 8000m

PC Perspective Podcast #231 - 12/20/2012

Join us this week as we talk about the Intel NUC, AMD 8000M GPUs, Building a Hackintosh and more!

You can subscribe to us through iTunes and you can still access it directly through the RSS page HERE.

The URL for the podcast is: http://pcper.com/podcast - Share with your friends!

  • iTunes - Subscribe to the podcast directly through the iTunes Store
  • RSS - Subscribe through your regular RSS reader
  • MP3 - Direct download link to the MP3 file

Hosts: Ryan Shrout, Josh Walrath, Allyn Malventano and Chris Barbere

This Podcast is brought to you by MSI!

Program length: 1:13:41

Podcast topics of discussion:

  1. 0:01:50 We are going to try Planetside 2 after the podcast!
  2. Week in Reviews:
    1. 0:02:50 Intel Next Unit of Computing NUC
    2. 0:17:55 Corsair AX860i Digital ATX Power Supply
    3. 0:19:00 HP Z1 Workstation All in One
    4. 0:25:00 Building a Hackintosh Computer - A Guide
  3. 0:32:35 This Podcast is brought to you by MSI!
  4. News items of interest:
    1. 0:33:30 Cutting the Cord Complete!
    2. 0:36:10 VIA ARM-based SoCs in upcoming ASUS tablet
    3. 0:42:00 Lucid MVP 2.0 will be sold direct
    4. 0:44:50 Samsung acquires NVELO SSD Caching Software
    5. 0:49:00 AMD announces mobility 8000M series of GPUs
    6. 0:54:15 Some NVIDIA Tegra 4 Details
    7. 0:58:55 NEC Unveils Super Thin Ultrabook
    8. 1:00:30 Win a Sapphire HD 7870 GHz Edition FleX!!
  5. Closing:
    1. 1:02:30 Hardware / Software Pick of the Week
      1. Ryan: Panasonic GH2 Micro 4/3 Camera
      2. Josh: Preparation is key!
      3. Allyn: Cheap RAM
      4. Chris: Had solar panels installed this week
  1. 1-888-38-PCPER or podcast@pcper.com
  2. http://pcper.com/podcast
  3. http://twitter.com/ryanshrout and http://twitter.com/pcper
  4. Closing/outro

Be sure to subscribe to the PC Perspective YouTube channel!!

NVIDIA Tegra 4 Details Revealed By Leaked Slide

Subject: Processors, Mobile | December 19, 2012 - 03:26 AM |
Tagged: wayne, tegra 4, SoC, nvidia, cortex a15, arm

Earlier this year, NVIDIA showed off a roadmap for its Tegra line of mobile system on a chip (SoC) processors. Namely, the next generation Tegra 4 mobile chip is codenamed Wayne and will be the successor to the Tegra 3.

Tegra 4 will use a 28nm manufacturing process and feature improvements to the CPU, GPU, and IO components. Thanks to a leaked slide that appeared on Chip Hell, we now have more details on Tegra 4.

NVIDIA Tegra 4 Leaked Slide.jpg

The 28nm Tegra 4 SoC will keep the same 4+1 CPU design* as the Tegra 3, but it will use ARM Cortex A15 CPU cores instead of the Cortex A9 cores used in the current generation chips. NVIDIA is also improving the GPU portion, and Tegra 4 will reportedly feature a 72 core GPU based on a new architecture. Unfortunately, we do not have specifics on how that GPU is set up architecturally, but the leaked slide indicates that the GPU will be as much as 6x faster than NVIDIA’s own Tegra 3. It will allegedly be fast enough to power displays with resolutions from 1080p @ 120Hz to 4K (refresh rate unknown). Don’t expect to drive games at native 4K resolution, however it should run a tablet OS fine. Interestingly, NVIDIA has included hardware to hardware accelerate VP8 and H.264 video at up to 2560x1440 resolutions.

Additionally, Tegra 4 will feature support for dual channel DDR3L memory, USB 3.0 and hardware accelerated secuity options including HDCP, Secure Boot, and DRM which may make Tegra 4 an attractive option for Windows RT tablets.

The leaked slide has revealed several interesting details on Tegra 4, but it has also raised some questions on the nitty-gritty details. Also, there is no mention of the dual core variant of Tegra 4 – codenamed Grey – that is said to include an integrated Icera 4G LTE cellular modem. Here’s hoping more details surface at CES next month!

* NVIDIA's name for a CPU that features four ARM CPU cores and one lower power ARM companion core.

Source: Chip Hell

How much horsepower do you need to BLOPS2 properly?

Subject: Graphics Cards | December 3, 2012 - 02:02 PM |
Tagged: nvidia, call of duty, black ops 2, amd

[H]ard|OCP set out to determine how well AMD and NVIDIA's cards can deal with the new Call of Duty game.  To do so they took a system built on a GIGABYTE Z77X-UP4-TH, a Core i7 2600k @ 4.8GHz, and 8GB of Corsair RAM and then tested a HD7970, 7950 and 7870 as well as a GTX680, 670 and 660Ti.  There is good news for both graphics companies and gamers, the HD7870 was the slowest card and still managed great performance on maximum settings @ 2560x1600 with 8X MSAA and FXAA.  For the absolute best performance it is NVIDIA's GTX680 that is your go to card though since this is a console port, albeit one that [H] describes as well implemented, don't expect to be blown away by the quality of the graphics.

Hoptions.jpg

"Call of Duty: Black Ops II is the first Call of Duty game on PC to support DX11 and new graphical features. Hopefully improvements to the IW Engine will be enough to boost the CoD franchise near the top graphics-wise. We also examine NVIDIA's TXAA technology which combines shader based antialiasing and traditional multisampling AA."

Here are some more Graphics Card articles from around the web:

Graphics Cards

Source: [H]ard|OCP

Too good to be true; bad coding versus GPGPU compute power

Subject: General Tech | November 23, 2012 - 01:03 PM |
Tagged: gpgpu, amd, nvidia, Intel, phi, tesla, firepro, HPC

The skeptics were right to question the huge improvements seen when using GPGPUs in a system for heavy parallel computing tasks.  The cards do help a lot but the 100x improvements that have been reported by some companies and universities had more to do with poorly optimized CPU code than with the processing power of GPGPUs.  This news comes from someone who you might not expect to burst this particular bubble, Sumit Gupta is the GM of NVIDIA's Tesla team and he might be trying to mitigate any possible disappointment from future customers which have optimized CPU coding and won't see the huge improvements seen by academics and other current customers.  The Inquirer does point out a balancing benefit, it is obviously much easier to optimize code in CUDA, OpenCL and other GPGPU languages than it is to code for multicored CPUs.

bubble-burst.jpg

"Both AMD and Nvidia have been using real-world code examples and projects to promote the performance of their respective GPGPU accelerators for years, but now it seems some of the eye popping figures including speed ups of 100x or 200x were not down to just the computing power of GPGPUs. Sumit Gupta, GM of Nvidia's Tesla business told The INQUIRER that such figures were generally down to starting with unoptimised CPU."

Here is some more Tech News from around the web:

Tech Talk

Source: The Inquirer

GeForce GTX Call of Duty Rivalries competition ...and more

Subject: General Tech, Graphics Cards | November 15, 2012 - 04:05 PM |
Tagged: nvidia, call of duty, black ops 2

header-rivalries.jpg

NVIDIA will be celebrating the release of Call of Duty: Black Ops II by launching the first-ever “GeForce GTX Call of Duty Rivalries” competition which pits top colleges against each other in Call of Duty: Black Ops II four-person, last team standing multiplayer matches. Participants in the first round of competition include the storied rivalries of Cal vs. Stanford, USC vs. UCLA and UNC vs. NC State. Two additional wildcard colleges from any accredited college in the United States will also be chosen by the Facebook community to field teams. See details on GeForce.com or visit NVIDIA’s Facebook page on how you can walk away with a Maingear gaming rig.

logo_geforce.png

In addition to the contest NVIDIA also released the GeForce 310.54 beta driver with specific benefits for players of Black Ops 2, specifically the inclusion of TXAA.

  • Delivers up to 26 percent faster performance in Call of Duty: Black Ops 2 and up to 18% faster performance in Assassin’s Creed III.
  • Provides smooth, shimmer-free graphics with NVIDIA TXAA antialiasing in Call of Duty: Black Ops 2 and Assassin’s Creed III.
  • Improves performance by up to 16% in other top games likes Battlefield 3, The Elder Scrolls V: Skyrim, and StarCraft II.

As always, our new driver includes new profiles for today’s top titles, increasing multi-GPU performance.

  • Hawken – Added SLI profile
  • Hitman: Absolution – Added SLI profile
  • Natural Selection 2 – Added SLI profile
  • Primal Carnage – Added SLI profile

You can grab the driver and read about all the improvements right here.

Source: NVIDIA

Podcast #227 - Golden Z77 Motherboard from ECS, High Powered WiFi from Amped Wireless, Supercomputing GPUs and more!

Subject: General Tech | November 15, 2012 - 02:10 PM |
Tagged: titan, thor, tesla, s1000, podcast, nvidia, k20x, Intel, golden board, firepro, ECS, dust, Amped Wireless, amd

PC Perspective Podcast #227 - 11/15/2012

Join us this week as we talk about a Golden Z77 Motherboard from ECS, High Powered WiFi from Amped Wireless, Supercomputing GPUs and more!

You can subscribe to us through iTunes and you can still access it directly through the RSS page HERE.

The URL for the podcast is: http://pcper.com/podcast - Share with your friends!

  • iTunes - Subscribe to the podcast directly through the iTunes Store
  • RSS - Subscribe through your regular RSS reader
  • MP3 - Direct download link to the MP3 file

Hosts: Ryan Shrout, Jeremy Hellstrom, Josh Walrath, and Allyn Malventano

This Podcast is brought to you by MSI!

Program length: 1:07:04

Podcast topics of discussion:

  1. Join us for the Hitman: Absolution Game Stream
  2. Week in Reviews:
    1. 0:02:00 ECS Z77H2-AX Golden Board Motherboard
    2. 0:07:00 Amped Wireless R20000G Router and Adapter
    3. 0:12:20 Intel says USB 3.0 and 2.4 GHz don't get along
  3. 0:18:00 This Podcast is brought to you by MSI!
  4. News items of interest:
    1. 0:19:00 A renaissance of game types that have been sadly missing
    2. 0:24:00 You missed our live Medal of Honor Game Stream - loser!
    3. 0:26:12 NVIDIA launches Tesla K20X Card, Powers Titan Supercomputer
    4. 0:30:15 AMD Launches Dual Tahiti FirePro S10000
    5. 0:38:00 Some guy leaves Microsoft - is the Start Menu on its way back??
    6. 0:41:40 AMD is apparently not for sale
    7. 0:46:05 ECS joins the Thunderbolt family with a new Z77 motherboard
  5. Closing:
    1. 0:54:00 Hardware / Software Pick of the Week
      1. Ryan: Corsair Hydro Series H60 for $75
      2. Jeremy: Form over function or vice versa?
      3. Josh: A foundation worth donating to
      4. Allyn: ArmorSuit Military Shields
  1. 1-888-38-PCPER or podcast@pcper.com
  2. http://pcper.com/podcast
  3. http://twitter.com/ryanshrout and http://twitter.com/pcper
  4. Closing/outro

Be sure to subscribe to the PC Perspective YouTube channel!!

 

 

The driving force behind revisting Radeon and GeForce cards

Subject: Graphics Cards | November 15, 2012 - 01:43 PM |
Tagged: amd, nvidia, catalyst, forceware

We have seen quite a few driver updates since the release of cards like the HD 7970 GHz Edition and the GTX 680 which inspired [H]ard|OCP to revisit the performance of these cards in several games.  Some results were not surprising, the two top cards have historically run neck and neck in performance and price and that remains true now.  There was a definite loser however, the performance of the GTX 660 Ti matches that of the HD7870 but the price is similar to the much faster HD7950.  Check out the full results here.

H_gputime.jpg

"With the recent release of new beta drivers from both AMD and NVIDIA, and the upping of clocks by AMD, significant performance gains have been claimed by both parties for current generation video cards. We will investigate with a 6-way roundup comparison to see if we can crown a champion."

Here are some more Graphics Card articles from around the web:

Graphics Cards

Source: [H]ard|OCP

NVIDIA Launches Tesla K20X Accelerator Card, Powers Titan Supercomputer

Subject: General Tech | November 12, 2012 - 06:29 AM |
Tagged: tesla, supercomputer, nvidia, k20x, HPC, CUDA, computing

Graphics card manufacturer NVIDIA launched a new Tesla K20X accelerator card today that supplants the existing K20 as the top of the line model. The new card cranks up the double and single precision floating point performance, beefs up the memory capacity and bandwidth, and brings some efficiency improvements to the supercomputer space.

NVIDIA_Tesla_K20X_K20_GPU_Accelerator.jpg

While it is not yet clear how many CUDA cores the K20X has, NVIDIA has stated that it is using the GK110 GPU, and is running with 6GB of memory with 250 GB/s of bandwidth – a nice improvement over the K20’s 5GB at 208 GB/s. Both the new K20X and K20 accelerator cards are based on the company’s Kepler architecture, but NVIDIA has managed to wring out more performance from the K20X. The K20 is rated at 1.17 TFlops peak double precision and 3.52 TFlops peak single precision while the K20X is rated at 1.31 TFlops and 3.95 TFlops.

Screenshot (363).png

The K20X manages to score 1.22 TFlops in DGEmm, which puts it at almost three times faster than the previous generation Tesla M2090 accelerator based on the Fermi architecture.

Screenshot (362).png

Aside from pure performance, NVIDIA is also touting efficiency gains with the new K20X accelerator card. When two K20X cards are paired with a 2P Sandy Bridge server, NVIDIA claims to achieve 76% efficiency versus 61% efficiency with a 2P Sandy Bridge server equipped with two previous generation M2090 accelerator cards. Additionally, NVIDIA claims to have enabled the Titan supercomputer to reach the #1 spot on the top 500 green supercomputers thanks to its new cards with a rating of 2,120.16 MFLOPS/W (million floating point operations per second per watt).

Screenshot (359).png

NVIDIA claims to have already shipped 30 PFLOPS worth of GPU accelerated computing power. Interestingly, most of that computing power is housed in the recently unveiled Titan supercomputer. This supercomputer contains 18,688 Tesla K20X (Kepler GK110) GPUs and 299,008 16-core AMD Opteron 6274 processors. It will consume 9 megawatts of power and is rated at a peak of 27 Petaflops and 17.59 Petaflops during a sustained Linpack benchmark. Further, when compared to Sandy Bridge processors, the K20 series offers up between 8.2 and 18.1 times more performance at several scientific applications.

Screenshot (360).png

While the Tesla cards undoubtedly use more power than CPUs, you need far fewer numbers of accelerator cards than processors to hit the same performance numbers. That is where NVIDIA is getting its power efficiency numbers from.

NVIDIA is aiming the accelerator cards at researchers and businesses doing 3D graphics, visual effects, high performance computing, climate modeling, molecular dynamics, earth science, simulations, fluid dynamics, and other such computationally intensive tasks. Using CUDA and the parrallel nature of the GPU, the Tesla cards can acheive performance much higher than a CPU-only system can. NVIDIA has also engineered software to better parrellelize workloads and keep the GPU accelerators fed with data that the company calls Hyper-Q and Dynamic Parallelism respectively.

It is interesting to see NVIDIA bring out a new flagship, especially another GK110 card. Systems using the K20 and the new K20X are available now with cards shipping this week and general availability later this month.

You can find the full press release below and a look at the GK110 GPU in our preview.

Anandtech also managed to get a look inside the Titan supercomputer at Oak Ridge National Labratory, where you can see the Tesla K20X cards in action.

Podcast #226 - Dual GTX 690 System from Origin, Intel's new SATA6 controller, Piledriver-based Opeterons and more!

Subject: General Tech | November 8, 2012 - 01:33 PM |
Tagged: ssd, sata6, podcast, piledriver, pcper, origin, opeteron, nvidia, Intel, genesis, corsair, amd, 690

PC Perspective Podcast #226 - 11/08/2012

Join us this week as we talk about a Dual GTX 690 System from Origin, Intel's new SATA6 controller, Piledriver-based Opeterons and more!

You can subscribe to us through iTunes and you can still access it directly through the RSS page HERE.

The URL for the podcast is: http://pcper.com/podcast - Share with your friends!

  • iTunes - Subscribe to the podcast directly through the iTunes Store
  • RSS - Subscribe through your regular RSS reader
  • MP3 - Direct download link to the MP3 file

Hosts: Ryan Shrout, Jeremy Hellstrom, Josh Walrath, and  Allyn Malventano

This Podcast is brought to you by MSI!

Program length: 1:21:17

Podcast topics of discussion:

  1. Join us for the MoH Game Stream!
  2. Week in Reviews:
    1. 0:04:30 Corsair Vengeance C70 Case
    2. 0:07:30 ASUS P8Z77 WS Motherboard
    3. 0:12:20 ORIGIN Genesis Dual GTX 690 System
    4. 0:16:40 Silverstone 450 watt SFX Power Supply
  3. 0:19:30 This podcast is brought to you by MSI
  4. News items of interest:
    1. 0:20:25 Intel Crystal Forest Communications Platform
    2. 0:23:30 Google Nexus 10 tablet
    3. 0:27:00 Corsair Hydro H100i and H80i coolers
    4. 0:34:00 New Corsair AXi series power supplies
    5. 0:36:30 Intel DC S3700 Enterprise SSD
    6. 0:46:30 AMD Launches Piledriver based Opteron 6300 chips
    7. 0:51:10 Get Assassin's Creed III for Samsung SSD
    8. 0:52:45 Limited Linux Steam Beta starts
    9. 0:56:15 Zotac AD06 with new AMD APU
    10. 0:58:30 Mouse.. DRM!?
  5. Closing:
    1. Hardware / Software Pick of the Week
      1. Ryan: Corsair Vengence MM200 and MM400 Mouse Mats
      2. Jeremy: Movember and Is this thing on or did it crash? or NewEgg
      3. Josh: Everyone needs a mouse
      4. Allyn: Shure SE315-CL and CBL-M+-K
  1. 1-888-38-PCPER or podcast@pcper.com
  2. http://pcper.com/podcast
  3. http://twitter.com/ryanshrout and http://twitter.com/pcper
  4. Closing/outro

Be sure to subscribe to the PC Perspective YouTube channel!!