hUMA's new trick; standardised task dispatch packets

Subject: General Tech | October 23, 2013 - 10:53 AM |
Tagged: amd, hUMA

hUMA's currently well know trick, a shared memory space which both the CPU and GPU can access without penalty is only the first of its revealed optimizations.  The Register talks today about another way in which this new architecture allows the CPU and GPU equal treatment, standardized task queues and dispatch packets which avoid dealing with a kernel level driver to assign tasks.  With hUMA the GPU is able to shedule tasks for the CPU directly.  That would allow any application that was designed to hUMA standards to have its various tasks assigned to the proper processor without needing extra coding.  This not only makes it cheaper and quicker to design apps but would allow all hUMA apps to take advantage of the specialized abilities of both the CPU and GPU at no cost.

reg_hq_1.jpg

"The upcoming chips will utilise a technique AMD calls Heterogeneous Queuing (hQ). This new approach puts the GPU on an equal footing with the CPU: no longer will the graphics engine have to wait for the central processor to tell it what to do."

Here is some more Tech News from around the web:

Tech Talk

Source: The Register

Podcast #266 - Corsair Air 540 Case, MSI GTX 780 Lightning, hUMA in the PS4, and more!

Subject: General Tech | August 29, 2013 - 11:45 AM |
Tagged: xbox one, video, ps4, podcast, msi, hUMA, hsa, gtx 780, corsair, case, amd, air 540, 780 lightning

PC Perspective Podcast #266 - 08/29/2013

Join us this week as we discuss the Corsair Air 540 Case, MSI GTX 780 Lightning, hUMA in the PS4, and more!

You can subscribe to us through iTunes and you can still access it directly through the RSS page HERE.

The URL for the podcast is: http://pcper.com/podcast - Share with your friends!

  • iTunes - Subscribe to the podcast directly through the iTunes Store
  • RSS - Subscribe through your regular RSS reader
  • MP3 - Direct download link to the MP3 file

Hosts: Ryan Shrout, Josh Walrath, Allyn Malventano, and Morry Teitelman

Program length: 1:12:24

  1. Week in Review:
  2. News items of interest:
  3. Hardware/Software Picks of the Week:
    1. Allyn: Samsung 840 EVO pricing was definitely not kidding!
  4. 1-888-38-PCPER or podcast@pcper.com
  5. Closing/outro

Sony PlayStation 4 (PS4) will NOT utilize AMD hUMA with Kabini-based SoC

Subject: General Tech, Graphics Cards, Processors | August 22, 2013 - 10:39 AM |
Tagged: sony, ps4, playstation 4, Kabini, hUMA, amd

UPDATE: I have added new info at the bottom of this post with more commentary from AMD (kind of).

You might have seen some reports in the last couple of days claiming that the upcoming Sony PlayStation 4 (PS4) will have a big advantage over the Xbox One thanks to its unique ability to support AMD's hUMA memory architecture.  hUMA, heterogeneous unified memory architecture, is an exciting new memory technology that AMD has built into upcoming APUs.

huma_01.jpg

Josh published a story on hUMA that sums it as so:

The idea behind hUMA is quite simple; the CPU and GPU share memory resources, they are able to use pointers to access data that has been processed by either one or the other, and the GPU can take page faults and not rely only on page locked memory.  Memory in this case is bi-directionally coherent, so coherency issues with data in caches which are later written to main memory will not cause excessive waits for either the CPU or GPU to utilize data that has been changed in cache, but not yet written to main memory.

ps4-hrdware-large18.jpg

There's just one problem with these various reports (VR-Zone, ExtremeTech): they're incorrect.  After sending some emails to our representatives at AMD I was told that "Kabini doesn't support hUMA" which is the APU that both the PS4 and Xbox One processors are based on.  AMD further clarified with us:

Our spokesperson made inaccurate statements about our semi-custom APU architectures and does not speak for Microsoft, Sony or the AMD semi-custom business unit responsible for co-developing the next generation console APUs.

xboxone.png

So while the PS4 will still be a faster system thanks to its higher SIMD processor (GPU core) count, there is no support for a true heterogeneous unified memory architecture in either upcoming console platform.

NOTE: I have had several people point out that it's possible Sony and Microsoft worked on their own custom memory architectures that will perform similar functionally to hUMA.  That is entirely possible but means that official hUMA support isn't on the SoCs.

UPDATE: AMD contacted me again to make another comment.  Essentially, they said that the correction statement to the original statement claiming hUMA was part PS4 was "inaccurrate" but that this correction does NOT mean the opposite claim is true.  Even when pressed for a more specific and debate-ending comment, AMD wouldn't give us any more information. 

So does the PS4 have support for some type of heterogeneous unified memory?  Maybe.  And the Xbox One?  Maybe.  At this point, I'd stop listening to anything AMD has to say on the subject as they are likely to recant it shortly thereafter.  Many readers have emailed me with their thoughts and I personally feel that its more likely the original statement from AMD (that the PS4 will have the edge with a hUMA design) will turn out to be the truth in the long run...

ASUS Shows Off FM2+ Motherboards Compatible With AMD Kaveri APUs

Subject: Motherboards | July 28, 2013 - 04:40 PM |
Tagged: hUMA, Kaveri, hsa, fm2, asus, APU, A88X, A55, PCI-E 3.0, mATX

ASUS recently announced two new socket FM2+ motherboards that are compatible with AMD’s upcoming “Kaveri” Accelerated Processing Units (APUs). The new boards are the A88XMA and A55BM-A/USB3 and use the A88X and A55 AMD chipsets respectively. Pricing and availability have not yet been released, but the new boards confirm that users will need new motherboards in order to take advantage of AMD’s next generation APUs (though the new FM2+ boards are backwards compatible with the existing APUs, it will not work the other way around). Both motherboards should be available around the time of the Kaveri processor launch (2H 2013).

ASUS A88XMA Kaveri Motherboard (2).jpg

The AMD A88XMA FM2+ Motherboard.

Both the ASUS A88XMA and A55BM-A/USB3 motherboards come in the mATX form factor. The boards both have FM2+ processor sockets and expansion slots including a single PCI-E 3.0 x16 slot, one PCI-E 2.0 x1 slot, and one legacy PCI slot. The support for PCI-E 3.0 is new for AMD motherboards, and the extra bandwidth may prove useful for as graphics cards get faster and AMD works on its hUMA and HSA architectures to create a layer of virtual memory that can be simultaneously addressed by CPUs and GPUs. There will still be latency to deal with over the PCI-E bus, but more data can be moved back and forth in the same amount of time.

ASUS A88XMA Kaveri Motherboard Rear IO.jpg

The two ASUS FM2+ motherboards also share the same rear IO options, which include:

  • 2 x PS/2
  • 3 x Video outputs:
    • 1 x HDMI
    • 1 x DVI
    • 1 x VGA
  • 2 x USB 3.0
  • 2 x USB 2.0
  • 1 x Gigabit Ethernet
  • 3 x Analog audio jacks

The RJ45 jacks are backed by a Realtek 8111G Gigabit Ethernet controller and the audio jacks are handled by a Realtek ALC887-VD chipset. Finally, they also have UEFI BIOSes in common, but from there the two boards diverge in hardware capabilities.

The ASUS A88XMA is the higher-end of the two boards, and features a FM2+ socket, four DDR3 DIMM slots, and six SATA 3 6Gbps ports. It utilizes the AMD A88X chipset which is aimed at enthusiast platforms.

ASUS A55BM-AUSB3 Kaveri Motherboard.jpg

ASUS' A55BM-A/USB3 budget motherboard.

On the other hand, the A55BM-A/USB3 motherboard uses the cheaper A55 chipset. That motherboard features an FM2+ socket, two DDR3 DIMM slots, and six right angle SATA 2 3Gpbs ports. The A55Bm-A/USB3 should be significantly cheaper as a result of the A55 chipset and resulting hardware reductions. In most other respects, ASUS has managed to make the two baords remarkably similar, including aesthetics and basic board layout.

According to Bit-Tech, the two boards are are part of a larger family of boards with the new FM2+ sockets. As such, we should see additional ASUS boards that fill in the gaps between the two models closer to AMD's Kaveri launch. As noted above, ASUS has not provided official pricing or release date information yet.

Source: Bit-Tech

Podcast #249 - Corsair 350D, Frame Rating in 4K, the Oculus Rift and more!

Subject: General Tech | May 2, 2013 - 11:59 AM |
Tagged: podcast, video, Indiegogo, corair, obsidian, 350d, mATX, frame rating, 4k, titan, 7990, 690, Oculus, rift, VR, 3d, amd, amd fx, vishera, hUMA, hsa

PC Perspective Podcast #249 - 05/02/2013

Join us this week as we discuss the Corsair 350D, Frame Rating in 4K, the Oculus Rift and more!

You can subscribe to us through iTunes and you can still access it directly through the RSS page HERE.

The URL for the podcast is: http://pcper.com/podcast - Share with your friends!

  • iTunes - Subscribe to the podcast directly through the iTunes Store
  • RSS - Subscribe through your regular RSS reader
  • MP3 - Direct download link to the MP3 file

Hosts: Josh Walrath, Allyn Malventano, Scott Michaud and Morry Teitelman

Program length: 1:04:02

  1. Week in Review:
  2. News items of interest:
    1. Ryan: Windows Movie Maker lets you record webcam videos!
    2. Jeremy: anti-pick - hotels that don't offer a RJ45 jack in the rooms!
    3. Allyn: Ventev USB charging stuff (home / auto)
  3. 1-888-38-PCPER or podcast@pcper.com
  4. Closing/outro

 

AMD Planning APU13 Developer Summit In San Jose, California

Subject: General Tech | May 1, 2013 - 04:08 AM |
Tagged: hUMA, hsa, apu13, APU, amd, AFDS

AMD announced its third annual Developer Summit last week. Dubbed “APU13,” the upcoming summit is the AMD equivalent to NVIDIA’s GTC and is an annual event that brings together industry analysts, researchers, programmers, academics, and software/hardware companies pursuing heterogeneous computing technologies.

In previous years, the AMD Developer Summit has been the launchpad for C++ AMP and the HSA Foundation. This year’s Summit will continue that trend towards heterogeneous computing as well as look back over the year and provide updates on where the various HSA member companies are at as far as goals to move towards standards-based heterogenous computing.

AMD Logo.png

In addition to keynote speeches from AMD and some of its partners, expect a great deal of presentations and workshops from researchers and programmers that are working on new programming models and hardware solutions to efficiently use CPU and GPU processors. More information on hUMA is one of the likely topics, for example. Discussion about upcoming hardware, process nodes, and products may also be on the table so far as it relates to the HSA theme. Considering the summit is called “APU13,” I also expect that AMD will reveal additional details on the company’s Kaveri APU as well as a look into its future product road map.

AMD is currently asking for presentation proposals from researchers in a number of HSA and technology-related fields including heterogeneous computing, cloud computing, web technologies, programming languages, gaming and graphics technologies, and software security. The lineup of presenters for the summit is still being worked out, and proposal papers will be accepted until May 10th with the winners being notified over the summer.

In all, AMD’s APU13 should be an exciting and intellectual event. Last year’s AMD Fusion Developer Summit (AFDS) was an interesting and fun event to cover, and I hope that APU13 will keep up the same momentum and interest in heterogeneous computing that AFDS started.

Source: AMD

hUMA has come with a weapon to slay the memory latency dragon

Subject: General Tech | April 30, 2013 - 10:23 AM |
Tagged: Steamroller, piledriver, Kaveri, Kabini, hUMA, hsa, GCN, bulldozer, APU, amd

AMD may have united GPU and CPU into the APU but one hurdle had remained until now, the the non-uniformity of memory access between the two processors.  Today we learned about one of the first successful HAS projects called Heterogeneous Uniform Memory Access, aka hUMA, which will appear in the upcoming Kaveri chip family.   The use of this new technology will allow the on-die CPU and GPU to access the same memory pool, both physical and virtual and any data passed between the two processors will remain coherent.  As The Tech Report mentions in their overview hUMA will not provide as much of a benefit to discrete GPUs, while they will be able to share address space the widely differing clock speeds between GDDR5 and DDR3 prevent unification to the level of an APU.

Make sure to read Josh's take as well so you can keep up with him on the Podcast.

huma_02.jpg

"At the Fusion Developer Summit last June, AMD CTO Mark Papermaster teased Kaveri, AMD's next-generation APU due later this year. Among other things, Papermaster revealed that Kaveri will be based on the Steamroller architecture and that it will be the first AMD APU with fully shared memory.

Last week, AMD shed some more light on Kaveri's uniform memory architecture, which now has a snazzy marketing name: heterogeneous uniform memory access, or hUMA for short."

Here is some more Tech News from around the web:

Tech Talk

Author:
Subject: Processors
Manufacturer: AMD

heterogeneous Uniform Memory Access

 

Several years back we first heard AMD’s plans on creating a uniform memory architecture which will allow the CPU to share address spaces with the GPU.  The promise here is to create a very efficient architecture that will provide excellent performance in a mixed environment of serial and parallel programming loads.  When GPU computing came on the scene it was full of great promise.  The idea of a heavily parallel processing unit that will accelerate both integer and floating point workloads could be a potential gold mine in wide variety of applications.  Alas, the promise of the technology did not meet expectations when we have viewed the results so far.  There are many problems with combining serial and parallel workloads between CPUs and GPUs, and a lot of this has to do with very basic programming and the communication of data between two separate memory pools.

huma_01.jpg

CPUs and GPUs do not share common memory pools.  Instead of using pointers in programming to tell each individual unit where data is stored in memory, the current implementation of GPU computing requires the CPU to write the contents of that address to the standalone memory pool of the GPU.  This is time consuming and wastes cycles.  It also increases programming complexity to be able to adjust to such situations.  Typically only very advanced programmers with a lot of expertise in this subject could program effective operations to take these limitations into consideration.  The lack of unified memory between CPU and GPU has hindered the adoption of the technology for a lot of applications which could potentially use the massively parallel processing capabilities of a GPU.

The idea for GPU compute has been around for a long time (comparatively).  I still remember getting very excited about the idea of using a high end video card along with a card like the old GeForce 6600 GT to be a coprocessor which would handle heavy math operations and PhysX.  That particular plan never quite came to fruition, but the idea was planted years before the actual introduction of modern DX9/10/11 hardware.  It seems as if this step with hUMA could actually provide a great amount of impetus to implement a wide range of applications which can actively utilize the GPU portion of an APU.

Click here to continue reading about AMD's hUMA architecture.