AMD announced its third annual Developer Summit last week. Dubbed “APU13,” the upcoming summit is the AMD equivalent to NVIDIA’s GTC and is an annual event that brings together industry analysts, researchers, programmers, academics, and software/hardware companies pursuing heterogeneous computing technologies.
In previous years, the AMD Developer Summit has been the launchpad for C++ AMP and the HSA Foundation. This year’s Summit will continue that trend towards heterogeneous computing, look back over the year, and provide updates on how far the various HSA member companies have come in their move towards standards-based heterogeneous computing.
In addition to keynote speeches from AMD and some of its partners, expect a great many presentations and workshops from researchers and programmers who are working on new programming models and hardware solutions to efficiently use CPU and GPU processors. More information on hUMA is one of the likely topics, for example. Discussion about upcoming hardware, process nodes, and products may also be on the table insofar as it relates to the HSA theme. Considering the summit is called “APU13,” I also expect that AMD will reveal additional details on the company’s Kaveri APU as well as a look into its future product road map.
AMD is currently asking for presentation proposals from researchers in a number of HSA and technology-related fields including heterogeneous computing, cloud computing, web technologies, programming languages, gaming and graphics technologies, and software security. The lineup of presenters for the summit is still being worked out, and proposal papers will be accepted until May 10th with the winners being notified over the summer.
In all, AMD’s APU13 should be an exciting and intellectual event. Last year’s AMD Fusion Developer Summit (AFDS) was an interesting and fun event to cover, and I hope that APU13 will keep up the same momentum and interest in heterogeneous computing that AFDS started.
Subject: General Tech | April 30, 2013 - 01:23 PM | Jeremy Hellstrom
Tagged: Steamroller, piledriver, Kaveri, Kabini, hUMA, hsa, GCN, bulldozer, APU, amd
AMD may have united GPU and CPU into the APU, but one hurdle had remained until now: the non-uniformity of memory access between the two processors. Today we learned about one of the first successful HSA projects, called Heterogeneous Uniform Memory Access, aka hUMA, which will appear in the upcoming Kaveri chip family. This new technology will allow the on-die CPU and GPU to access the same memory pool, both physical and virtual, and any data passed between the two processors will remain coherent. As The Tech Report mentions in their overview, hUMA will not provide as much of a benefit to discrete GPUs; while they will be able to share address space, the widely differing clock speeds between GDDR5 and DDR3 prevent unification to the level of an APU.
Make sure to read Josh's take as well so you can keep up with him on the Podcast.
"At the Fusion Developer Summit last June, AMD CTO Mark Papermaster teased Kaveri, AMD's next-generation APU due later this year. Among other things, Papermaster revealed that Kaveri will be based on the Steamroller architecture and that it will be the first AMD APU with fully shared memory.
Last week, AMD shed some more light on Kaveri's uniform memory architecture, which now has a snazzy marketing name: heterogeneous uniform memory access, or hUMA for short."
Here is some more Tech News from around the web:
- AMD’s new heterogeneous Uniform Memory Access
- hUMA; AMD’s Heterogeneous Unified Memory Architecture @ Hardware Canucks
- Compro TN50W Cloud Network Camera @ Tweaktown
- Wifi Pineapple project uses updated hardware for man-in-the-middle attacks @ Hack a Day
- New OpenWRT Drops Support For Linux 2.4, Low-Mem Devices @ Slashdot
- HP mashes up ProLiant, Integrity, BladeSystem, and Moonshot server @ The Register
- Acer selling tablet using Intel Y series processor @ The Register
- CERN Celebrates 20 Years of an Open Web (and Rebuilds 1st Web Page) @ Slashdot
- BitFenix 5K YouTube Subscriber Giveaway @ eTeknix
heterogeneous Uniform Memory Access
Several years back we first heard of AMD’s plans to create a uniform memory architecture that would allow the CPU to share address spaces with the GPU. The promise here is a very efficient architecture that will provide excellent performance in a mixed environment of serial and parallel programming loads. When GPU computing came on the scene it was full of great promise. The idea of a heavily parallel processing unit that will accelerate both integer and floating point workloads could be a potential gold mine in a wide variety of applications. Alas, judging by the results we have seen so far, the technology has not met expectations. There are many problems with combining serial and parallel workloads between CPUs and GPUs, and a lot of this has to do with very basic programming and the communication of data between two separate memory pools.
CPUs and GPUs do not share common memory pools. Instead of using pointers in programming to tell each individual unit where data is stored in memory, the current implementation of GPU computing requires the CPU to write the contents of that address to the standalone memory pool of the GPU. This is time consuming and wastes cycles. It also increases programming complexity to be able to adjust to such situations. Typically only very advanced programmers with a lot of expertise in this subject could program effective operations to take these limitations into consideration. The lack of unified memory between CPU and GPU has hindered the adoption of the technology for a lot of applications which could potentially use the massively parallel processing capabilities of a GPU.
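To make the difference concrete, here is a toy sketch of the two models described above. It is only an illustration: plain Python buffers stand in for the CPU and GPU memory pools, the function names are made up for this example, and nothing here touches a real GPU or any AMD or HSA API. The "kernel" simply adds one to every byte.

```python
from typing import Tuple


def run_with_separate_pools(host_data: bytearray) -> Tuple[bytearray, int]:
    """Copy-in, compute, copy-out: the separate-memory-pool model."""
    bytes_copied = 0

    # The CPU writes the data into the (simulated) GPU memory pool.
    device_buffer = bytearray(host_data)
    bytes_copied += len(device_buffer)

    # Stand-in for the GPU kernel: increment every byte.
    for i in range(len(device_buffer)):
        device_buffer[i] = (device_buffer[i] + 1) % 256

    # The result is copied back into CPU memory.
    result = bytearray(device_buffer)
    bytes_copied += len(result)
    return result, bytes_copied


def run_with_shared_pool(shared_data: bytearray) -> Tuple[bytearray, int]:
    """Shared-address model: the 'GPU' receives a reference and works in place."""
    view = memoryview(shared_data)  # pointer-like handle, no data is copied
    for i in range(len(view)):
        view[i] = (view[i] + 1) % 256
    return shared_data, 0
```

In the first function every byte crosses between the pools twice, which is the overhead the paragraph above complains about. In the second, the caller's buffer is modified directly, so no bytes move at all. Real hardware adds coherency and scheduling costs that this sketch ignores.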
The idea for GPU compute has been around for a long time (comparatively). I still remember getting very excited about the idea of using a high end video card along with a card like the old GeForce 6600 GT to be a coprocessor which would handle heavy math operations and PhysX. That particular plan never quite came to fruition, but the idea was planted years before the actual introduction of modern DX9/10/11 hardware. It seems as if this step with hUMA could actually provide a great amount of impetus to implement a wide range of applications which can actively utilize the GPU portion of an APU.
Jaguar Hits the Embedded Space
It has long been known that AMD has simply not had a lot of luck going head to head against Intel in the processor market. Some years back they worked on differentiating themselves, and in so doing have been able to stay afloat through hard times. The acquisitions that AMD has made in the past decade are starting to make a difference in the company, especially now that the PC market that they have relied upon for revenue and growth opportunities is suddenly contracting. This of course puts a cramp in AMD’s style, but with better than expected results in their previous quarter, things are not nearly as dim as some would expect.
Q1 was still pretty harsh for AMD, but they maintained their market share in both processors and graphics chips. One area that looks to get a boost is that of embedded processors. AMD has offered embedded processors for some time, but with the way the market is heading they look to really ramp up their offerings to fit a variety of applications and SKUs. The last generation of G-series processors was based upon the Bobcat/Brazos platform. This two chip design (APU and media hub) came in a variety of wattages with good performance from both the CPU and GPU portions. While the setup looked pretty good on paper, it was not widely implemented because of the added complexity of a two chip design plus thermal concerns vs. performance.
AMD looks to address these problems with one of their first true SoC designs. The latest G-series SoCs are based upon the brand new Jaguar core from AMD. Jaguar is the successor to the successful Bobcat core, which is a low power, dual core processor with integrated DX11/VLIW5 based graphics. Jaguar improves CPU performance over Bobcat by between 6% and 13% when clocked identically, but because it is manufactured on a smaller process node it is able to do so without using as much power. Jaguar can come in both dual core and quad core packages. The graphics portion is based on the latest GCN architecture.
AMD has announced that it will be hosting an event for fans in San Francisco this weekend. The AMD Fan Day is free with registration (register here), and will give enthusiasts a chance to go hands-on with the company's 2013 hardware lineup, play several newly released (and some not-yet-released) games, talk with industry experts, check out modded PCs, and have a chance to win free hardware and swag from AMD, Corsair, and Gigabyte.
Gamers will get a chance to speak with the developers for Bioshock Infinite, Far Cry 3, Crysis 3, Devil May Cry (DMC), and Tomb Raider as well as AMD representatives. VIZIO, IGN, Ubisoft, Sapphire, and Logitech will also be attending the AMD fan day to show off their latest products.
The event will be held at City View at Metreon (address below) at 5:30pm on Saturday, April 6th. Best of all, the first 1,000 registered attendees in the door will get a free AMD A8 5600K APU. The first 120 attendees will win both an A8 5600K APU and an A85X motherboard.
One of the modded PCs that will be on the event floor.
If you're going to be in the area this weekend and are interested in going, be sure to head over to the AMD site and register. It sounds like it should be a fun time, and the free hardware doesn't hurt!
The AMD Fan Day will be held at the following address:
City View at Metreon
135 4th Street
San Francisco, CA 94103
Will you be checking out the AMD fan day to enjoy some gaming and PC hardware?
Subject: General Tech | March 31, 2013 - 02:21 AM | Tim Verry
Tagged: sony, ps4, playstation eye, playstation 4, gaming, dualshock 4, APU, amd
Sony teased a few more details about its upcoming PlayStation 4 console at the Game Developers Conference earlier this week. While the basic specifications have not changed since the original announcement, we now know more about the x86 console hardware.
The PS4 itself is powered by an AMD Jaguar CPU with eight physical cores and eight threads. Each core gets 32 KB L1 I-cache and D-cache. Further, each group of four physical cores shares 2 MB of L2 cache, for 4 MB total L2. The processor is capable of out-of-order execution, as are AMD's other processor offerings. The console also reportedly features 8 GB of GDDR5 memory that is shared by the CPU and GPU. It offers 176 GB/s of bandwidth, and is a step above the PS3, which did not use a unified memory design. The system will also sport a faster GPU rated at 1.843 TFLOPS and clocked at 800 MHz. The PS4 will have a high-capacity hard drive and a new Blu-ray drive that is up to three times faster. Interestingly, the console also has a co-processor that handles the video streaming features and allows Remote Play game streaming to the PlayStation Vita at its native resolution of 960x544.
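A few of these numbers can be cross-checked with simple arithmetic. The sketch below uses only the figures quoted above, plus one assumption that Sony has not stated here: that each GPU ALU retires one fused multiply-add (two floating point operations) per clock. The function names are made up for this example.

```python
# Figures quoted in the post.
CORE_COUNT = 8
CORES_PER_CLUSTER = 4
L2_PER_CLUSTER_MB = 2
GPU_TFLOPS = 1.843
GPU_CLOCK_MHZ = 800
MEM_BANDWIDTH_GBS = 176

# Assumption (not from the post): one fused multiply-add per ALU per clock.
FLOPS_PER_ALU_PER_CLOCK = 2


def total_l2_mb() -> int:
    """Total L2 cache across all four-core clusters."""
    clusters = CORE_COUNT // CORES_PER_CLUSTER
    return clusters * L2_PER_CLUSTER_MB


def implied_alu_count() -> float:
    """ALU count implied by the TFLOPS rating at the quoted clock."""
    flops = GPU_TFLOPS * 1e12
    clock_hz = GPU_CLOCK_MHZ * 1e6
    return flops / (clock_hz * FLOPS_PER_ALU_PER_CLOCK)


def bytes_per_flop() -> float:
    """Memory bandwidth available per floating point operation."""
    return (MEM_BANDWIDTH_GBS * 1e9) / (GPU_TFLOPS * 1e12)
```

Under that assumption the TFLOPS rating works out to roughly 1,152 ALUs, and the shared GDDR5 pool supplies a little under 0.1 bytes of bandwidth per floating point operation at peak.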
The PlayStation Eye has also been upgraded with the PS4 to include two cameras, four microphones, and a 3-axis accelerometer. The Eye cameras have an 85-degree field of view, and can record video at 1280x800 at 60Hz and 12 bits per pixel or 640x480 at 120Hz. The new PS4 Eye is a noteworthy upgrade to the current generation model, which is limited to either 640x480 pixels at 60Hz or 320x240 pixels at 120Hz. The extra resolution should allow developers to track players and controllers more accurately. The DualShock 4 controllers sport a light-bar that can be tracked by the new Eye camera, for example. The light-bar on the controllers uses an RGB LED that changes to blue, red, pink, or green for players 1-4 respectively.
Speaking of the new DualShock 4, Sony has reportedly ditched the analog face buttons and D-pad for digital buttons. With the DS3 and the PS3, the analog face buttons and D-pad came in handy with racing games, but otherwise they are not likely to be missed. The controllers will now charge even when the console is in standby mode, and the L2 and R2 triggers are more resistant to accidental pressure. The analog sticks have been slightly modified and feature a reduced dead zone. The touchpad, which is a completely new feature for the DualShock lineup, is capable of tracking two points at a resolution of 1920x900, which is pretty good.
While Sony has still not revealed what the actual PS4 console will look like, most of the internals are now officially known. It will be interesting to see just where Sony prices the new console, and where game developers are able to take it. Using a DX11.1+ feature set, developers are able to use many of the same tools used to program PC titles but also have additional debugging tools and low level access to the hardware. A new low level API that sits below DirectX but above the driver level gives developers deeper access to the shader pipeline. I'm curious to see how PC ports will turn out. With the consoles now running x86 hardware, I'm hoping that the usual fare of bugs common to titles ported from consoles to PCs will decrease. A gamer can dream, right?
Subject: General Tech | March 14, 2013 - 03:36 PM | Ken Addison
Tagged: Strider, steambox, steam, sshd, Silverstone, Seagate, Richland, quadro 6000, quadro, podcast, hybrid, APU, amd
PC Perspective Podcast #242 - 03/14/2013
Join us this week as we discuss AMD's new Richland APUs, Steam Box Prototypes, Seagate Hybrid Drives and more!
The URL for the podcast is: http://pcper.com/podcast - Share with your friends!
- iTunes - Subscribe to the podcast directly through the Store
- RSS - Subscribe through your regular RSS reader
- MP3 - Direct download link to the MP3 file
Hosts: Ryan Shrout, Jeremy Hellstrom, Josh Walrath and Allyn Malventano
Program length: 0:59:45
Podcast topics of discussion:
- Week in Reviews:
- 0:14:33 This Podcast is brought to you by MSI!
- News items of interest:
- 1-888-38-PCPER or email@example.com
- http://twitter.com/ryanshrout and http://twitter.com/pcper
Subject: Processors | March 12, 2013 - 02:52 PM | Jeremy Hellstrom
Tagged: VLIW4, trinity, Richland, piledriver, notebook, mobile, hd 8000, APU, amd, A10-5750
The differences between Richland and Trinity are not earth shattering, but there are certainly some refinements implemented by AMD in the A10-5750M. One very noticeable one is support for DDR3-1866, as well as better power management for both the CPU and GPU; with new temperature balancing algorithms and measurement, the ability to balance the load properly has improved over Trinity. Many AMD users will be more interested in the GPU portion of the die than the CPU, as that is where AMD actually has a lead on Intel. This particular chip contains the HD 8650G, with clocks of 720MHz boost and 533MHz base, an increase over the previous generation of 35MHz and 37MHz respectively. You can read more about the other three models that will be released over at The Tech Report.
"AMD has formally introduced the first members of its Richland APU family. We have the goods on the chips and Richland's new power management tech, which combines temperature-based inputs with bottleneck-aware clock boosting."
Here are some more Processor articles from around the web:
- AMD Richland APU Preview: Trinity Gets a Facelift @ Hardware Canucks
- 2013 AMD Mobile APU (Richland) @ Bjorn3D
- Westmere-EP to Sandy Bridge-EP: The Scientist Potential Upgrade @ AnandTech
- AMD Phenom II X4 955, Phenom II X4 960T, Phenom II X6 1075T and Intel Pentium G2120, Core i3-3220, Core i5-3330 @ ixbt.com
- AMD FX-8350 @ iXBT Labs
- The new Opteron 6300: Finally Tested! @ AnandTech
- Intel Core i5-3570K vs. i7-3770K Ivy Bridge @ techPowerUp
AMD Exposes Richland
When we first heard about “Richland” last year, there was a little bit of excitement from people. Not many were sure what to expect other than a faster “Trinity” based CPU with a couple extra goodies. Today we finally get to see what Richland is. While interesting, it is not necessarily exciting. While an improvement, it will not take AMD over the top in the mobile market. What it actually brings to the table is better competition and a software suite that could help to convince buyers to choose AMD instead of a competing Intel part.
From a design standpoint, it is nearly identical to the previous Trinity. That being said, a modern processor is not exactly simple. A lot of software optimizations can be applied to these products to increase performance and efficiency. It seems that AMD has done exactly that. We had heard rumors that the graphics portion was in fact changed, but it looks like it has stayed the same. Process improvements have been made, but that is about the extent of actual hardware changes to the design.
The new Richland APUs are branded the A-5000 series of products. The top end is the A10-5750M with HD 8650G integrated graphics. This is still the VLIW-4 based graphics unit seen in the previous Trinity products, but enough changes have been made in software that it can enable Dual Graphics with the new Solar System based GPUs (GCN). The speeds of these products have received a nice boost. Compared to the previous top end A10-4600M, the 5750M takes the base speed from 2.3 GHz to 2.5 GHz. Boost goes from 3.2 GHz up to 3.5 GHz. The graphics portion takes the base clock from 496 MHz up to 533 MHz, while turbo mode improves over the 4600M from 685 MHz to 720 MHz. These are not staggering figures, but it all still fits within the 35 watt TDP of the previous product.
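To put those clock bumps in perspective, here is a quick sketch that turns the old and new figures quoted above into percentage gains. The table and function names are made up for this example; only the clock values come from the text.

```python
# (previous top-end Trinity part, new top-end Richland part), clocks in MHz.
CLOCKS_MHZ = {
    "CPU base": (2300, 2500),
    "CPU boost": (3200, 3500),
    "GPU base": (496, 533),
    "GPU boost": (685, 720),
}


def uplift_percent(old: float, new: float) -> float:
    """Relative increase from old to new, in percent."""
    return (new - old) / old * 100.0


def summarize(clocks: dict) -> dict:
    """Map each clock domain to its percentage uplift, rounded to 0.1%."""
    return {
        name: round(uplift_percent(old, new), 1)
        for name, (old, new) in clocks.items()
    }
```

Calling `summarize(CLOCKS_MHZ)` shows every gain landing in the single digits, which matches the "not staggering" description. Clock speed alone is only a rough proxy for real performance, of course.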
One other important improvement is the ability to utilize DDR3-1866 memory. Throughout the past year we have seen memory densities increase fairly dramatically without impacting power consumption. This goes for speed as well. While we would expect to see lower power DIMMs used in the thin and light categories, expect to see faster DDR3-1866 in the larger notebooks that will soon be heading our way.
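For a rough sense of what the faster memory buys, theoretical peak bandwidth is just the transfer rate multiplied by the bus width. The sketch below assumes a standard 64-bit DDR3 channel and, for the two-channel figure, a dual-channel configuration; the comparison against DDR3-1600 is my own choice of baseline and is not taken from the text.

```python
BYTES_PER_TRANSFER = 8  # a standard DDR3 channel is 64 bits wide


def peak_bandwidth_gbs(mega_transfers_per_sec: float, channels: int = 1) -> float:
    """Theoretical peak bandwidth in GB/s (1 GB = 1e9 bytes)."""
    bytes_per_sec = mega_transfers_per_sec * 1e6 * BYTES_PER_TRANSFER * channels
    return bytes_per_sec / 1e9


def speedup_percent(old_rate: float, new_rate: float) -> float:
    """Relative gain in peak bandwidth when moving between transfer rates."""
    return (new_rate - old_rate) / old_rate * 100.0


# Assumed baseline for comparison only: DDR3-1600.
ddr3_1600_dual = peak_bandwidth_gbs(1600, channels=2)
ddr3_1866_dual = peak_bandwidth_gbs(1866, channels=2)
```

These are ceiling figures, not measured throughput. They matter most for the integrated graphics, which has to share that bandwidth with the CPU cores.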
Subject: Motherboards | March 8, 2013 - 06:30 AM | Tim Verry
Tagged: roundup, motherboards, mini-itx, celeron 847, APU, amd e-450
While high end motherboards and processors tend to get the most attention from enthusiasts, sometimes less is better (*waits for Josh to stop laughing on the podcast). Low power boards with integrated processors are more often than not found inside small form factor OEM boxes, but a few are sold as a bare board and processor to serve as the basis of low power desktops, network devices, and home theater PCs. Both Intel and AMD have hats in the low power game, and Hartware.de has pitted four such low power boards against each other. The MSI C847MS-E33-847 and Biostar NM70I pack Intel Celeron 847 CPUs, the Zotac D2550-ITX WIFI hosts an Intel Atom D2550 processor plus an NVIDIA GT 610 IGP, and the ASUS E45MI-M Pro is powered by an AMD E-450 APU.
Hartware.de puts several low power boards into the thunderdome to see which one(s) reign supreme.
As it turns out, the results are nearly in line with what one might expect. The Atom D2550-powered system was the slowest, the APU and ASUS motherboard was the fastest, and the Celeron was somewhere in the middle. However, the AMD E-450 APU used the most power, and the system was one of the most expensive. Interestingly, the Atom system was not all that much more power efficient than the Celeron despite the lower performance and weaker hardware. The Celeron 847 chip had decent CPU performance, mid-range power draw, and some of the best thermals. All of the configurations were able to play back media, but the AMD system gave the most fluid results.
If you are in the market for low power system parts, the review is worth checking out.
Here are some additional Motherboard reviews from around the web:
- GIGABYTE Z77N-WiFi Mini-ITX @ TweakTown
- ASRock Z77 Pro4-M LGA 1155 @ HardOCP
- Gigabyte GA-F2A85X-UP4 FM2 @ PC Perspective
- ASRock's Z77E-ITX Mini ITX @ The Tech Report
- ASUS Sabertooth 990FX R2.0 @ OCaholic
I'm pleasantly surprised at all the Mini-ITX motherboards being made lately.