Subject: Processors | November 13, 2013 - 05:35 PM | Josh Walrath
Tagged: Puma, Mullins, mobile, Jaguar, GCN, beema, apu13, APU, amd, 2014
AMD’s APU13 is all about APUs and their programming, but the hardware we have seen so far has been dominated by the upcoming Kaveri products for FM2+. It seems that AMD has more up their sleeves for release this next year, and it has somewhat caught me off guard. The Beema and Mullins based products are being announced today, but we do not have exact details on these products. The codenames have been around for some time now, but interest has been minimal since they are evolutionary products based on Kabini and Temash APUs that have been available this year. Little did I know that things would be far more interesting than that.
The basis for Beema and Mullins is the Puma core. This is a highly optimized revision of Jaguar, and in some ways can be considered a new design. All of the basics in terms of execution units, caches, and memory controllers are the same. What AMD has done is go through the design with a fine toothed comb and make it far more efficient per clock than what we have seen previously. This is still a 28 nm part, but the extra attention and love lavished upon it by AMD has resulted in a much more efficient system architecture for the CPU and GPU portions.
The parts will be offered in two and four core configurations. Beema will span from 10W to 25W configurations. Mullins will go all the way down to “2W SDP”. SDP essentially means that while the chip can be theoretically rated higher, it will rarely go above that 2W envelope in the vast majority of situations. These chips are expected to be around 2X more efficient per clock than the previous Jaguar based products. This means that at similar clock speeds, Beema and Mullins will pull far less power than that previous gen. It should also allow some higher clockspeeds at the top end 25W area.
These will be some of the first fanless quad cores that AMD will introduce for the tablet market. Previously we have seen tablets utilize the cut down versions of Temash to hit power targets, but with this redesign it is entirely possible to utilize the fully enabled quad core Mullins. AMD has not given us specific speeds for these products, but we can guess that they will be around what we see currently, but the chip will just have a lower TDP rating.
AMD is introducing their new security platform based on the ARM Trustzone. Essentially a small ARM Cortex A5 is integrated in the design and handles the security aspects of this feature. We were not briefed on how this achieves security, but the slide below gives some of the bullet points of the technology.
Since the pure-play foundries will not have a workable 20 nm process for AMD to jump to in a timely manner, AMD had no other choice but to really optimize the Jaguar core to make it more competitive with products from Intel and the ARM partners. At 28 nm the ARM ecosystem has a power advantage over AMD, while at 22 nm Intel offers similar performance to AMD but with greater power efficiency.
This is a necessary update for AMD as the competition has certainly not slowed down. AMD is more constrained obviously by the lack of a next-generation process node available for 1H 2014, so a redesign of this magnitude was needed. The performance per watt metric is very important here, as it promises longer battery life without giving up the performance people received from the previous Kabini/Temash family of APUs. This design work could be carried over to the next generation of APUs using 20 nm and below, which hopefully will keep AMD competitive with the rest of the market. Beema and Mullins are interesting looking products that will be shown off at CES 2014.
Subject: Graphics Cards, Processors | November 12, 2013 - 06:10 PM | Ryan Shrout
Tagged: amd, Kaveri, APU, video, hsa
Yesterday at the AMD APU13 developer conference, the company showed off the upcoming Kaveri APU running Battlefield 4 completely on the integrated graphics. I was able to push the AMD guys along and get a little more personal demo to share with our readers. The Kaveri APU had some of its details revealed this week:
- Quad-core Steamroller x86
- 512 Stream Processor GPU
- 856 GFLOPS of theoretical performance
- 3.7 GHz CPU clock speed, 720 MHz GPU clock speed
AMD wanted to be sure we pointed out in this video that the estimate clock speeds for FLOP performance may not be what the demo system was run at (likely a bit lower). Also, the version of Battlefield 4 here is the standard retail version and with further improvements from the driver team as the upcoming Mantle API implementation will likely introduce even more performance for the APU.
The game was running at 1920x1080 with MOSTLY medium quality settings (lighting set to low) but the results still looked damn impressive and the frame rates were silky and smooth. Considering this is running on a desktop with integrated processor graphics, the game play experience is simply unmatched.
Memory in the system was running at 2133 MHz.
The second demo looks at the image decoding acceleration that AMD is going to enable with Kaveri APUs upon release with a driver. Essentially, as the demonstration shows in the video, AMD is overwriting the integrated Windows JPG decompression algorithm with a new one that utilizes HSA to accelerate on both the x86 and SIMD (GPU) portions of the silicon. For the most strenuous demo that used 22 MP images saw a 100% increase in performance compared to the Kaveri CPU cores alone.
More Details from Lisa Su
The executives at AMD like to break their own NDAs. Then again, they are the ones typically setting these NDA dates, so it isn’t a big deal. It is no secret that Kaveri has been in the pipeline for some time. We knew a lot of the basic details of the product, but there were certainly things that were missing. Lisu Su went up onstage and shared a few new details with us.
Kaveri will be made up of 4 “Steamroller” cores, which are enhanced versions of the previous Bulldozer/Trinity/Vishera families of products. Nearly everything in the processor is doubled. It now has dual decode, more cache, larger TLBs, and a host of other smaller features that all add up to greater single thread performance and better multi-threaded handling and performance. Integer performance will be improved, and the FPU/MMX/SSE unit now features 2 x 128 bit FMAC units which can “fuse” and support AVX 256.
However, there was no mention of the fabled 6 core Kaveri. At this time, it is unlikely that particular product will be launched anytime soon.
AMD is up to some interesting things. Today at AMD’s tech day, we discovered a veritable cornucopia of information. Some of it was pretty interesting (audio), some was discussed ad-naseum (audio, audio, and more audio), and one thing in particular was quite shocking. Mantle was the final, big subject that AMD was willing to discuss. Many assumed that the R9 290X would be the primary focus of this talk, but in fact it very much was an aside that was not discussed at any length. AMD basically said, “Yes, the card exists, and it has some new features that we are not going to really go over at this time.” Mantle, as a technology, is at the same time a logical step as well as an unforeseen one. So what all does Mantle mean for users?
Looking back through the mists of time, when dinosaurs roamed the earth, the individual 3D chip makers all implemented low level APIs that allowed programmers to get closer to the silicon than what other APIs such as Direct3D and OpenGL would allow. This was a very efficient way of doing things in terms of graphics performance. It was an inefficient way to do things for a developer writing code for multiple APIs. Microsoft and the Kronos Group had solutions with Direct3D and OpenGL that allowed these programmers to develop for these high level APIs very simply (comparatively so). The developers could write code that would run D3D/OpenGL, and the graphics chip manufacturers would write drivers that would interface with Direct3D/OpenGL, which then go through a hardware abstraction layer to communicate with the hardware. The onus was then on the graphics people to create solid, high performance drivers that would work well with DirectX or OpenGL, so the game developer would not have to code directly for a multitude of current and older graphics cards.
Retiring the Workhorses
There is an inevitable shift coming. Honestly, this has been quite obvious for some time, but it has just taken AMD a bit longer to get here than many have expected. Some years back we saw AMD release their new motto, “The Future is Fusion”. While many thought it somewhat interesting and trite, it actually foreshadowed the massive shift from monolithic CPU cores to their APUs. Right now AMD’s APUs are doing “ok” in desktops and are gaining traction in mobile applications. What most people do not realize is that AMD will be going all APU all the time in the very near future.
We can look over the past few years and see that AMD has been headed in this direction for some time, but they simply have not had all the materials in place to make this dramatic shift. To get a better understanding of where AMD is heading, how they plan to address multiple markets, and what kind of pressures they are under, we have to look at the two major non-APU markets that AMD is currently hanging onto by a thread. In some ways, timing has been against AMD, not to mention available process technologies.
Subject: Processors, Mobile | September 4, 2013 - 11:32 AM | Ryan Shrout
Tagged: Temash, ifa 2013, asus, APU, amd, a4-1200
The hits just keep coming from ASUS this morning with the announcement of a new ultraportable notebook with the ambiguous name of X102BA. Though the name might not be catchy the device itself is more interesting because of the hardware that is powering it, specifically an AMD Temash A4-1200 APU.
This marks one of the few highly visible systems being powered by the AMD Temash architecture and I will be very curious to its reception. The APU itself is a dual-core part that runs at 1.0 GHz with integrated Radeon HD 8180 graphics that is more than enough for a modest Windows 8 working environment. There is a quad-core variant of Temash available but ASUS decided to go with the dual-core option. If you need more information on the new architecture that AMD created for Kabini and Temash (based on Jaguar CPU cores and GCN GPU cores) then you should see our coverage from their announcement back in May.
The rest of the specifications are a bit more tame, including a 1366x768 10.1-in 10-point multi-touch screen, USB 3.0, 802.11n WIFI, bundled Microsoft Office Home and Student 2013 and a touted 2-second resume time.
Even though the battery life is only listed at 5 hours, the 2.4 pound weight makes the X102BA a very portable machine. Plus you can get it in Hot Pink!
Subject: General Tech, Graphics Cards, Processors | August 16, 2013 - 04:00 PM | Scott Michaud
Tagged: nvidia, Intel, APU, amd
Despite a slight decline in PC sales compared to last quarter, graphics processors are on the rise. Jon Peddie Research attributes the heightened interest in graphics, with a decline in systems, to a trend towards multiple GPUs in a system. Crossfire and SLI, according to the report, are not driving this drift but they are relevant. More importantly, consumers are adding discrete graphics to systems with integrated solutions.
AMD has experienced an increase in shipments of 47% for laptop APUs. Desktop heterogeneous processors declined but, in all, shipments increased 11%. Intel, likewise, saw an increase albeit just 6%. NVIDIA declined 8%. AMD now enjoys a 5.8% lead in total market share over NVIDIA.
Many PCs have access to multiple graphics processors simultaneously. With an increase of available GPUs, software developers might take the plunge into fully supporting heterogeneous architectures. You could imagine a game which offloads physics or AI pathfinding to secondary graphics. Sure, the increased heat would slightly limit the turbo-performance of the CPU, but the increased parallel performance should overtake that decreased serial performance for a sensible developer.
JPR claims an average of nearly 1.4 GPUs available per system.
The increased laptop heterogeneous processors is a major win for AMD. Still, I wonder how much Never Settle played in to users dropping discrete graphics into machines which would otherwise have integrated (chipset or processor) graphics. The discrete graphics market has declined and yet somehow AMD got a boost from double-attach or replaced graphics.
The report only discusses consumer x86 tablets, desktops, laptops, and some hybrid between the previous three categories. Other processor architectures or x86 servers are not covered.
Subject: Processors | July 31, 2013 - 03:59 AM | Tim Verry
Tagged: Kaveri, fm2, carrizo, APU, amd radeon, amd
Rumors recently surfaced via VR-Zone china that AMD’s Kaveri APU successor will be code-named Carrizo, and it will be compatible with the upcoming FM2+ socket and AMD A88X chipset that Kaveri will use.
AMD’s Carrizo APUs will reportedly be available in TDPs up to 65W and will feature Excavator CPU cores along with a next generation Radeon GPU. Much like Kaveri, Carrizo will be fully HSA compliant. The chips will also include support for DDR4.
Carrizo will allegedly begin sampling in August 2014 with mass production starting around December. That means Carrizo will be available for purchase within the first half of 2015.
FM2+ boards like the ASUS A55BM-A/USB3 are rumored to support AMD's Carrizo APUs (the successor to Kaveri).
The rumors also suggest that Carrizo will be joined by a low power “Beema” System on a Chip (SoC) and a BGA-based Nolan APU for embedded systems. Details on these complementary chips are scarce, however. Perhaps most telling is last bit of the article that suggests that AMD will not be releasing a AM3+ Vishera CPU-only processor successor. It seems AMD is going all in as an APU company after all.
I have been looking forward to the launh of AMD's Kaveri since AFDS 2012, and Carrizo appears to be a refinement of that chip. It should be more power efficient and faster thanks to architecture tweaks and process shrinks. I think that AMDs architecture and HSA approach has potential, and I'm excited to see what these upcoming chips can do with regards to performance.
Subject: Motherboards | July 28, 2013 - 07:40 PM | Tim Verry
Tagged: hUMA, Kaveri, hsa, fm2, asus, APU, A88X, A55, PCI-E 3.0, mATX
ASUS recently announced two new socket FM2+ motherboards that are compatible with AMD’s upcoming “Kaveri” Accelerated Processing Units (APUs). The new boards are the A88XMA and A55BM-A/USB3 and use the A88X and A55 AMD chipsets respectively. Pricing and availability have not yet been released, but the new boards confirm that users will need new motherboards in order to take advantage of AMD’s next generation APUs (though the new FM2+ boards are backwards compatible with the existing APUs, it will not work the other way around). Both motherboards should be available around the time of the Kaveri processor launch (2H 2013).
The AMD A88XMA FM2+ Motherboard.
Both the ASUS A88XMA and A55BM-A/USB3 motherboards come in the mATX form factor. The boards both have FM2+ processor sockets and expansion slots including a single PCI-E 3.0 x16 slot, one PCI-E 2.0 x1 slot, and one legacy PCI slot. The support for PCI-E 3.0 is new for AMD motherboards, and the extra bandwidth may prove useful for as graphics cards get faster and AMD works on its hUMA and HSA architectures to create a layer of virtual memory that can be simultaneously addressed by CPUs and GPUs. There will still be latency to deal with over the PCI-E bus, but more data can be moved back and forth in the same amount of time.
The two ASUS FM2+ motherboards also share the same rear IO options, which include:
- 2 x PS/2
3 x Video outputs:
- 1 x HDMI
- 1 x DVI
- 1 x VGA
- 2 x USB 3.0
- 2 x USB 2.0
- 1 x Gigabit Ethernet
- 3 x Analog audio jacks
The RJ45 jacks are backed by a Realtek 8111G Gigabit Ethernet controller and the audio jacks are handled by a Realtek ALC887-VD chipset. Finally, they also have UEFI BIOSes in common, but from there the two boards diverge in hardware capabilities.
The ASUS A88XMA is the higher-end of the two boards, and features a FM2+ socket, four DDR3 DIMM slots, and six SATA 3 6Gbps ports. It utilizes the AMD A88X chipset which is aimed at enthusiast platforms.
ASUS' A55BM-A/USB3 budget motherboard.
On the other hand, the A55BM-A/USB3 motherboard uses the cheaper A55 chipset. That motherboard features an FM2+ socket, two DDR3 DIMM slots, and six right angle SATA 2 3Gpbs ports. The A55Bm-A/USB3 should be significantly cheaper as a result of the A55 chipset and resulting hardware reductions. In most other respects, ASUS has managed to make the two baords remarkably similar, including aesthetics and basic board layout.
According to Bit-Tech, the two boards are are part of a larger family of boards with the new FM2+ sockets. As such, we should see additional ASUS boards that fill in the gaps between the two models closer to AMD's Kaveri launch. As noted above, ASUS has not provided official pricing or release date information yet.
Subject: Processors | July 28, 2013 - 01:08 PM | Tim Verry
Tagged: Richland, overclocking, LN2, APU, amd, a10-6800k
A Finnish overclocker known as “The Stilt” recently pushed an AMD Richland APU to 8.2GHz using liquid nitrogen. In doing so, The Stilt broke the world record for APU overclocking, besting his previous overclock attempt.
Specifically, the chip was a retail version of the AMD A10-6800K “Richland” APU. It was overclocked to 8203.01 MHz with a 130.21 MHz base clock and 63x multiplier. Even more impressive is that The Stilt managed the overclock with less voltage -- 1.968 volts -- than his earlier (and lower) overclock. For comparison, the earlier overclock brought the A10-6800K to 8000.48 MHz using 2.008 volts.
The system used to overclock the APU included an ASUS F2A85-V Pro motherboard, 8GB of AMD DDR3 Performance memory, and a Radeon HD 7750 graphics card. The overclocker used liquid nitrogen to cool the APU while the GPU was left at stock settings and with its default air cooler. The RAM was overclocked to 2083.6 MHz with 10-11-10-27 timings.
In all, it is an impressive overclock considering all four CPU cores were left enabled! More details along with validation of the overclock can be found over at HWBot.
Also read: AMD A10-6800K and A10-6700 Review: Richland Finally Lands @ PC Perspective