NVIDIA Introduces Kepler to the Ultra-Mobile Market and Tegra

Manufacturer: NVIDIA

NVIDIA Finally Gets Serious with Tegra

Tegra has had an interesting run.  The original Tegra 1 was utilized only by Microsoft with the Zune.  Tegra 2 saw better adoption, but did not produce the design wins needed to propel NVIDIA to a leadership position in cell phones and tablets.  Tegra 3 found a spot in Microsoft’s Surface, but that has turned out to be a far more bitter experience than expected.  Tegra 4 has so far been integrated into a handful of products and is featured in NVIDIA’s upcoming Shield.  It also hit production snags that made it later to market than expected.

I think the primary issue with the first three generations of products is pretty simple.  There was a distinct lack of differentiation from the other ARM-based products around.  Yes, NVIDIA brought its graphics prowess to the market, but never in a form that adequately distanced it from the competition.  Tegra 2 boasted GeForce-based graphics, but we did not find out until later that it was composed of basically four pixel shaders and four vertex shaders that had more in common with the GeForce 7800/7900 series than with any of the modern unified architectures of the time.  Tegra 3 boasted a big graphical boost, but it came in the form of doubling the pixel shader units and leaving the vertex units alone.

While NVIDIA had very strong developer relations and a leg up on the competition in terms of software support, it was never enough to propel Tegra beyond a handful of devices.  NVIDIA is trying to rectify that with Tegra 4 and the 72 shader units that it contains (still divided between pixel and vertex units).  Tegra 4 is not perfect: it is late to market and the GPU is not OpenGL ES 3.0 compliant.  ARM, Imagination Technologies, and Qualcomm are offering new graphics processing units that are not only OpenGL ES 3.0 compliant, but also support OpenCL 1.1.  Tegra 4 does not support OpenCL.  In fact, it does not even support NVIDIA’s in-house CUDA.  Ouch.

Jumping into a new market is not an easy thing, and invariably mistakes will be made.  NVIDIA worked hard to build a solid foundation with its products, and certainly it had to learn to walk before it could run.  Unfortunately, running effectively entails winning designs through outstanding features, performance, and power consumption.  NVIDIA was really only average in all of those areas.  NVIDIA is hoping to change that.  Its first attempt at a product with features and support a step above the competition is what we are talking about today.


Logan Gets Sampled

The next iteration of the Tegra line is code named Logan.  Little is known about this part in general.  We are assuming that it will be NVIDIA’s second Tegra on 28 nm.  Tegra 3 was still fabbed on 40 nm and Tegra 4 is the first 28 nm part.  We do not know if it is a 4+1 setup like previous versions or if NVIDIA will adopt the big.LITTLE setup that is popular now with other manufacturers.  These details remain unknown because NVIDIA has not disclosed them.  It has disclosed one thing though.  NVIDIA is finally getting with the program when it comes to graphics on mobile.

Kepler was introduced last year with the GeForce 600 series of products and it was a great success for the company.  There were certainly some tradeoffs to the architecture, but overall it was extremely power efficient at the high end and exceptionally focused on the primary workload that it was being used for.  It was a graphics processing machine.  NVIDIA sacrificed GPGPU performance in the name of rendering efficiency and speed.  Kepler could still do the work, but it was hindered compared to previous architectures in the name of graphics processing power.  This is not necessarily a bad thing, as it scaled nicely throughout the entire product line, though perhaps not as efficiently as hoped with the GTX 660 and GTX 650 Ti BOOST lines.  Still, Kepler is certainly a success for NVIDIA, as it has also translated to the extreme high end with Titan, Quadro, and Tesla.  These products are not hindered when addressing GPGPU, OpenCL, and CUDA workloads.

If the reader has not guessed by now, Logan will be utilizing Kepler for the integrated graphics on this particular SoC (System on a Chip).  NVIDIA will be providing a licensed ARM processing core and pairing it with a single SMX of 192 CUDA cores.  These are unified shader cores, which are a big step up from the separate pixel/vertex shaders of the previous Tegra parts.  NVIDIA is also jumping from 72 pixel/vertex units in Tegra 4 to a full 192 unified units in Logan.  These units are also much more advanced than the first generation of unified shaders contained in the now legendary GeForce 8800 GTX.  That particular product (a powerhouse for the time) featured 128 unified DX10 shaders.  On paper, this GPU is more powerful than the 8800 GTX.

We were not given the clock speeds that this product will be running at, but we do know that it is aimed at the 2 watt area.  In handheld devices 2 watts will be total SoC power, while tablets and other larger form factors will scale up in wattage as needed.  Considering that the original 8800 GTX was in the 225 watt range, this is a pretty impressive shrink.  Obviously the mobile products will be at a disadvantage when it comes to memory bandwidth and ROP partitions, but in pure floating point output the single Kepler SMX in Logan should outrun the old 8800 GTX.
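
As a rough illustration of the floating point comparison above, here is a back-of-the-envelope sketch in Python.  The core counts come from this article; the 8800 GTX shader clock of 1350 MHz and the convention of counting one multiply-add (two floating point operations) per core per clock are assumptions brought in for the example, and the function names are hypothetical.  Since NVIDIA has not disclosed Logan's clock, the sketch does not guess one.  It solves for the GPU clock at which a single 192 core SMX would match the older card on paper.

```python
# Back-of-the-envelope peak throughput comparison.
# Assumption: each shader core retires one multiply-add per clock,
# which is counted as 2 floating point operations.
FLOPS_PER_CORE_PER_CLOCK = 2


def peak_gflops(cores: int, shader_clock_mhz: float) -> float:
    """Theoretical peak single-precision GFLOPS for a given core count and clock."""
    return cores * FLOPS_PER_CORE_PER_CLOCK * shader_clock_mhz / 1000.0


def break_even_clock_mhz(cores: int, target_gflops: float) -> float:
    """Clock (MHz) at which `cores` shader cores reach `target_gflops` on paper."""
    return target_gflops * 1000.0 / (cores * FLOPS_PER_CORE_PER_CLOCK)


G80_CORES = 128            # 8800 GTX unified shaders (from the article)
G80_SHADER_MHZ = 1350.0    # assumption: shader clock is not stated in the article
LOGAN_CORES = 192          # single Kepler SMX (from the article)

g80_peak = peak_gflops(G80_CORES, G80_SHADER_MHZ)
logan_clock_needed = break_even_clock_mhz(LOGAN_CORES, g80_peak)

print(f"8800 GTX paper peak: {g80_peak:.1f} GFLOPS")
print(f"Logan GPU clock needed to match it: {logan_clock_needed:.0f} MHz")
```

Under those assumptions the break-even point lands at roughly 900 MHz, so whether the paper advantage holds depends on the clock NVIDIA actually ships.  The sketch also says nothing about memory bandwidth or ROP throughput, where the desktop part keeps its edge.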

What Kepler really brings to the table is a part that is fully compliant with all of the latest industry standards.  It does not stop at OpenGL ES 3.0, but jumps all the way to OpenGL 4.4.  It is fully CUDA 5.0 compliant, which is a massive jump from the lack of CUDA support in previous Tegra parts.  It is also DX11 compliant, which is more than likely a boon for Microsoft and its Windows RT platform.  As such it fully implements a strong tessellation engine.  From my understanding, the latest generation of competing units does not support this feature.

NVIDIA is finally showing that it can compete in the feature department in the mobile space.  This is a big jump up not only from what NVIDIA was doing, but also from what the rest of the industry is shipping.  NVIDIA is now sampling Logan to its partners for testing purposes.  We do not have a time frame for when these products will ship to consumers, but we would expect the first available products around 9 months from now.

There are still a lot of questions about Logan that cannot be answered at this time.  We do not know how well it will perform against the competition, we do not know what clock speed the GPU portion will be set at, and we certainly do not know what other “secret sauce” NVIDIA has implemented in this new product.  All we know is that NVIDIA is a very aggressive company and it now has several years of experience in the ARM market.  Samsung and Qualcomm could certainly use some better competition, especially with the specter of Intel really pushing into the mobile market with its Silvermont-based Atoms later this year.  There is some thought that Kepler might in fact show up in licensed form in the x86 market, but that rumor is neither confirmed nor denied by NVIDIA at this time.

What can be said for certain is that Logan will utilize the Kepler architecture for graphics and GPGPU functionality.  This is a huge step up for NVIDIA in the Tegra line, and one which could prove to be a big selling point for manufacturers.  Finally NVIDIA has not only matched, but leapfrogged the competition in terms of graphics features and capabilities.  Perhaps this will be the product that we have all been waiting for from NVIDIA?  Time will tell, but NVIDIA sending samples to partners at this early date is a very good sign.  Considering how much distance NVIDIA needs to cover to catch up to Qualcomm and Samsung, this is a good thing.  The world needs more competition in this field.

July 24, 2013 | 09:51 AM - Posted by windwalker

Nvidia has leapfrogged nothing.
Every Tegra chip has had the same glowing previews and lofty promises and has ended up underdelivering massively and arriving late.

Logan sounds like it may show up in products in time for the holiday season 2014. Who knows what nvidia's significantly less boastful but more successful competitors will release by then?

July 24, 2013 | 10:23 AM - Posted by Josh Walrath

A fair point considering their track record.  A lot of hype around Tegra 2, and it never materialized in sales.  Tegra 3 was the same, though it did implement some interesting features.  Tegra 4 again has a lot more features (though no CUDA/OpenCL/OpenGL ES 3.0), but it does a lot of video decoding/encoding in hardware that other ARM implementations do not.  Manufacturing issues (design most likely) have caused it to be delayed.

That being said, when we look at the latest generation of competing parts that were released not very long ago, NVIDIA trumps those products in a pretty hefty way.  Having working silicon back from the fab so quickly, and having sampled these parts to partners already, it finally looks like NVIDIA might have a competitive product for a change.  It will all depend on how quickly they can ramp production.

So, optimistic about this release as compared to the overhyped predecessors... but as always, take with a grain of salt.

July 24, 2013 | 11:07 AM - Posted by renz (not verified)

success or not, we will find out later, but right now i'm more interested in nvidia's intention to bring in gpu tech that was once not possible in mobile parts. Tegra 3 might not have been a huge success compared to competitors' products, but it was quite a boost for nvidia's tegra business. now with Logan finally CUDA compatible, maybe they can push tegra as more than just a SoC for phones/tablets or car infotainment systems. i think the decision to use 40nm on tegra 3 was one of the main reasons they could get more design wins for the processor. if they put logan on 28nm maybe they can get similar momentum as tegra 3, then adjust their product for the new process node when TSMC 20nm finally arrives. btw is there any news on qualcomm's and samsung's next chips?

July 24, 2013 | 11:29 AM - Posted by Crow (not verified)

I smell complete BS...

nVidia's GPU's are not as efficient as AMD's.

July 24, 2013 | 11:39 AM - Posted by renz (not verified)

this has nothing to do with AMD. anyway with kepler they prove they can get something better than fermi in terms of power consumption.

July 24, 2013 | 01:36 PM - Posted by Anonymous (not verified)

Dude, they are FAR more efficient. The 195W GTX680 is pretty much dead-on with the 230W HD7970. For the 7970GE's 250W TDP, you could have a GTX Titan from Nvidia's side.

July 24, 2013 | 03:23 PM - Posted by arbiter

HD7970 was slower than gtx680; the 7970GE is a hair faster, but they trade leads in some games.

July 24, 2013 | 06:34 PM - Posted by Tim (not verified)

Logan wants to kill me...he`s so

July 24, 2013 | 06:36 PM - Posted by Tim (not verified)

Not taking sides here but AMD stock has been dropping like 5-8% per DAY for several days...WHAT HAPPENED ???

July 24, 2013 | 06:37 PM - Posted by Tim (not verified)

The way AMD stock is tanking , they might be out of business soon.

July 24, 2013 | 07:23 PM - Posted by Tim from Nvidia (not verified)

Yeah AMD is terrible Tim. They could never provide a feature set as good as Nivida(R) Kepler(TM) Experience.

July 24, 2013 | 10:06 PM - Posted by renz (not verified)

uh i thought we were discussing nvidia's next SoC here, not amd.

July 25, 2013 | 10:12 AM - Posted by Anonymous (not verified)

My next GPU is going to be NVIDIA...The Way It's Meant to be Played.

July 25, 2013 | 11:38 AM - Posted by Timmeh (not verified)


July 26, 2013 | 02:04 AM - Posted by ShAdOwXPR (not verified)

Any specs on the PowerVR Series 6 that's going in the next Apple devices? Apparently it's comparable to Logan but it's coming out in months, not next year...

July 29, 2013 | 02:40 AM - Posted by renz (not verified)

performance wise maybe they can compete with each other. i tried looking around but i can't find much about this: will the power vr 6 series have the same features included in mobile kepler, such as the tessellation engine and open gl 4.4 support? i haven't dug much into open gl 4.4 yet, but i heard one of its major features is making it much easier to port direct x based games to open gl.
