Feedback

Taking an Accurate Look at SSD Write Endurance

Subject: Editorial, Storage
Manufacturer: Various
Tagged: tlc, ssd, slc, mlc, endurance

Taking an Accurate Look at SSD Write Endurance

Last year, I posted a rebuttal to a paper describing the future of flash memory as ‘bleak’. The paper went through great (and convoluted) lengths to paint a tragic picture of flash memory endurance moving forward. Yesterday a newer paper hit Slashdotthis one doing just the opposite, and going as far as to assume production flash memory handling up to 1 Million erase cycles. You’d think that since I’m constantly pushing flash memory as a viable, reliable, and super-fast successor to Hard Disks (aka 'Spinning Rust'), that I’d just sit back on this one and let it fly. After all, it helps make my argument! Well, I can’t, because if there are errors published on a topic so important to me, it’s in the interest of journalistic integrity that I must now post an equal and opposite rebuttal to this one – even if it works against my case.

First I’m going to invite you to read through the paper in question. After doing so, I’m now going to pick it apart. Unfortunately I’m crunched for time today, so I’m going to reduce my dissertation into the form of some simple bulleted points:

  • Max data write speed did not take into account 8/10 encoding, meaning 6Gb/sec = 600MB/sec, not 750MB/sec.
  • The flash *page* size (8KB) and block sizes (2MB) chosen more closely resemble that of MLC parts (not SLC – see below for why this is important).
  • The paper makes no reference to Write Amplification.

Perhaps the most glaring and significant is that all of the formulas, while correct, fail to consider the most important factor when dealing with flash memory writes – Write Amplification.

Before geting into it, I'll reference the excellent graphic that Anand put in his SSD Relapse piece:

View Full Size

SSD controllers combine smaller writes into larger ones in an attempt to speed up the effective write speed. This falls flat once all flash blocks have been written to at least once. From that point forward, the SSD must play musical chairs with the data on each and every small write. In a bad case, a single 4KB write turns into a 2MB write. For that example, Write Amplification would be a factor of 500, meaning the flash memory is cycled at 500x the rate calculated in the paper. Sure that’s an extreme example, but the point is that without referencing amplification at all, it is assumed to be a factor of 1, which would only be the case if you were only writing 2MB blocks of data to the SSD. This is almost never the case, regardless of Operating System.

After posters on Slashdot called out the author on his assumptions of rated P/E cycles, he went back and added two links to justify his figures. The problem is that the first links to a 2005 data sheet for 90nm SLC flash. Samsung’s 90nm flash was 1Gb per die (128MB). The packages were available with up to 4 dies each, and scaling up to a typical 16-chip SSD, that only gives you an 8GB SSD. Not very practical. That’s not to say 100k is an inaccurate figure for SLC endurance. It’s just a really bad reference to use is all. Here's a better one from the Flash Memory Summit a couple of years back:

View Full Size

The second link was a 2008 PR blast from Micron, based on their proposed pushing of the 34nm process to its limits. “One Million Write Cycles” was nothing more than a tag line for an achievement accomplished in a lab under ideal conditions. That figure was never reached in anything you could actually buy in a SATA SSD. A better reference would be from that same presentation at the Summit:

View Full Size

This shows larger process nodes hitting even beyond 1 million cycles (given sufficient additional error bits used for error correction), but remember it has to be something that is available and in a usable capacity to be practical for real world use, and that’s just not the case for the flash in the above chart.

At the end of the day, manufacturers must balance cost, capacity, and longevity. This forces a push towards smaller processes (for more capacity per cost), with the limit being how much endurance they are willing to give up in the process. In the end they choose based on what the customer needs. Enterprise use leans towards SLC or eMLC, as they are willing to spend more for the gain in endurance. Typical PC users get standard MLC and now even TLC, which are *good enough* for that application. It's worth noting that most SSD failures are not due to burning out all of the available flash P/E cycles. The vast majority are due to infant mortality failures of the controller or even due to buggy firmware. I've never written enough to any single consumer SSD (in normal operation) to wear out all of the flash. The closest I've come to a flash-related failure was when I had an ioDrive fail during testing by excessive heat causing a solder pad to lift on one of the flash chips.

All of this said, I’d love to see a revisit to the author’s well-structured paper – only based on the corrected assumptions I’ve outlined above. *That* is the type of paper I would reference when attempting to make *accurate* arguments for SSD endurance.

 

February 20, 2013 | 03:25 PM - Posted by Allyn Malventano

To add to this with some simple yet generic P/E figures for current gen flash memory:

 

  • SLC - 100k (34nm)
  • MLC - 10k (34nm)
  • MLC - 5k (25nm IMFT)
  • MLC - 3k (~20nm)
  • TLC - 1k (~20nm)
February 20, 2013 | 04:01 PM - Posted by Joe Peters (not verified)

The concern over endurance is way over blown in the consumer space. A 20nm TLC cell can only withstand 1k writes but if you're buying a 250GB drive (like the 840) you have plenty of bits to spread your wear across for normal workloads.

For every NAND shrink we see an increase in density which all but negates the loss in total DRIVE endurance. Sure the individual NAND cell has lower endurance but you get a lot more cells to work with.

And that's not accounting for technological breakthroughs that can help extend the life of NAND. For instance 20nm IMFT has 3k P/E cycles, just like 25nm IMFT MLC because it uses a high-K + metal gate design. Add in adaptive DSP technologies that adjust reference voltages for aging NAND and the eventual shift to 3D NAND and it's clear there's still a lot of life left in NAND.

My guess is the HDD companies are the ones trying to spread this FUD about the flash market.

March 1, 2013 | 05:33 AM - Posted by John Doe (not verified)

FYI, this guy is fucking Les Toker. He also has that fake account named "JoeComp" on [H], and uses an another fake name called "Jim Williams". He's the owner of The SSD Review and there isn't a single place on the web he hasn't SHIT on. He's dumb and retarded as hell... and wants ME to REVIEW SSD's for him...

oh and, don't buy into TLC. TLC is garbage. Yeah, surely it'll last, but what's the point when you can have a MLC SandForce drive for the same price that's faster, more endurable and will be easier to sell?

March 29, 2013 | 09:25 PM - Posted by 599GTB-765RWHP (not verified)

If he want's YOU to review drives for him, then he really must be retarded...

Come on you ignorant little bunghole, TheSSDReview is one of the most respectable sites when it comes to SSD testing!

You are a nobody, and I feel bad for you that you must verbally assault those of us who have reached places in our careers that you only dream of.
Jealous of other peoples talents, having some need to defend your quite-likely inferior computer hardware because heaven forbid you didn't get the absolute best stuff available, or sitting on your behind chastising other peoples' cars ("Why would you waste your money on a Porsche 911 GT3 RS 4.0, everyone knows the Turbo is faster..." or "Why would you buy a 2006 M3? What a stupid thing to do! I'm going to get an Audi RS4 and I'll kick your butt in a race, you'll see!"... I've heard both)

LET OTHER PEOPLE ENJOY THEIR THINGS! You're not just raining on peoples parades, you are like a hurricane made entirely of feces that whirls around the internet, trying to make other people as miserable as you feel on the inside.

February 21, 2013 | 01:02 PM - Posted by Randy Abulon (not verified)

While there are numbers for media lifespan, and the reference materials for any theoretical limit. One should keep in mind, that all of those numbers are still just best guesses by people with various credentials.

While it's highly likely that one may know someone who has had an SSD die or exhibit some failure, there has to be a number of happy SSD users that have gone beyond the date of expiration.

Personally, I've enjoyed my pair of first-gen Intel 80 GB SSDs since December of 2008. And it's followed my gaming machine to countless LAN parties and events--plugging away, every day, for at least 4-12 hour stretches--as my boot drive.

Just how many cycles are estimated to happen in an hour?

February 24, 2013 | 02:29 PM - Posted by Doug Dumitru (not verified)

Many SSDs includes SMART reporting that lets you read or approximate the average write/erase count. If you know the number of bytes written to the drive, you can actually compute wear amp backwards with this. Most of the newer windows SMART tools know how to read these values.

One issue with SSDs is not when they wear out, but how they wear out. When you go past the limit, the drive drops below minimum data error and retention specs. The error specs are likely to be invisible to you as a bit of extra error correction is transparent. The retention issues are more troubling. If you have a 3000 endurance drive at 10,000 cycles, it is likely it still seems to work OK. Then you power it down for a couple of days and it is empty. Not pleasant.

February 21, 2013 | 09:26 PM - Posted by enymouse (not verified)

Pls excuse, but, what about a windows vm (oracle's virtual box) running in debian? Seems to write about 10gigs/eight hrs
Got one on a samsung 830 120G and am starting to get concerned.
Just a basic win 7 install w/ quickbooks, firefox, antivirus
and little else.

Jay

February 23, 2013 | 02:07 PM - Posted by dreamer77dd

I am happy to see the passion in accurate information. This is another reason why i love to come to PC Perspective.com for my information.
:D

Post new comment

The content of this field is kept private and will not be shown publicly.
  • Lines and paragraphs break automatically.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd> <blockquote><p><br>
  • Web page addresses and e-mail addresses turn into links automatically.

More information about formatting options

By submitting this form, you accept the Mollom privacy policy.