


Written by W. Curtis Preston
Tuesday, 15 March 2011 01:28
It's the biggest thing that's happened in backup and recovery in a long time. I can't imagine being "the backup guy" on the other end of this story. Can you imagine the stress of being the last line of defense for gmail? Wow.
We all know the story, right? A software update bug caused somewhere between 150,000 and 500,000 gmail users (which they said was .02% of their user base) were greeted with an empty inbox one morning. Google took a few days to get everything back, and in the end, they had to resort to tapes to do it.
I'm no Google lover. I'm a fan of google.com. I used to use gmail and Google Apps to host my email, but I've since moved off and went with hosted Exchange. So I don't want anyone accusing me of being a Google fanboi, OK? So when I start talking about my thoughts, please don't suggest that the praise I send Google's way is due to any sort of loyalty, alright?
Here's what I learned via this outage:
Google is backing up gmail
I spent some time at a very large ISP a few years ago and was shocked to learn that they were not backing up user's email account. These were paid ISP subscribers' accounts and they were not backing them up. "It's just email," they told me. "Do you know how much it would cost to back that up?"
So I find it admirable that one of the things that came out of this story is that Google is backing up gmail -- even free gmail. There were no comments that said something like "Pro accounts were restored, but free gmail users were not." They backed it all up and they restored it all.
Google is backing up gmail to tape
In this world of cloud backup and disk backup, it was interesting to see that Google's last line of defense was still tape. They replicate things to multiple data centers, but at some point they back it up. And when they do, they do it to tape. The biggest reason that I can think of is that with the sheer volume of data they are dealing with, tape is absolutely the cheapest way to go.
Let me state this again: a company who is notorious for rolling their own and could totally code their own backup application and take advantage of dedupe, etc, is backing the world's most popular cloud service to tape.
It think both of these things I learned are huge. How about you?
Add comment
Comments
This is where on-board disk encryption emerged. Like tape drives they use imbedded chip to do it without speed impact.
So when technician received these disks they cannot see the data anymore.
http://gmailblog.blogspot.com/2011/02/gmail-back-soon-for-everyone.html
On the use of tape itself vs disk, I think you're right it's really just another statement that tape's not dead yet, especially in high-end environments where you never want to read the data back (because that means you've got a very big issue).
Also very nice to see a giant use tape when other options are clearly available for them, maybe this will cause a few of the disk-only proponents to think a bit.
Hope that makes sense.
Having said that, there is also a very valid place for tape in almost all backup environments.
Why wouldn't they? It's in their best interest to sell you more disk even if you could use more efficient, reliable or cost effective tech I would have thought. It's a strategy I've seen quite frequently by a number of vendors pushing disk only solutions, without naming anyone specific :)
Isn't that why storage vendors pushed VTL so hard? Anyway, it's a theory of mine that I'm happy to have rebutted
As Public Enemy put so succinctly, "Don't Believe The Hype". The push to implement disk only solutions is a strategy the vendors use to push more disk sales on the customer, but when all is said and done, it doesn't matter how many copies of the data you have, it only takes a few lines of bad code to lose decades of valuable information.
On the flipside, tape should be used only after consideration of what it is you actually need to protect and retain given your state or national legal obligations for data retention. It's unrealistic to expect a business to put every byte of data down to physical media designed to last 30 years in controlled conditions.
My 2 cents :)
RSS feed for comments to this post