Monday EMC announced the Data Domain Global Deduplication Appliance. I’ve had a lot to say about Data Domain’s lack of global dedupe over the years, so I’ve got to post about this.
I’ve posted a lot about the need for global deduplication here, here, and here. I believe it’s an essential feature for environments both large and small, and I’ve eagerly awaited Data Domain’s global deduplication feature for years. If I recall correctly, it’s been promised for at least three years, maybe more.
What EMC has announced (but not released) is what I would call a good step in the right direction. It’s a good step because Data Domain can finally provide dedupe across more than one appliance. I want you to hear this first, because I’m about to explain why they’ve still got a long way to go.
The reason why it’s only a step in the right direction is that it’s designed to meet a very small percentage of Data Domain’s customer base.
1. It’s only for two nodes, not multiple nodes
2. It’s only for the DD 880, the top of the DD’s liine
3. It’s only for NetBackup
4. It’s only for NetBackup users who have paid for OST
Only the very largest of EMC’s customers are going to need an 880 to start with, and only the very largest are going to need two of them. Remember that 99.7% of all businesses are small businesses. It supports NetBackup and Backup Exec, but I’d be extremely surprised if there is a single Backup Exec customer who is backing up a datacenter bigger than 140 TB, which is when you’d need a second 880. (If that’s you, I’d love to hear from you.) So essentially this product is aimed at only the largest NetBackup customers, and only those who have sprung for an OST license.
What I’ve tried to explain in previous posts is that global dedupe is not just for the largest customers. Global dedupe (AKA Multi-node dedupe) should be a standard feature on any multi-node target dedupe system. Data Domain’s Brian Biles seems to think that if you buy a smaller Data Domain box and you need more than it can supply, all you need to do is swap the head out with a bigger head. I think completely different. I think swapping a head out is a waste of money. People pay 10s of thousands of dollars (or more than $100K) just for the head. Throwing that head away and buying a new one may sound great to the company selling you the head, but to me it seems like wasted money.
So hopefully one day Data Domain will support all backup products with their global dedupe offering, and will support the feature on any node. They tell me that the code would support it, and it’s a matter of testing and support, and that they’ll do what the market tells them to do. Well, market, if you agree with me, tell them.
----- Signature and Disclaimer -----
Written by W. Curtis Preston (@wcpreston). For those of you unfamiliar with my work, I've specialized in backup & recovery since 1993. I've written the O'Reilly books on backup and have worked with a number of native and commercial tools. I am now Chief Technologist at Druva, the leading provider of cloud-based data protection and data management tools for endpoints, infrastructure, and cloud applications. These posts reflect my own opinion and are not necessarily the opinion of my employer.