In my post that compared the performance of various target deduplication vendors, I listed the Symantec NetBackup 5000 as having 7166 MB/s of throughput. That was based on their data sheet (as were all numbers in that post). What I later found out via multiple sources, including this discussion, is that this number requires a NetBackup Media Server with the media server dedupe option enabled (I originally wrote "PureDisk client" here and have since crossed that out). That is source dedupe, not target dedupe. I’m not saying source dedupe is not valid, but I am saying that performance is measured very differently in that space and it doesn’t make sense to compare the two in the same table. Even if I were to do that, I would have to add all the other source dedupe vendors.
Since Symantec chose not to publish target dedupe numbers for regular NetBackup backups, I have put an N/A in the performance column in my previous post.
Shame on you, Symantec, for not having this caveat listed in your brochure. You won’t fool me again.
Update (11/17): Symantec has clarified what was in that blog post (and posted a comment in the comment thread of that post) to say that this was media server dedupe, not client dedupe. So I have crossed out what I originally said and updated the blog entry accordingly. I’m going to republish the number, as it is equivalent to what happens with Data Domain’s Boost, where the data is deduped on the media server before getting to the appliance. But I still think that they should publish the throughput number without dedupe.
----- Signature and Disclaimer -----
Written by W. Curtis Preston (@wcpreston). For those of you unfamiliar with my work, I've specialized in backup & recovery since 1993. I've written the O'Reilly books on backup and have worked with a number of native and commercial tools. I am now Chief Technologist at Druva, the leading provider of cloud-based data protection and data management tools for endpoints, infrastructure, and cloud applications. These posts reflect my own opinion and are not necessarily the opinion of my employer.