For example if you do 40 full backups of a data set that changes slightly between sets, meaning that the deduplication ratio is fairly high, and then try to recover from the 3rd copy and then 37th copy. With some deduplication systems you will find a significant difference in the time it takes to recover that data between those two interations of the backup data set. This is certainly something to test in any deduplication system that you are evaluating to make sure your perspective vendor has addressed this issue. It is also something that all deduplication vendors need to keep working on to make sure their systems don't have that problem. Versus straight un-deduplicated disk, a small less than 5%, performance loss is probably acceptable but anything more could begin to significantly impact recovery windows.
The other area where recovery performance is going to become increasingly critical is as data protection solutions continue to add a recovery in place type of capability, as we discuss in our article "Virtualization Powered Recovery". In this instance you can leverage the fact that disk backup technology is in fact disk and running a server instance or other type of data set directly from the backup device is now possible. The performance focus shifts from fast streaming reads to purely random interactive reads. While no one is expecting primary storage like performance, deduplication hardware vendors need to make sure that they can handle this change in requirement from the deduplicated area or they may need to provide a non-deduplicated staging area, to at least keep that performance acceptable.
Another event that impacts recovery performance is what happens when a disk has failed on the backup deduplication system and you need to recover data while the rebuild is underway? We will address RAID data protection and how it is implemented on deduplicated systems in an upcoming entry.
Track us on Twitter: http://twitter.com/storageswiss
Subscribe to our RSS feed.
George Crump is lead analyst of Storage Switzerland, an IT analyst firm focused on the storage and virtualization segments. Find Storage Switzerland's disclosure statement here.