News
1/13/2011
03:13 PM
George Crump
George Crump
Commentary
Connect Directly
RSS
E-Mail
50%
50%

Backup Deduplication 2.0 - Integration

Deduplication has moved from a risky hard to explain technology to one that is almost expected by customers from a disk backup device. Next generation backup deduplication systems are going to require a new set of capabilities to make them more than just disk backup. They will have to integrate with the backup software, begin to provide power management, and there needs to be a greater focus on recovery performance.

Deduplication has moved from a risky hard to explain technology to one that is almost expected by customers from a disk backup device. Next generation backup deduplication systems are going to require a new set of capabilities to make them more than just disk backup. They will have to integrate with the backup software, begin to provide power management, and there needs to be a greater focus on recovery performance.Deduplication can now be delivered as either stand alone hardware or as a module to the backup software. In general a hardware appliance should have the advantage of being more globally useful, meaning you can send backup data streams from a variety of backup sources. Not only specific backup software but also dump commands within applications. The challenge is that in most cases the software has no idea what is going on behind the scenes, including when that data is being replicated or how to use that replicated data. Software deduplication, delivered as part of a module within a backup software application, has the advantage of a tighter integration. In other words the backup application knows that data is being deduped and should be able to leverage that fact.

Hardware deduplication vendors though are quickly embracing available API sets to integrate with backup applications. The most notable example today is Symantec's OpenStorage API that we detailed a while back in "A Backup API". With a backup API in place, hardware vendors can integrate with the backup application to improve performance and allow operations like replication to be controlled through the backup GUI instead of it having to be a separate process controlled through the backup appliance GUI.

Simple integration is just the beginning. As we discussed in our recent article "Integrating Disk Backup With Backup Software" we are beginning to see backup hardware suppliers create specific modules that will increase their integration capabilities with software, even if that software does not have a formal API set. We are also seeing some vendors provide backup application specific modules that can increase performance by offloading some of the data that needs to be processed by the deduplication system. In essence dividing the load up between the backup server and the disk backup appliance. We expect to see this trend continue.

Another challenge with backup deduplication hardware is power and space efficiency. A shortcoming of disk when compared to tape is clearly one of power consumption and with some disk systems the amount of data center floor space they consume. While you can throw plenty of darts at tape, one thing that can be denied is its power or density. In our next entry we will cover what needs to be improved when it comes to data center efficiency.

Track us on Twitter: http://twitter.com/storageswiss

Subscribe to our RSS feed.

George Crump is lead analyst of Storage Switzerland, an IT analyst firm focused on the storage and virtualization segments. Find Storage Switzerland's disclosure statement here.

Comment  | 
Print  | 
More Insights
Register for Dark Reading Newsletters
White Papers
Flash Poll
Current Issue
Cartoon
Threat Intel Today
Threat Intel Today
The 397 respondents to our new survey buy into using intel to stay ahead of attackers: 85% say threat intelligence plays some role in their IT security strategies, and many of them subscribe to two or more third-party feeds; 10% leverage five or more.
Video
Slideshows
Twitter Feed
Dark Reading - Bug Report
Bug Report
Enterprise Vulnerabilities
From DHS/US-CERT's National Vulnerability Database
CVE-2013-6306
Published: 2014-08-22
Unspecified vulnerability on IBM Power 7 Systems 740 before 740.70 01Ax740_121, 760 before 760.40 Ax760_078, and 770 before 770.30 01Ax770_062 allows local users to gain Service Processor privileges via unknown vectors.

CVE-2014-0232
Published: 2014-08-22
Multiple cross-site scripting (XSS) vulnerabilities in framework/common/webcommon/includes/messages.ftl in Apache OFBiz 11.04.01 before 11.04.05 and 12.04.01 before 12.04.04 allow remote attackers to inject arbitrary web script or HTML via unspecified vectors, which are not properly handled in a (1)...

CVE-2014-3525
Published: 2014-08-22
Unspecified vulnerability in Apache Traffic Server 4.2.1.1 and 5.x before 5.0.1 has unknown impact and attack vectors, possibly related to health checks.

CVE-2014-3563
Published: 2014-08-22
Multiple unspecified vulnerabilities in Salt (aka SaltStack) before 2014.1.10 allow local users to have an unspecified impact via vectors related to temporary file creation in (1) seed.py, (2) salt-ssh, or (3) salt-cloud.

CVE-2014-3587
Published: 2014-08-22
Integer overflow in the cdf_read_property_info function in cdf.c in file through 5.19, as used in the Fileinfo component in PHP before 5.4.32 and 5.5.x before 5.5.16, allows remote attackers to cause a denial of service (application crash) via a crafted CDF file. NOTE: this vulnerability exists bec...

Best of the Web
Dark Reading Radio
Archived Dark Reading Radio
Three interviews on critical embedded systems and security, recorded at Black Hat 2014 in Las Vegas.