Dark Reading is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

News

6/8/2010
10:25 AM
George Crump
George Crump
Commentary
50%
50%

Does Deduplication Make Storage Capacity Planning Difficult?

With all the technologies out now, and it not just deduplication, to optimize the use of primary storage capacity, the guidelines for how you estimate how much capacity you need in a given year needs to change. In some ways storage capacity planning is more difficult than it has been in the past. It has to change to keep up with the new capabilities of storage systems like thin provisioning, compression and deduplication.

With all the technologies out now, and it not just deduplication, to optimize the use of primary storage capacity, the guidelines for how you estimate how much capacity you need in a given year needs to change. In some ways storage capacity planning is more difficult than it has been in the past. It has to change to keep up with the new capabilities of storage systems like thin provisioning, compression and deduplication.Storage capacity planning of a few years ago seems like a relatively simple task compared with the capacity planning of today. You estimated the amount of capacity that you were going to need based on organic growth and new application needs, then doubled that number and ordered the storage. In many cases no one batted an eye to the process. If you apply that same logic today you may end of with 50% or more of your capacity purchase never being used. In fact several vendors are claiming, and even guaranteeing, that you will need less storage if you replace your current storage solution with theirs.

You could continue to use the old math when calculating storage capacity needs and enjoy all the extra free capacity. It is important to remember though that storage is not wine, it does not get more expensive with age, unused capacity is wasted budget dollars as well as power and cooling. The time has come to factor all these techniques into your next capacity or even storage system upgrade. Of these capabilities deduplication, compression and thin provisioning probably will have the most impact.

Primary storage deduplication has been discounted by some in the industry. There are concerns about performance impact and data integrity; both those concerns are technology issues and either are or will be overcome. Some solutions are now claiming micro seconds of latency and not altering the data format. The other and more legitimate concern is how much duplicate data do you really have on primary storage. In the past I would say this is a valid concern, until server and desktop virtualization. Now there can be TB's and TB's of redundant data on the system. Deduplication can address that problem and result in massive savings. Estimating how much deduplication should factor into your capacity planning is difficult. If the environment is going to be heavy on the virtualization side, I would suggest at least a 3:1 reduction in the amount of storage you were going to purchase maybe more.

Compression is another optimization technique to consider. Compression gains optimization across almost all files, it does not require duplicate data. The data does need to be compressible of course but in almost every case the net is at least a 2:1 gain. In most cases compression is not an inhibitor to deduplication, most of the solutions work together, some are even integrated.

Thin provisioning helps in an area that deduplication and compression do not, capacity that is allocated but not in use. Essentially storage that is captive to a particular server. You can't compress or deduplicate something that is not there. The only way to optimize this capacity is to free it from being bound to a particular server. As we discuss in our Thin Provisioning White Paper, the technology is no longer limited to optimizing new application deployment but also to ongoing application use. Modern thin provisioning technology can reclaim deleted space from volumes as well.

In our next entry we will discuss how to roll all this information together to plan your next capacity upgrade or to plan a new storage system purchase.

Track us on Twitter: http://twitter.com/storageswiss

Subscribe to our RSS feed.

George Crump is lead analyst of Storage Switzerland, an IT analyst firm focused on the storage and virtualization segments. Find Storage Switzerland's disclosure statement here.

Comment  | 
Print  | 
More Insights
Comments
Newest First  |  Oldest First  |  Threaded View
Stop Defending Everything
Kevin Kurzawa, Senior Information Security Auditor,  2/12/2020
Small Business Security: 5 Tips on How and Where to Start
Mike Puglia, Chief Strategy Officer at Kaseya,  2/13/2020
5 Common Errors That Allow Attackers to Go Undetected
Matt Middleton-Leal, General Manager and Chief Security Strategist, Netwrix,  2/12/2020
Register for Dark Reading Newsletters
White Papers
Video
Cartoon
Current Issue
6 Emerging Cyber Threats That Enterprises Face in 2020
This Tech Digest gives an in-depth look at six emerging cyber threats that enterprises could face in 2020. Download your copy today!
Flash Poll
How Enterprises Are Developing and Maintaining Secure Applications
How Enterprises Are Developing and Maintaining Secure Applications
The concept of application security is well known, but application security testing and remediation processes remain unbalanced. Most organizations are confident in their approach to AppSec, although others seem to have no approach at all. Read this report to find out more.
Twitter Feed
Dark Reading - Bug Report
Bug Report
Enterprise Vulnerabilities
From DHS/US-CERT's National Vulnerability Database
CVE-2019-20477
PUBLISHED: 2020-02-19
PyYAML 5.1 through 5.1.2 has insufficient restrictions on the load and load_all functions because of a class deserialization issue, e.g., Popen is a class in the subprocess module. NOTE: this issue exists because of an incomplete fix for CVE-2017-18342.
CVE-2019-20478
PUBLISHED: 2020-02-19
In ruamel.yaml through 0.16.7, the load method allows remote code execution if the application calls this method with an untrusted argument. In other words, this issue affects developers who are unaware of the need to use methods such as safe_load in these use cases.
CVE-2011-2054
PUBLISHED: 2020-02-19
A vulnerability in the Cisco ASA that could allow a remote attacker to successfully authenticate using the Cisco AnyConnect VPN client if the Secondary Authentication type is LDAP and the password is left blank, providing the primary credentials are correct. The vulnerabilities is due to improper in...
CVE-2015-0749
PUBLISHED: 2020-02-19
A vulnerability in Cisco Unified Communications Manager could allow an unauthenticated, remote attacker to conduct a cross-site scripting (XSS) attack on the affected software. The vulnerabilities is due to improper input validation of certain parameters passed to the affected software. An attacker ...
CVE-2015-9543
PUBLISHED: 2020-02-19
An issue was discovered in OpenStack Nova before 18.2.4, 19.x before 19.1.0, and 20.x before 20.1.0. It can leak consoleauth tokens into log files. An attacker with read access to the service's logs may obtain tokens used for console access. All Nova setups using novncproxy are affected. This is rel...