08:23 PM
Adrian Lane

Security Implications Of Big Data Strategies

Big data can introduce new access challenges -- and new security vulnerabilities -- to your organization. Here are some tips for keeping data safe

[Excerpted from "Security Implications Of Big Data Strategies," a new report posted this week on Dark Reading's Database Security Tech Center.]

The term "big data" is often misunderstood. In fact, it has been used so often, by so many people, to push such specific agendas that the term has become almost meaningless.

Yes, big data is storing and processing very large data sets. However, it embodies a lot more than that.

When trying to get a handle on big data, it's helpful to consider it more an idea than a specific size or technology. In its simplest terms, the big data phenomenon is driven by the intersection of three trends: mountains of data that contain valuable information, the abundance of cheap commodity computing resources and virtually free analytics tools. When talking about security of big data environments, it's the last item -- virtually free analytics tools -- that often raises security concerns.

As of this writing, there are more than 120 variations of big data management systems focusing on different data types (for example, geolocation data, documents and tuple storage).

These systems use many different query models; different data storage models; and different task management, orchestration and resource management tools. While big data is often described as anti-relational (as shown by the term "NoSQL"), that concept also fails to capture the essence of big data.

It's true that big data implementations cast off many of the core features of relational databases to get around the associated performance issues, but make no mistake: Some big data environments offer relational structures, transactional consistency and structured query processing.

Since conventional definitions fail to capture the essence of big data, think about it in terms of the key elements that make up big data environments. They use many nodes for distributed data storage and management.

They store multiple copies of data, "sharding" pieces of data across multiple nodes. This provides fail-safe operation in the event any single node fails, and it means that queries move to the data, where processing resources are available. It is this distributed cluster of data nodes, cooperating with each other to handle data management and data queries, that makes big data different from "big iron."
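To make the idea of sharding with multiple copies concrete, here is a minimal Python sketch. It is not taken from the report or from any particular big data product: the function names, the node names and the default of three copies are all assumptions chosen for illustration, and real systems use more elaborate placement schemes.

```python
import hashlib
from typing import Iterable, List


def place_replicas(key: str, nodes: List[str], replicas: int = 3) -> List[str]:
    """Return the nodes that should hold copies of `key` (hypothetical scheme)."""
    if not nodes:
        raise ValueError("cluster has no nodes")
    replicas = min(replicas, len(nodes))
    # Hash the key so the starting node is deterministic for a given key.
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    start = int.from_bytes(digest[:8], "big") % len(nodes)
    # Walk the node list from that point so each copy lands on a different node.
    return [nodes[(start + i) % len(nodes)] for i in range(replicas)]


def readable_from(key: str, nodes: List[str], failed: Iterable[str],
                  replicas: int = 3) -> List[str]:
    """Return the nodes that can still serve `key` after `failed` nodes go down."""
    down = set(failed)
    return [n for n in place_replicas(key, nodes, replicas) if n not in down]


if __name__ == "__main__":
    cluster = ["node-a", "node-b", "node-c", "node-d"]  # illustrative names
    print(place_replicas("customer:1001", cluster))
    print(readable_from("customer:1001", cluster, failed=["node-b"]))
```

Because every key lives on several distinct nodes, losing any one node still leaves at least two copies available in this sketch, and a query for a key can be sent to whichever surviving node holds it.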

The essential characteristics of big data -- volume, data velocity, distributed architecture and parallel processing -- are the things that allow it to handle data management and processing requirements that outstrip previous data management systems. They are also what make securing these systems all the more difficult. The clusters are somewhat open and self-organizing, and they allow users to communicate with multiple data nodes simultaneously.

Validating which data nodes and which clients should have access to information is difficult. The elastic nature of big data means new nodes are automatically meshed into the cluster, sharing data and query results to handle client tasks.
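One way to picture node validation is a cluster that admits a new node only if it presents a token derived from a shared secret. The Python sketch below is purely illustrative and is not described in the report: the function names, the shared-secret scheme and the node identifiers are assumptions, and production clusters typically rely on established mechanisms such as Kerberos or mutual TLS rather than a hand-rolled check like this.

```python
import hashlib
import hmac
from typing import Set


def issue_join_token(cluster_secret: bytes, node_id: str) -> str:
    """Derive a per-node token from a secret known only to the cluster (assumed scheme)."""
    return hmac.new(cluster_secret, node_id.encode("utf-8"), hashlib.sha256).hexdigest()


def admit_node(cluster_secret: bytes, node_id: str, token: str, members: Set[str]) -> bool:
    """Add `node_id` to `members` only if its token matches the expected value."""
    expected = issue_join_token(cluster_secret, node_id)
    # compare_digest avoids leaking how many characters matched through timing.
    if not hmac.compare_digest(expected.encode("utf-8"), token.encode("utf-8")):
        return False
    members.add(node_id)
    return True


if __name__ == "__main__":
    secret = b"example-only-secret"  # hypothetical; never hard-code real secrets
    members: Set[str] = {"node-a", "node-b"}
    good_token = issue_join_token(secret, "node-c")
    print(admit_node(secret, "node-c", good_token, members))    # legitimate node
    print(admit_node(secret, "rogue-node", good_token, members))  # token reuse
```

The token is bound to the node identifier, so a token issued for one node does not admit a different one. Without some check of this kind, an elastic cluster that automatically meshes in new nodes has no basis for deciding which of them should see data.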

In the mad race to do more with big data -- to add new features and push the boundaries of scalability -- the vast majority of development resources go toward improving scalability, ease of use and analysis capabilities.

A very low percentage of resources goes into adding security features. But you want security features embedded in the big data platforms. You want developers to be able to enable features as needed during the design and deployment phases. You want security to be just as scalable, high-performance and self-organizing as the clusters are. The problem is that the available security products typically aren't included with open source systems or the majority of commercial bundles.

To find out more about the key security components behind big data -- and for a list of myths about big data, as well as seven key tips on securing it -- download the free report on big data security.

Adrian Lane is a Security Strategist and brings over 25 years of industry experience to the Securosis team, much of it at the executive level. Adrian specializes in database security, data security, and secure software development. With experience at Ingres, Oracle, and ...
