Dark Reading is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

Operational Security

8/16/2017
12:45 PM
Andy Patrizio
Andy Patrizio
Andy Patrizio
50%
50%

Will GDPR Be the Death of Big Data?

The EU's General Data Protection Regulation (GDPR) will make the landscape shift for big data users around the world.

The Law of Unintended Consequences says there are unforeseen, unintended outcomes to purposeful actions. Companies working in Europe are about to get a lesson in that when the General Data Protection Regulation (GDPR) goes into effect in May 2018.

GDPR brings tough new rules mandating data handling, transparency, highly regulated usage policies, and consumer-friendly privacy terms for EU citizens. Any company that wants to do business with European residents will need to comply with GDPR or face stiff financial penalties.

Since it hasn’t gone into effect yet, we don't know all of the unintended effects, but one can be seen coming a way off: the impact of GDPR on big data and analytics projects.

GDPR is different from American regulations in that it is overarching of all industries, whereas in the US, regulation of content and data varies from one industry to the next. Health care and banking are subject to very strict rules, while retail is more freewheeling.

GDPR gives European consumers the power to control how their individual data is gathered and used. More important, it gives them the right to demand changes to their data, including removal. In big data scenarios, companies are used to doing whatever they want with the data they collect and rarely go back and make changes.

So, can you imagine the chaos, not to mention demand on resources, when Europeans start demanding changes or removal of their information from data stores?

Also, companies must be able to assess whether the data is being used in a manner that has consent from the owner and is acquired in the proper way. Companies will no longer be able to collect data from people for one reason, and then use it for a different reason. A company cannot collect sales information and then use it to predict future buying patterns, for example.

GDPR gives European citizens incredible influence and control over their personal data, and it puts short time limits and answering their requests, so you need to know where that data is quickly. GDPR gives citizens:

  • The right to be forgotten and have their data erased
  • Access to their information, so they know exactly what data is being processed where and for what purpose
  • The right to receive a copy of the personal data concerning them
  • The right to question and challenge decisions that affect them that have been made on a purely algorithmic basis

If you are running real-time analytics, do you really want to have to drop everything and answer these requests? Well, you will. But if all teams are aligned with GDPR compliance, you can minimize the pain of consumer requests. Communication between teams is key here, and inter-office communication in some companies is notoriously bad. Perhaps those EU fines will motivate your people.

In a white paper entitled "Five Essential Pillars of Big Data GDPR Compliance," data science platform developer Dataiku argues that GDPR doesn't mean the end of data science, but companies will have to develop a more controlled method of data collection so they don’t get in trouble with the new regulations.

The changes in GDPR will certainly require shifts in organizational structure and processes, most notably staffing, says Dataiku. New data governance rules will have to be implemented across the entire company, from IT to marketing to customer support.


Track the heartbeat of the virtualization movement with Light Reading at the NFV & Carrier SDN event in Denver. There's still time to register for this exclusive opportunity to learn from and network with industry experts -- communications service providers get in free!

Organizations will need to take stock of where all data is stored and ensure that it is accessible to make request changes. Data team leaders should be able to easily understand and audit data sources, who has access to what, and what sources are being used for which projects.

This means keeping all of the data in a single, centralized store, and big data doesn't work that way. It frequently keeps multiple data stores from multiple sources. You don't have to change your data storage methods with GDPR but it sure will be a lot easier if everything is in one place.

In a way, GDPR might force you to clean up your data. Rather than just blindly sucking up everything and filling your data lakes, you will be forced to practice "good data hygiene," as it were. In the end, if you know your data better to comply with the overtly intrusive nature of GDPR -- and let's face it, it really does stick its nose way into your business -- you will have better data to work with.

GDPR means the end of anything-goes data collection, but it doesn't have to mean the end of data gathering and analytics. If done right, it could result in better analytics as you keep your data clean and relevant.

Related posts:

— Andy Patrizio has been a technology journalist for more than 20 years and remembers back when Internet access was only available through his college mainframe. He has written for InformationWeek, Byte, Dr. Dobb's Journal, eWeek, Computerworld and Network World.

Comment  | 
Print  | 
More Insights
Comments
Newest First  |  Oldest First  |  Threaded View
Edge-DRsplash-10-edge-articles
I Smell a RAT! New Cybersecurity Threats for the Crypto Industry
David Trepp, Partner, IT Assurance with accounting and advisory firm BPM LLP,  7/9/2021
News
Attacks on Kaseya Servers Led to Ransomware in Less Than 2 Hours
Robert Lemos, Contributing Writer,  7/7/2021
Commentary
It's in the Game (but It Shouldn't Be)
Tal Memran, Cybersecurity Expert, CYE,  7/9/2021
Register for Dark Reading Newsletters
White Papers
Video
Cartoon
Current Issue
How Enterprises are Attacking the Cybersecurity Problem
Concerns over supply chain vulnerabilities and attack visibility drove some significant changes in enterprise cybersecurity strategies over the past year. Dark Reading's 2021 Strategic Security Survey showed that many organizations are staying the course regarding the use of a mix of attack prevention and threat detection technologies and practices for dealing with cyber threats.
Flash Poll
Twitter Feed
Dark Reading - Bug Report
Bug Report
Enterprise Vulnerabilities
From DHS/US-CERT's National Vulnerability Database
CVE-2021-3454
PUBLISHED: 2021-10-19
Truncated L2CAP K-frame causes assertion failure. Zephyr versions >= 2.4.0, >= v.2.50 contain Improper Handling of Length Parameter Inconsistency (CWE-130), Reachable Assertion (CWE-617). For more information, see https://github.com/zephyrproject-rtos/zephyr/security/advisories/GHSA-fx88-6c29-...
CVE-2021-3455
PUBLISHED: 2021-10-19
Disconnecting L2CAP channel right after invalid ATT request leads freeze. Zephyr versions >= 2.4.0, >= 2.5.0 contain Use After Free (CWE-416). For more information, see https://github.com/zephyrproject-rtos/zephyr/security/advisories/GHSA-7g38-3x9v-v7vp
CVE-2021-41150
PUBLISHED: 2021-10-19
Tough provides a set of Rust libraries and tools for using and generating the update framework (TUF) repositories. The tough library, prior to 0.12.0, does not properly sanitize delegated role names when caching a repository, or when loading a repository from the filesystem. When the repository is c...
CVE-2021-31378
PUBLISHED: 2021-10-19
In broadband environments, including but not limited to Enhanced Subscriber Management, (CHAP, PPP, DHCP, etc.), on Juniper Networks Junos OS devices where RADIUS servers are configured for managing subscriber access and a subscriber is logged in and then requests to logout, the subscriber may be fo...
CVE-2021-31379
PUBLISHED: 2021-10-19
An Incorrect Behavior Order vulnerability in the MAP-E automatic tunneling mechanism of Juniper Networks Junos OS allows an attacker to send certain malformed IPv4 or IPv6 packets to cause a Denial of Service (DoS) to the PFE on the device which is disabled as a result of the processing of these pac...