Analytics
11/5/2013
05:06 PM
Connect Directly
Twitter
Twitter
RSS
E-Mail
50%
50%

IT Security From The Eyes Of Data Scientists

Enterprises will increasingly employ data science experts to help drive security analytics and risk mitigation

As IT security leaders try to base more of their day-to-day decisions on statistical analysis of relevant data coming from IT infrastructure and business processes, they're running into a skills and resource gap. Often security teams have lots of specialists with deep technical knowledge of attack techniques and trends, but they frequently lack the skills to aggregate and manipulate data in order to draw meaningful conclusions from statistical trends.

As the speed and volume of security data continues to mount, so will that gap, which is why many within the industry believe that in the coming years, an IT security team will not be complete without at least one data scientist among its ranks.

"In the past, it has always been us who has been behind the game, trying to catch up with the attackers' techniques," says Dan Mitchell, product manager of data sciences for RSA, The Security Division of EMC. "I think data science gives us the opportunity to get ahead of the attackers and have them be behind for a change."

Mitchell is among a growing legion of data scientists growing active within the IT security community, and one of several that Dark Reading caught up with to get their views on the value that their colleagues bring to the table, why enterprises need to employ more, and how organizations can develop talent and embed these experts into their security practices.

The complex chain of techniques that attackers today use to infiltrate IT resources and steal data makes it absolutely critical that security teams spot trends and connect behaviors that span across IT infrastructure, user groups, and geographical locations.

In order to do that, it requires security to have experts that can manipulate data, visualize it, and draw conclusions from it. Not only that, the team needs to be able to build infrastructure to store data, normalize it, and develop modeling that can answer the burning questions security analysts have about anomalies that may indicate compromise -- and that infrastructure should preferably be designed to do it all automatically.

This is the exact kind of expertise a data scientist brings to the table, says Ram Keralapura, data scientist for Netskope, a cloud apps analytics and policy creation company, who explains that the CISO and data scientist have the opportunity to form a symbiotic relationship.

"Security officers have a very good understanding of the outcome they want and have identified their problems -- they want to know specific kinds of information about certain kinds of anomalies or activities that are happening in their enterprise, but they don't always know how to get that information," says Keralapura. "Data scientists are the right people to bridge this gap and provide the insights that these security officers need in order to make more informed decisions."

What's more, Mitchell explains that someone with his type of expertise can help break down a lot of the silos that currently exist in the security realm.

"So because the security industry has become so fractionalized in terms of specialty areas, data science offers a way to bring specific domain expertise and then combine that with things like machine learning, mathematical modeling and manipulating data to solve problems that extend across all specialties," he says. "It's really about creating the whole picture."

[How do you know if you've been breached? See Top 15 Indicators of Compromise.]

Whereas in the past a lot of the mathematical minds in security tended to gravitate toward specialties like encryption or authentication, Mitchell believes that many will be diverted into data science.

"There's so much more we can do mathematically to solve our problems," he says. "I think you're going to see more and more of that. It's a larger trend."

Many vendors have already been leading the trend of hiring and training more data scientists to develop analytics-based security products, but the role of the data scientist should also be a staple within enterprise IT security teams.

"The reason I think that businesses also have to be hiring data scientists is that in security, especially, a large component of the practice is data about your particular environment," says Michael Roytman, data scientist for Risk I/O, a vulnerability threat monitoring vendor. "A lot can be done to use that data to narrow down where you should be focusing on your security risks, and that's where an in-house data scientist plays a part."

And, says Keralapura, it really should be a full-time role. There are several big reasons for this, he says. First, in order to develop predictive models about the enterprise's specific data, data scientists need to develop long-term relationships with security experts on staff and deal with data on a day-to-day basis. Second, in order to accomplish real-time detection, they'll need to be around to help with response in real time. And, third, a full-time data scientist is crucial to helping forensics problems that could pop up at any time.

"When a problem happens, you need to look at data right away in order to identify what it was, why did it happen, how did it happen, and all of these different dimensions that need to be answered," says Keralapura. "These things keep happening all the time."

As enterprises seek out those with a data science background, there are two big skill sets they should be looking for. The most obvious is a high degree of mathematics and statistical analysis. The second is the coding chops of a hacker.

"You are going to want people that have some hacking ability to put things together quickly. A lot of it is going to be about changing the view quickly, and some developers may know how to program well in a long development cycle," says George Ng, data scientist for YarcData, a Cray company that focuses on graph analytics. "But if someone is trying to steal your data, the pattern isn't something you already have in production to look for -- it's something you develop on the fly."

Next page: The insider data scientist Ericka Chickowski specializes in coverage of information technology and business innovation. She has focused on information security for the better part of a decade and regularly writes about the security industry as a contributor to Dark Reading.  View Full Bio

Previous
1 of 2
Next
Comment  | 
Print  | 
More Insights
Register for Dark Reading Newsletters
White Papers
Cartoon
Current Issue
Dark Reading December Tech Digest
Experts weigh in on the pros and cons of end-user security training.
Flash Poll
Threat Intel Today
Threat Intel Today
The 397 respondents to our new survey buy into using intel to stay ahead of attackers: 85% say threat intelligence plays some role in their IT security strategies, and many of them subscribe to two or more third-party feeds; 10% leverage five or more.
Video
Slideshows
Twitter Feed
Dark Reading - Bug Report
Bug Report
Enterprise Vulnerabilities
From DHS/US-CERT's National Vulnerability Database
CVE-2014-6477
Published: 2014-11-23
Unspecified vulnerability in the JPublisher component in Oracle Database Server 11.1.0.7, 11.2.0.3, 11.2.0.4, 12.1.0.1, and 12.1.0.2 allows remote authenticated users to affect confidentiality via unknown vectors, a different vulnerability than CVE-2014-4290, CVE-2014-4291, CVE-2014-4292, CVE-2014-4...

CVE-2014-4807
Published: 2014-11-22
Sterling Order Management in IBM Sterling Selling and Fulfillment Suite 9.3.0 before FP8 allows remote authenticated users to cause a denial of service (CPU consumption) via a '\0' character.

CVE-2014-6183
Published: 2014-11-22
IBM Security Network Protection 5.1 before 5.1.0.0 FP13, 5.1.1 before 5.1.1.0 FP8, 5.1.2 before 5.1.2.0 FP9, 5.1.2.1 before FP5, 5.2 before 5.2.0.0 FP5, and 5.3 before 5.3.0.0 FP1 on XGS devices allows remote authenticated users to execute arbitrary commands via unspecified vectors.

CVE-2014-8626
Published: 2014-11-22
Stack-based buffer overflow in the date_from_ISO8601 function in ext/xmlrpc/libxmlrpc/xmlrpc.c in PHP before 5.2.7 allows remote attackers to cause a denial of service (application crash) or possibly execute arbitrary code by including a timezone field in a date, leading to improper XML-RPC encoding...

CVE-2014-8710
Published: 2014-11-22
The decompress_sigcomp_message function in epan/sigcomp-udvm.c in the SigComp UDVM dissector in Wireshark 1.10.x before 1.10.11 allows remote attackers to cause a denial of service (buffer over-read and application crash) via a crafted packet.

Best of the Web
Dark Reading Radio
Archived Dark Reading Radio
Now that the holiday season is about to begin both online and in stores, will this be yet another season of nonstop gifting to cybercriminals?