Analytics // Security Monitoring
4/19/2013
09:56 PM
50%
50%

Machine Learning Susses Out Social-Network Fraud

Machine-learning techniques can be used to detect fraud and spies on social networks based on certain features, such as the number of followers and devices used to access the network

Certain characteristics of social-network accounts have a high correlation with fraud and can be used to differentiate between real and fake accounts, a researcher presenting at the SOURCE Boston Conference said this week.

Using machine-learning techniques, Vicente Diaz, a senior security analyst with security software firm Kaspersky Lab, found that seven characteristics of Twitter profiles could identify fraudulent accounts 91 percent of the time. The number of devices from which a user accesses the service, the ratio of followers to people following an account, the average number of tweets to each person, and the number of tweets to an unknown receiver are all features that correlate strongly to fraudulent accounts, he says.

"Surprisingly, it was quite easy to identify the malicious profiles," Diaz says. "The most important thing for you to keep in mind is to select a smart set of features."

As social networks have become more popular, the questionable uses of the highly connected social circles has rapidly grown. Twitter, Facebook, and other networks are increasingly used for spam as a way to gather information on users and to distribute malware. Fake accounts controlled by a single user are an essential part of the fraud schemes. In many cases, the accounts are created, built up, and then harvested and sold off, says Chris Porter, principal with Verizon's RISK group.

"They are trying to amass a lot of followers, and then they will sell the account off to other parties," Porter says. "Organized crime could use that information to pump links to potential victims to spread malware or other schemes."

Last year, Facebook estimated that almost 9 percent of the accounts on its network are fraudulent. One security firm, Impermium, has put the number much higher: As many as [Using the social context of posts, researchers from UC Riverside create prototype Facebook app that detects social malware with 97 percent accuracy. See Application Detects Social Network Spam, Malware.]

Yet while spam and fraud are a problem, malware is far less of an issue, according to Palo Alto Networks, a network security provider. In a recent report, the company found that while 20 percent of average traffic on the network is due to social networks, malicious traffic emanating from social networks accounted for less than 0.2 percent of all threat logs.

"Facebook and Twitter have some pretty good security teams in house," says Michael Sutton, vice president of research for Web-security firm Zscaler. "Most of the scams that we see have a shelf life of about an hour. They are doing a good job."

Kaspersky's Diaz agreed. By looking at almost 13,500 profiles with 6.5 million relationships, the researcher found that most fake accounts were engaged in fraud -- not malicious attacks. The researchers used a number of machine learning techniques, such as radial-basis function (RBF) neural networks and Bayesian filtering, to recognize patterns, including 36 malicious campaigns.

In addition, he found that he could identify hacked accounts with an 80 percent accuracy rate. By monitoring accounts using a small time window, Diaz's system could detect changes in behavior that indicated an account had become compromised.

To continue his research, Diaz wants to apply his models to the large population of accounts that follow corporations and celebrities, trying to find fake accounts and understand how fraudsters use the popular crowd to fuel their own schemes.

"It will be interesting to see how many followers are real and how many are fake profiles used to boost the apparent fans of a celebrity," Diaz says.

Have a comment on this story? Please click "Add Your Comment" below. If you'd like to contact Dark Reading's editors directly, send us a message. Robert Lemos is a veteran technology journalist of more than 16 years and a former research engineer, writing articles that have appeared in Business Week, CIO Magazine, CNET News.com, Computing Japan, CSO Magazine, Dark Reading, eWEEK, InfoWorld, MIT's Technology Review, ... View Full Bio

Comment  | 
Print  | 
More Insights
Register for Dark Reading Newsletters
White Papers
Cartoon
Current Issue
Dark Reading Tech Digest, Dec. 19, 2014
Software-defined networking can be a net plus for security. The key: Work with the network team to implement gradually, test as you go, and take the opportunity to overhaul your security strategy.
Flash Poll
Video
Slideshows
Twitter Feed
Dark Reading - Bug Report
Bug Report
Enterprise Vulnerabilities
From DHS/US-CERT's National Vulnerability Database
CVE-2014-8142
Published: 2014-12-20
Use-after-free vulnerability in the process_nested_data function in ext/standard/var_unserializer.re in PHP before 5.4.36, 5.5.x before 5.5.20, and 5.6.x before 5.6.4 allows remote attackers to execute arbitrary code via a crafted unserialize call that leverages improper handling of duplicate keys w...

CVE-2013-4440
Published: 2014-12-19
Password Generator (aka Pwgen) before 2.07 generates weak non-tty passwords, which makes it easier for context-dependent attackers to guess the password via a brute-force attack.

CVE-2013-4442
Published: 2014-12-19
Password Generator (aka Pwgen) before 2.07 uses weak pseudo generated numbers when /dev/urandom is unavailable, which makes it easier for context-dependent attackers to guess the numbers.

CVE-2013-7401
Published: 2014-12-19
The parse_request function in request.c in c-icap 0.2.x allows remote attackers to cause a denial of service (crash) via a URI without a " " or "?" character in an ICAP request, as demonstrated by use of the OPTIONS method.

CVE-2014-2026
Published: 2014-12-19
Cross-site scripting (XSS) vulnerability in the search functionality in United Planet Intrexx Professional before 5.2 Online Update 0905 and 6.x before 6.0 Online Update 10 allows remote attackers to inject arbitrary web script or HTML via the request parameter.

Best of the Web
Dark Reading Radio
Archived Dark Reading Radio
Join us Wednesday, Dec. 17 at 1 p.m. Eastern Time to hear what employers are really looking for in a chief information security officer -- it may not be what you think.