Dark Reading is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

Attacks/Breaches

11/1/2013
01:15 PM
50%
50%

Researchers Sharpen Spear-Phishing With New Tool Leveraging Social Networks

A new tool mixes data mining with natural language processing to help pen testers create more attractive spear-phishing messages

Phishing hooks more than its share of people and organizations. But just like its homophonic counterpart, phishing can always be made easier with the right bait.

At the upcoming Black Hat Regional Summit in Brazil, Trustwave researchers Joaquim Espinhara and Ulisses Albuquerque plan to do exactly that. Using a new tool they call 'µphisher' (read as microphisher), the researchers say they have found a way to gather the digital breadcrumbs users leave on the Internet through social networks, mailing lists, online forums, and beyond.

With a mix of data mining and natural language processing [NLP], the tool can find patterns in the way a target communicates online and about what, so the information can be used to craft a more enticing attack.

"µphisher builds a database of social network status updates and makes these available for building user profiles," Albuquerque says.

Those profiles, he explains, focus on text provided by a target of interest and allow pen testers to build support data structures for the most commonly used words, as well as the people the target most frequently interacts with on social networks, hashtags, and gelocation information. With that in hand, the tool uses the information to rank how close phony content is to legitimate content produced by the target.

"We check sentence length, if the words are typically used by the target, and if the referenced users and hashtags match those actually used by [them]," Albuquerque says.

"Since different social media networks are used for different purposes ... all [social] networks are possible targets," he says. "Professional content, geolocation, pictures and movies, interacting with friends -- every one of these activities involves a different 'online persona' by the user, and the phrasing, words, and sentence length will vary wildly between content written for each of these purposes. So we don't focus on one particular social network because that would mean focusing on content which might not look legitimate on other social networks."

The tool does not try to interpret the meaning of what the user is talking about; therefore, slang, abbreviations, and other "non-standard" words would end up in its dictionary even though the natural language processing engine might not be able to categorize them properly.

"Since the tool was developed to support quick engagements, we do not want to have the consultant/penetration tester spending too much time trying to analyze and infer intention on the subject of interest," the researcher says. "We just want to help produce content that looks like it was written by the target. Thus, anything which is not proper English will be treated as noise, but will end up in our dictionaries,and will be still checked against when evaluating user-provided content."

The tool uses the official APIs for obtaining data, and in their talk the researchers plan to touch on potential legal implications of using the tool. According to Albuquerque, the user must generate the required tokens with each social network, and the tool itself does not try to be stealthy in its activities. For that reason, it may be subject to restrictions by some social networks.

"We also authenticate against the networks using the actual user identity of the person operating the tool when fetching data -- which should be enough to transfer most of the liability to them when using the tool for not-so-legitimate scenarios," he says. "We certainly do not wish it to be used as an umbrella to hide malicious users against an application-wide identity in order to harvest data from unknowing targets."

The researchers' presentation is scheduled for Nov. 26 at the summit, which will be held at the Transamerica Expo Center in Sao Paulo.

Have a comment on this story? Please click "Add Your Comment" below. If you'd like to contact Dark Reading's editors directly, send us a message. Brian Prince is a freelance writer for a number of IT security-focused publications. Prior to becoming a freelance reporter, he worked at eWEEK for five years covering not only security, but also a variety of other subjects in the tech industry. Before that, he worked as a ... View Full Bio

Comment  | 
Print  | 
More Insights
Comments
Newest First  |  Oldest First  |  Threaded View
Stop Defending Everything
Kevin Kurzawa, Senior Information Security Auditor,  2/12/2020
Small Business Security: 5 Tips on How and Where to Start
Mike Puglia, Chief Strategy Officer at Kaseya,  2/13/2020
Architectural Analysis IDs 78 Specific Risks in Machine-Learning Systems
Jai Vijayan, Contributing Writer,  2/13/2020
Register for Dark Reading Newsletters
White Papers
Video
Cartoon Contest
Current Issue
6 Emerging Cyber Threats That Enterprises Face in 2020
This Tech Digest gives an in-depth look at six emerging cyber threats that enterprises could face in 2020. Download your copy today!
Flash Poll
How Enterprises Are Developing and Maintaining Secure Applications
How Enterprises Are Developing and Maintaining Secure Applications
The concept of application security is well known, but application security testing and remediation processes remain unbalanced. Most organizations are confident in their approach to AppSec, although others seem to have no approach at all. Read this report to find out more.
Twitter Feed
Dark Reading - Bug Report
Bug Report
Enterprise Vulnerabilities
From DHS/US-CERT's National Vulnerability Database
CVE-2020-9016
PUBLISHED: 2020-02-16
Dolibarr 11.0 allows XSS via the joinfiles, topic, or code parameter, or the HTTP Referer header.
CVE-2020-9013
PUBLISHED: 2020-02-16
Arvato Skillpipe 3.0 allows attackers to bypass intended print restrictions by deleting <div id="watermark"> from the HTML source code.
CVE-2020-9007
PUBLISHED: 2020-02-16
Codoforum 4.8.8 allows self-XSS via the title of a new topic.
CVE-2020-9012
PUBLISHED: 2020-02-16
A cross-site scripting (XSS) vulnerability in the Import People functionality in Gluu Identity Configuration 4.0 allows remote attackers to inject arbitrary web script or HTML via the filename parameter.
CVE-2019-20456
PUBLISHED: 2020-02-16
Goverlan Reach Console before 9.50, Goverlan Reach Server before 3.50, and Goverlan Client Agent before 9.20.50 have an Untrusted Search Path that leads to Command Injection and Local Privilege Escalation via DLL hijacking.