Skip to content

Detecting Targeted Malicious Email

Authors: Rohan M. Amin, Julie J.C.H. Ryan, and J. René van Dorp

Publication: IEEE Security and Privacy, vol. 10, no. 3, pp. 64-71, May-June 2012, doi:10.1109/MSP.2011.154

Abstract: Targeted malicious emails (TME) for computer network exploitation have become more insidious and more widely documented in recent years. Beyond spam or phishing designed to trick users into revealing personal information, TME can exploit computer networks and gather sensitive information. They can consist of coordinated and persistent campaigns that can span years. A new email-filtering technique based on email’s persistent-threat and recipient-oriented features with a random forest classifier outperforms two traditional detection methods, SpamAssassin and ClamAV, while maintaining reasonable false positive rates.

Paper is here.

Intelligence-Driven Computer Network Defense Informed by Analysis of Adversary Campaigns and Intrusion Kill Chains

Authors: Eric M. Hutchins, Michael J. Cloppert, Rohan M. Amin, Ph.D.

Conference: Paper presented at the 6th Annual International Conference on Information Warfare and Security, Washington, DC, 2011.

Abstract: Conventional network defense tools such as intrusion detection systems and anti-virus focus on the vulnerability component of risk, and traditional incident response methodology presupposes a successful intrusion.  An evolution in the goals and sophistication of computer network intrusions has rendered these approaches insufficient for certain actors.  A new class of threats, appropriately dubbed the “Advanced Persistent Threat” (APT), represents well-resourced and trained adversaries that conduct multi-year intrusion campaigns targeting highly sensitive economic, proprietary, or national security information.  These adversaries accomplish their goals using advanced tools and techniques designed to defeat most conventional computer network defense mechanisms.  Network defense techniques which leverage knowledge about these adversaries can create an intelligence feedback loop, enabling defenders to establish a state of information superiority which decreases the adversary’s likelihood of success with each subsequent intrusion attempt.  Using a kill chain model to describe phases of intrusions, mapping adversary kill chain indicators to defender courses of action, identifying patterns that link individual intrusions into broader campaigns, and understanding the iterative nature of intelligence gathering form the basis of intelligence-driven computer network defense (CND). Institutionalization of this approach reduces the likelihood of adversary success, informs network defense investment and resource prioritization, and yields relevant metrics of performance and effectiveness. The evolution of advanced persistent threats necessitates an intelligence-based model because in this model the defenders mitigate not just vulnerability, but also the threat component of risk.

Full paper is here [backup].

Dissertation: Detecting Targeted Malicious Email through Supervised Classification of Persistent Threat and Recipient Oriented Features

Abstract: Targeted email attacks to enable computer network exploitation have become more prevalent, more insidious, and more widely documented in recent years.  Beyond nuisance spam or phishing designed to trick users into revealing personal information, targeted malicious email (TME) facilitates computer network exploitation and the gathering of sensitive information from targeted networks.  These targeted email attacks are not singular unrelated events, instead they are coordinated and persistent attack campaigns that can span years.  This dissertation surveys and categorizes existing email filtering techniques, proposes and implements new methods for detecting targeted malicious email and compares these newly developed techniques to traditional detection methods.  Current research and commercial methods for detecting illegitimate email are limited to addressing Internet scale email abuse, such as spam, but not focused on addressing targeted malicious emails.  Furthermore, conventional tools such as anti-virus are vulnerability focused examining only the binary code of an email but ignoring all relevant contextual metadata.

This study first documents the existence of TME and characterizes it as a form of malicious email attack different than spam, phishing and other conventional illegitimate email.  The quantitative research is conducted by analyzing email data from a large Fortune 500 company that has been subjected to these targeted emails.  Persistent threat features, such as threat actor locale and weaponization tools, along with recipient oriented features, such as reputation and role, are leveraged with supervised data classification algorithms to demonstrate new techniques for detection of targeted malicious email.  The specific tools, techniques, procedures, and infrastructure that a threat actor uses characterize the level and capability of a threat; the recipient’s role and repeated targeting speak to the intent of the threat.  Both sets of features are used in a random forest classifier to separate targeted malicious email from non-targeted malicious email.  Performance of this data classifier is measured and compared to conventional email filtering techniques to demonstrate the added benefit of including these features.  Performance evaluations are focused on false negative reduction since the cost of missing a targeted malicious email is far greater than the cost of mistakenly flagging a legitimate email as malicious.

Several findings are made in this study.  First, targeted malicious email demonstrates association to persistent threat features as compared to non-targeted malicious email that does not.  Second, targeted malicious email demonstrates association to recipient oriented features as compared to non-targeted malicious email that does not.  Finally, detection of targeted malicious email using persistent threat and recipient oriented features results in significantly fewer false negatives than detection of targeted malicious email using conventional email filtering techniques.  This improvement in false negative rates comes with acceptable false positive rates.

Future research can expand upon the features introduced in this study.  For example, additional persistent threat features can be harvested from file level metadata (e.g. author names, document path locations) and additional recipient oriented features can be incorporated from organization databases.  In this study, a binary outcome is defined: emails are either targeted malicious or non-targeted malicious.  Future work can explore multi-class outcomes that pair specific threat actor campaigns and targeted recipients.

Full Dissertation Here: Dissertation (ProQuest) or Dissertation (rohanamin.com)