ABSTRACT
In this paper, expanded from an invited talk at AISec 2010, we discuss an emerging field of study: adversarial machine learning, the study of effective machine learning techniques against an adversarial opponent. We give a taxonomy for classifying attacks against online machine learning algorithms; discuss application-specific factors that limit an adversary's capabilities; introduce two models of adversarial capabilities; delineate the limits of an adversary's knowledge of the learning algorithm, feature space, and training and input data; explore vulnerabilities in machine learning algorithms; discuss countermeasures against attacks; introduce the evasion challenge; and discuss privacy-preserving learning techniques.
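To make the class of "causative" (poisoning) attacks discussed in the paper concrete, the following toy sketch shows an attacker subverting a simple online anomaly detector. The detector, its parameters, and the attack strategy are illustrative assumptions, not taken from the paper: a centroid-based detector retrains on every observed point, and the attacker injects points just inside the acceptance boundary to gradually drag the centroid toward a previously rejected attack point.

```python
import numpy as np

class CentroidDetector:
    """Toy online anomaly detector: flags points far from the running mean."""

    def __init__(self, dim, radius):
        self.centroid = np.zeros(dim)
        self.n = 0
        self.radius = radius

    def train(self, x):
        # Incremental mean update; the attacker exploits exactly this rule.
        self.n += 1
        self.centroid += (x - self.centroid) / self.n

    def is_normal(self, x):
        return np.linalg.norm(x - self.centroid) <= self.radius

rng = np.random.default_rng(0)
det = CentroidDetector(dim=2, radius=2.0)
for _ in range(100):                      # benign traffic near the origin
    det.train(0.1 * rng.standard_normal(2))

attack = np.array([10.0, 0.0])
assert not det.is_normal(attack)          # initially rejected

# Causative attack: each poison point sits just inside the boundary in the
# direction of the attack point, so every retraining step shifts the
# centroid a little further toward it.
steps = 0
while not det.is_normal(attack):
    d = attack - det.centroid
    det.train(det.centroid + 0.99 * det.radius * d / np.linalg.norm(d))
    steps += 1

print(f"attack point accepted after {steps} poison points")
```

Because each injected point moves the centroid by at most radius/n, the attack requires many queries, which is why the paper's taxonomy treats the attacker's capabilities and access to retraining as key limiting factors.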
Index Terms
- Adversarial machine learning