research-article

Open Access

Concerto: a framework for combined concrete and abstract interpretation

Authors:
John Toman

University of Washington, USA

University of Washington, USA
View Profile

,
Dan Grossman

University of Washington, USA

University of Washington, USA
View Profile

Proceedings of the ACM on Programming Languages Volume 3 Issue POPLArticle No.: 43pp 1–29https://doi.org/10.1145/3290356

Published:02 January 2019Publication History

Related Artifact: Artifact for Concerto: A Framework for Combined Concrete and Abstract Interpretation November 2018 software https://doi.org/10.1145/3291640

Proceedings of the ACM on Programming Languages

Abstract

Abstract interpretation promises sound but computable static summarization of program behavior. However, modern software engineering practices pose significant challenges to this vision, specifically the extensive use of frameworks and complex libraries. Frameworks heavily use reflection, metaprogramming, and multiple layers of abstraction, all of which confound even state-of-the-art abstract interpreters. Sound but conservative analysis of frameworks is impractically imprecise, and unsoundly ignoring reflection and metaprogramming is untenable given the prevalence of these features. Manually modeling framework behaviors offers excellent precision, at the cost of immense effort by the tool designer.

To overcome the above difficulties, we present Concerto, a system for analyzing framework-based applications by soundly combining concrete and abstract interpretation. Concerto analyzes framework implementations using concrete interpretation, and application code using abstract interpretation. This technique is possible in practice as framework implementations typically follow a single path of execution when provided a concrete, application-specific configuration file which is often available at analysis time. Concerto exploits this configuration information to precisely resolve reflection and other metaprogramming idioms during concrete execution. In contrast, application code may have infinitely many paths of execution, so Concerto switches to abstract interpretation to analyze application code. Concerto is an analysis framework, and can be instantiated with any abstract interpretation that satisfies a small set of preconditions. In addition, unlike manual modeling, Concerto is not specialized to any specific framework implementation. We have formalized our approach and proved several important properties including soundness and termination. In addition, we have implemented an initial proof of concept prototype of Concerto for a subset of Java, and found that our combined interpretation significantly improves analysis precision and performance.

Supplemental Material

a43-toman.webm

webm

83.1 MB

Download

Available for Download

zip

popl19main-p6-p-aux.zip (844 KB)

Supplemental appendices containing proofs for the theorems of the paper.

References

Steven Arzt, Siegfried Rasthofer, Christian Fritz, Eric Bodden, Alexandre Bartel, Jacques Klein, Yves Le Traon, Damien Octeau, and Patrick McDaniel. 2014. FlowDroid: Precise Context, Flow, Field, Object-sensitive and Lifecycle-aware Taint Analysis for Android Apps. In PLDI. Google ScholarDigital Library
Thanassis Avgerinos, Alexandre Rebert, Sang Kil Cha, and David Brumley. 2014. Enhancing symbolic execution with veritesting. In ICSE. Google ScholarDigital Library
Davide Balzarotti, Marco Cova, Vika Felmetsger, Nenad Jovanovic, Engin Kirda, Christopher Kruegel, and Giovanni Vigna. 2008. Saner: Composing static and dynamic analysis to validate sanitization in web applications. In Symposium on Security and Privacy . Google ScholarDigital Library
Paulo Barros, René Just, Suzanne Millstein, Paul Vines, Werner Dietl, Michael D Ernst, et al. 2015. Static analysis of implicit control flow: Resolving Java reflection and Android intents. In ASE.Google Scholar
Sam Blackshear, Alexandra Gendreau, and Bor-Yuh Evan Chang. 2015. Droidel: A general approach to android framework modeling. In SOAP. Google ScholarDigital Library
Eric Bodden, Andreas Sewe, Jan Sinschek, Hela Oueslati, and Mira Mezini. 2011. Taming reflection: Aiding static analysis in the presence of reflection and custom class loaders. In ICSE. Google ScholarDigital Library
François Bourdoncle. 1993. Efficient chaotic iteration strategies with widenings. In Formal Methods in Programming and their Applications .Google Scholar
Guillaume Brat, Klaus Havelund, SeungJoon Park, and Willem Visser. 2000. Java PathFinder-second generation of a Java model checker. In Workshop on Advances in Verification.Google Scholar
Jörg Brauer, Thomas Noll, and Bastian Schlich. 2010. Interval analysis of microcontroller code using abstract interpretation of hardware and software. In Workshop on Software & Compilers for Embedded Systems. Google ScholarDigital Library
Matt Brown and Jens Palsberg. 2017. Jones-optimal partial evaluation by specialization-safe normalization. Proc. ACM Program. Lang. 2, POPL (2017), 14. Google ScholarDigital Library
Cristian Cadar, Daniel Dunbar, Dawson R Engler, et al. 2008. KLEE: Unassisted and Automatic Generation of High-Coverage Tests for Complex Systems Programs. In OSDI. Google ScholarDigital Library
Vitaly Chipounov, Volodymyr Kuznetsov, and George Candea. 2011. S2E: A Platform for In-vivo Multi-path Analysis of Software Systems. In ASPLOS. Google ScholarDigital Library
Ravi Chugh, Jeffrey A. Meister, Ranjit Jhala, and Sorin Lerner. 2009. Staged Information Flow for Javascript. In PLDI. Google ScholarDigital Library
Patrick Cousot. 1977. Asynchronous iterative methods for solving a fixed point system of monotone equations in a complete lattice. Res. rep. RR 88 (1977).Google Scholar
Patrick Cousot and Radhia Cousot. 1977. Abstract interpretation: a unified lattice model for static analysis of programs by construction or approximation of fixpoints. In POPL. Google ScholarDigital Library
Patrick Cousot and Radhia Cousot. 1979a. Constructive versions of Tarski’s fixed point theorems. Pacific journal of Mathematics 82, 1 (1979).Google Scholar
Patrick Cousot and Radhia Cousot. 1979b. Systematic design of program analysis frameworks. In POPL. ACM. Google ScholarDigital Library
Patrick Cousot and Radhia Cousot. 1992a. Abstract interpretation and application to logic programs. The Journal of Logic Programming 13, 2-3 (1992), 103–179. Google ScholarDigital Library
Patrick Cousot and Radhia Cousot. 1992b. Abstract interpretation frameworks. Journal of logic and computation 2, 4 (1992).Google ScholarCross Ref
Patrick Cousot, Radhia Cousot, Jérôme Feret, Laurent Mauborgne, Antoine Miné, David Monniaux, and Xavier Rival. 2005. The ASTRÉE analyzer. In ESOP.Google Scholar
Patrick Cousot, Radhia Cousot, Jérôme Feret, Laurent Mauborgne, Antoine Miné, David Monniaux, and Xavier Rival. 2006. Combination of abstractions in the ASTRÉE static analyzer. In Annual Asian Computing Science Conference. Google ScholarDigital Library
Christoph Csallner, Yannis Smaragdakis, and Tao Xie. 2008. DSD-Crasher: A hybrid analysis tool for bug finding. TOSEM 17, 2 (2008), 8. Google ScholarDigital Library
Dorothy E Denning. 1976. A lattice model of secure information flow. Commun. ACM 19, 5 (1976). Google ScholarDigital Library
Alain Deutsch. 1994. Interprocedural May-alias Analysis for Pointers: Beyond K-limiting. In PLDI. Google ScholarDigital Library
Bruno Dufour, Barbara G. Ryder, and Gary Sevitsky. 2007. Blended analysis for performance understanding of frameworkbased applications. In ISSTA. Google ScholarDigital Library
Manuel Fähndrich and Francesco Logozzo. 2010. Static contract checking with abstract interpretation. In Formal Verification of Object-Oriented Software .Google Scholar
Pietro Ferrara. 2010. Static type analysis of pattern matching by abstract interpretation. In Formal Techniques for Distributed Systems . Springer, 186–200. Google ScholarDigital Library
Pietro Ferrara. 2014. Generic combination of heap and value analyses in abstract interpretation. In VMCAI. Google ScholarDigital Library
Stephen J. Fink, Eran Yahav, Nurit Dor, G. Ramalingam, and Emmanuel Geay. 2008. Effective typestate verification in the presence of aliasing. TOSEM 17, 2 (2008). Google ScholarDigital Library
Martin Fowler. 2004. Inversion of control containers and the dependency injection pattern. (2004).Google Scholar
Yoshihiko Futamura. 1999. Partial evaluation of computation process–an approach to a compiler-compiler. Higher-Order and Symbolic Computation 12, 4 (1999), 381–391. Google ScholarDigital Library
Patrice Godefroid, Nils Klarlund, and Koushik Sen. 2005. DART: Directed Automated Random Testing. In PLDI. Google ScholarDigital Library
Neville Grech, George Fourtounis, Adrian Francalanza, and Yannis Smaragdakis. 2017. Heaps Don’t Lie: Countering Unsoundness with Heap Snapshots. Proc. ACM Program. Lang. 1, OOPSLA, Article 68 (2017). Google ScholarDigital Library
Neil D Jones, Carsten K Gomard, and Peter Sestoft. 1993. Partial evaluation and automatic program generation. Peter Sestoft. Google ScholarDigital Library
Neil D Jones and Steven S Muchnick. 1979. Flow analysis and optimization of LISP-like structures. In POPL. Google ScholarDigital Library
John B Kam and Jeffrey D Ullman. 1977. Monotone data flow analysis frameworks. Acta Informatica 7, 3 (1977), 305–317. Google ScholarDigital Library
Yoonseok Ko, Hongki Lee, Julian Dolby, and Sukyoung Ryu. 2015. Practically Tunable Static Analysis Framework for LargeScale JavaScript Applications. In ASE.Google Scholar
Monica S. Lam, Michael Martin, Benjamin Livshits, and John Whaley. 2008. Securing web applications with static and dynamic information flow tracking. In Partial Evaluation and Semantics-based Program Manipulation. Google ScholarDigital Library
Vincent Laviron and Francesco Logozzo. 2009. Subpolyhedra: A (more) scalable approach to infer linear inequalities. In VMCAI . Google ScholarDigital Library
Yue Li, Tian Tan, and Jingling Xue. 2015. Effective soundness-guided reflection analysis. In SAS.Google Scholar
Francesco Logozzo and Manuel Fähndrich. 2008. Pentagons: a weakly relational abstract domain for the efficient validation of array accesses. In Symposium on Applied Computing. Google ScholarDigital Library
Torben Mogensen. 1995. Self-applicable online partial evaluation of the pure lambda calculus. In Symposium on Partial Evaluation and Semantics-based Program Manipulation . Google ScholarDigital Library
Flemming Nielson. 1985. Tensor products generalize the relational data flow analysis method. In 4th Hungarian Computer Science Conference . 211–225.Google Scholar
Brianna M. Ren and Jeffrey S. Foster. 2016. Just-in-time Static Type Checking for Dynamic Languages. In PLDI. Google ScholarDigital Library
Koushik Sen and Gul Agha. 2006. CUTE and jCUTE: Concolic unit testing and explicit path model-checking tools. In CAV. Google ScholarDigital Library
Koushik Sen, Darko Marinov, and Gul Agha. 2005. CUTE: A Concolic Unit Testing Engine for C. In ESEC/FSE. Google ScholarDigital Library
Yannis Smaragdakis, George Balatsouras, George Kastrinis, and Martin Bravenboer. 2015. More sound static handling of Java reflection. In ASPLAS.Google Scholar
Manu Sridharan, Shay Artzi, Marco Pistoia, Salvatore Guarnieri, Omer Tripp, and Ryan Berg. 2011. F4F: Taint Analysis of Framework-based Web Applications. In OOPSLA. Google ScholarDigital Library
Gregory T Sullivan. 2001. Dynamic partial evaluation. In Programs as Data Objects. 238–256. Google ScholarDigital Library
Vijay Sundaresan, Laurie Hendren, Chrislain Razafimahefa, Raja Vallée-Rai, Patrick Lam, Etienne Gagnon, and Charles Godin. 2000. Practical Virtual Method Call Resolution for Java. In OOPSLA. Google ScholarDigital Library
Alfred Tarski. 1955. A lattice-theoretical fixpoint theorem and its applications. Pacific journal of Mathematics 5, 2 (1955).Google Scholar
Antoine Toubhans, Bor-Yuh Evan Chang, and Xavier Rival. 2013. Reduced product combination of abstract domains for shapes. In VMCAI. Google ScholarDigital Library
Shiyi Wei and Barbara G Ryder. 2013. Practical blended taint analysis for JavaScript. In ISSTA. Google ScholarDigital Library
Matteo Zanioli, Pietro Ferrara, and Agostino Cortesi. 2012. SAILS: Static Analysis of Information Leakage with Sample. In Symposium on Applied Computing . Google ScholarDigital Library

Index Terms

Concerto: a framework for combined concrete and abstract interpretation
1. Software and its engineering
  1. Software notations and tools
    1. General programming languages
      1. Language features
        Frameworks
  2. Software organization and properties
    1. Software functional properties
      1. Formal methods
        Automated static analysis

Recommendations

Control-flow analysis of function calls and returns by abstract interpretation

Abstract interpretation techniques are used to derive a control-flow analysis for a simple higher-order functional language. The analysis approximates the interprocedural control-flow of both function calls and returns in the presence of first-class ...
Read More
Deriving escape analysis by abstract interpretation

Escape analysis of object-oriented languages approximates the set of objects which do not escape from a given context. If we take a method as context, the non-escaping objects can be allocated on its activation stack; if we take a thread, Java ...
Read More
A²I: abstract² interpretation

The fundamental idea of Abstract² Interpretation (A²I), also called meta-abstract interpretation, is to apply abstract interpretation to abstract interpretation-based static program analyses. A²I is generally meant to use abstract interpretation to ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

Proceedings of the ACM on Programming Languages Volume 3, Issue POPL
January 2019
2275 pages
EISSN:2475-1421
DOI:10.1145/3302515
Issue’s Table of Contents

Copyright © 2019 Owner/Author
This work is licensed under a Creative Commons Attribution International 4.0 License.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 2 January 2019
Published in pacmpl Volume 3, Issue POPL

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Badges
- Artifacts Available
- Artifacts Evaluated & Functional
Author Tags
abstract interpretation
framework-based applications
metaprogramming
reflection
state separation
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 7
  Total Citations
  View Citations
- 1,232
  Total Downloads
- Downloads (Last 12 months)140
- Downloads (Last 6 weeks)18
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Concerto: a framework for combined concrete and abstract interpretation

Proceedings of the ACM on Programming Languages

Abstract

Supplemental Material

Available for Download

References

Cited By

Index Terms

Recommendations

Control-flow analysis of function calls and returns by abstract interpretation

Deriving escape analysis by abstract interpretation

A²I: abstract² interpretation