research-article

Efficient state merging in symbolic execution

Authors:
Volodymyr Kuznetsov

École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland

École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
View Profile

,
Johannes Kinder

École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland

École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
View Profile

,
Stefan Bucur

École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland

École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
View Profile

,
George Candea

École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland

École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
View Profile

PLDI '12: Proceedings of the 33rd ACM SIGPLAN Conference on Programming Language Design and ImplementationJune 2012Pages 193–204https://doi.org/10.1145/2254064.2254088

Published:11 June 2012Publication History

PLDI '12: Proceedings of the 33rd ACM SIGPLAN Conference on Programming Language Design and Implementation

Pages 193–204

ABSTRACT

Symbolic execution has proven to be a practical technique for building automated test case generation and bug finding tools. Nevertheless, due to state explosion, these tools still struggle to achieve scalability. Given a program, one way to reduce the number of states that the tools need to explore is to merge states obtained on different paths. Alas, doing so increases the size of symbolic path conditions (thereby stressing the underlying constraint solver) and interferes with optimizations of the exploration process (also referred to as search strategies). The net effect is that state merging may actually lower performance rather than increase it.

We present a way to automatically choose when and how to merge states such that the performance of symbolic execution is significantly increased. First, we present query count estimation, a method for statically estimating the impact that each symbolic variable has on solver queries that follow a potential merge point; states are then merged only when doing so promises to be advantageous. Second, we present dynamic state merging, a technique for merging states that interacts favorably with search strategies in automated test case generation and bug finding tools.

Experiments on the 96 GNU Coreutils show that our approach consistently achieves several orders of magnitude speedup over previously published results. Our code and experimental data are publicly available at http://cloud9.epfl.ch.

References

S. Anand, P. Godefroid, and N. Tillmann. Demand-driven compositional symbolic execution. In Intl. Conf. on Tools and Algorithms for the Construction and Analysis of Systems (TACAS), 2008. Google ScholarDigital Library
D. Babic and A. J. Hu. Calysto: scalable and precise extended static checking. In Intl. Conf. on Software Engineering (ICSE), 2008. Google ScholarDigital Library
P. Boonstoppel, C. Cadar, and D. R. Engler. RWset: Attacking path explosion in constraint-based test generation. In Intl. Conf. on Tools and Algorithms for the Construction and Analysis of Systems (TACAS), 2008. Google ScholarDigital Library
R. S. Boyer, B. Elspas, and K. N. Levitt. SELECT -- a formal system for testing and debugging programs by symbolic execution. In Intl. Conf. on Reliable Software (ICRS), 1975. Google ScholarDigital Library
C. Cadar, V. Ganesh, P. M. Pawlowski, D. L. Dill, and D. R. Engler. EXE: Automatically generating inputs of death. In Conf. on Computer and Communications Security (CCS), 2006. Google ScholarDigital Library
C. Cadar, D. Dunbar, and D. R. Engler. KLEE: Unassisted and automatic generation of high-coverage tests for complex systems programs. In Symp. on Operating Systems Design and Implementation (SOSP), 2008. Google ScholarDigital Library
V. Chipounov, V. Kuznetsov, and G. Candea. S2E: A platform for in-vivo multi-path analysis of software systems. In Intl. Conf. on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2011. Google ScholarDigital Library
E. Clarke, D. Kroening, and F. Lerda. A tool for checking ANSI-C programs. In Intl. Conf. on Tools and Algorithms for the Construction and Analysis of Systems (TACAS), 2004.Google ScholarCross Ref
M. Das, S. Lerner, and M. Seigle. ESP: Path-sensitive program verification in polynomial time. In Intl. Conf. on Programming Language Design and Implem. (PLDI), 2002. Google ScholarDigital Library
L. M. de Moura and N. Bjørner. Z3: An efficient SMT solver. In Intl. Conf. on Tools and Algorithms for the Construction and Analysis of Systems (TACAS), 2008. Google ScholarDigital Library
E. W. Dijkstra. Guarded commands, nondeterminacy and formal derivation of programs. Commun. ACM, 18(8):453--457, 1975. Google ScholarDigital Library
N. Eén and N. Sörensson. An extensible SAT-solver. In Intl. Conf. on Theory and Applications of Satisfiability Testing (SAT), 2003.Google Scholar
C. Flanagan, K. R. M. Leino, M. Lillibridge, G. Nelson, J. B. Saxe, and R. Stata. Extended static checking for Java. In Intl. Conf. on Programming Language Design and Implem. (PLDI), 2002. Google ScholarDigital Library
M. K. Ganai and A. Gupta. Tunneling and slicing: towards scalable BMC. In Design Automation Conf. (DAC), 2008. Google ScholarDigital Library
V. Ganesh and D. L. Dill. A decision procedure for bit-vectors and arrays. In Intl. Conf. on Computer Aided Verification (CAV), 2007. Google ScholarDigital Library
P. Godefroid. Compositional dynamic test generation. In Symp. on Principles of Programming Languages (POPL), 2007. Google ScholarDigital Library
P. Godefroid and D. Luchaup. Automatic partial loop summarization in dynamic test generation. In Intl. Symp. on Software Testing and Analysis (ISSTA), 2011. Google ScholarDigital Library
P. Godefroid, N. Klarlund, and K. Sen. DART: Directed automated random testing. In Intl. Conf. on Programming Language Design and Implem. (PLDI), 2005. Google ScholarDigital Library
P. Godefroid, M. Y. Levin, and D. Molnar. Automated whitebox fuzz testing. In Network and Distributed System Security Symp. (NDSS), 2008.Google Scholar
T. Hansen, P. Schachte, and H. Sondergaard. State joining and splitting for the symbolic execution of binaries. In Intl. Conf. on Runtime Verification (RV), 2009. Google ScholarDigital Library
F. Ivancic, Z. Yang, M. K. Ganai, A. Gupta, I. Shlyakhter, and P. Ashar. F-soft: Software verification platform. In Intl. Conf. on Computer Aided Verification (CAV), 2005. Google ScholarDigital Library
F. Ivancic, G. Balakrishnan, A. Gupta, S. Sankaranarayanan, N. Maeda, H. Tokuoka, T. Imoto, and Y. Miyazaki. DC2: A framework for scalable, scope-bounded software verification. In Intl. Conf. on Automated Software Engineering (ASE), 2011. Google ScholarDigital Library
J. C. King. A new approach to program testing. In Intl. Conf. on Reliable Software (ICRS), 1975. Google ScholarDigital Library
S. K. Lahiri and S. Qadeer. Back to the future: revisiting precise program verification using SMT solvers. In Symp. on Principles of Programming Languages (POPL), 2008. Google ScholarDigital Library
C. Lattner and V. Adve. LLVM: A compilation framework for lifelong program analysis and transformation. In Intl. Symp. on Code Generation and Optimization (CGO), 2004. Google ScholarDigital Library
K. R. M. Leino and P. Rümmer. A polymorphic intermediate verification language: Design and logical encoding. In Intl. Conf. on Tools and Algorithms for the Construction and Analysis of Systems (TACAS), 2010. Google ScholarDigital Library
L. Mauborgne and X. Rival. Trace partitioning in abstract interpretation based static analyzers. In European Symp. on Programming (ESOP), 2005. Google ScholarDigital Library
K. L. McMillan. Lazy annotation for program testing and verification. In Intl. Conf. on Computer Aided Verification (CAV), 2010. Google ScholarDigital Library
O. Sery, G. Fedyukovich, and N. Sharygina. Interpolation-based function summaries in bounded model checking. In Haifa Verification Conf. (HVC), 2011.Google Scholar
D. Wheeler. SLOCCount. http://www.dwheeler.com/sloccount/,2010.Google Scholar
Y. Xie and A. Aiken. Scalable error detection using boolean satisfiability. In Symp. on Principles of Programming Languages (POPL), 2005. Google ScholarDigital Library

Index Terms

Recommendations

Enhancing symbolic execution with veritesting
ICSE 2014: Proceedings of the 36th International Conference on Software Engineering

We present MergePoint, a new binary-only symbolic execution system for large-scale and fully unassisted testing of commodity off-the-shelf (COTS) software. MergePoint introduces veritesting, a new technique that employs static symbolic execution to ...
Read More
Efficient state merging in symbolic execution
PLDI '12

Symbolic execution has proven to be a practical technique for building automated test case generation and bug finding tools. Nevertheless, due to state explosion, these tools still struggle to achieve scalability. Given a program, one way to reduce the ...
Read More
State Merging with Quantifiers in Symbolic Execution
ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering

We address the problem of constraint encoding explosion which hinders the applicability of state merging in symbolic execution. Specifically, our goal is to reduce the number of disjunctions and if-then-else expressions introduced during state ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
PLDI '12: Proceedings of the 33rd ACM SIGPLAN Conference on Programming Language Design and Implementation
June 2012
572 pages
ISBN:9781450312059
DOI:10.1145/2254064
General Chairs:
Jan Vitek
Purdue University
,
Haibo Lin
Microsoft China
,
Program Chair:
Frank Tip
IBM T.J. Watson Research Center
ACM SIGPLAN Notices Volume 47, Issue 6
PLDI '12
June 2012
534 pages
ISSN:0362-1340
EISSN:1558-1160
DOI:10.1145/2345156
Issue’s Table of Contents
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 11 June 2012
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
bounded software model checking
state merging
symbolic execution
testing
verification
Qualifiers
- research-article
Conference

Acceptance Rates
PLDI '12 Paper Acceptance Rate48of255submissions,19%Overall Acceptance Rate406of2,067submissions,20%
More
Upcoming Conference
PLDI '24

Sponsor:

sigplan

ACM SIGPLAN Conference on Programming Language Design and Implementation

June 24 - 28, 2024

Copenhagen , Denmark
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 178
  Total Citations
  View Citations
- 1,448
  Total Downloads
- Downloads (Last 12 months)100
- Downloads (Last 6 weeks)9
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Efficient state merging in symbolic execution

PLDI '12: Proceedings of the 33rd ACM SIGPLAN Conference on Programming Language Design and Implementation

ABSTRACT

References

Cited By

Index Terms

Recommendations

Enhancing symbolic execution with veritesting

Efficient state merging in symbolic execution

State Merging with Quantifiers in Symbolic Execution