Predicting Method Crashes with Bytecode Operations

Predicting Method Crashes
with Bytecode Operations
Sunghun Kim
Hong Kong University of Science and Technology, China

Thomas Zimmermann
Microsoft Research, USA

Rahul Premraj
VU University Amsterdam, The Netherlands

Nicolas Bettenburg
Queen’s University, Canada

Shivkumar Shivaji
University of California, Santa Cruz, USA

© Microsoft Corporation

capture and replay
+ prediction


Capture and Replay


ReCrash Technique
Goal: Convert a crash into a set of unit tests

1. Monitoring: maintain a shadow stack
– Contains a copy of each method argument
– On program crash, write the shadow stack to a file

2. Test generation: create many unit tests
For each stack frame, create one unit test:
– Invoke the method using arguments from the shadow stack
– If the test does not reproduce the crash, discard the test

Slide from: http://www.slideshare.net/hunkim/recrash-making-
crashes-reproducible-by-preserving-object-states

Cost of Monitoring
Key cost of ReCrash:
copying arguments to shadow stack

Tradeoff: less information in shadow stack
⇒ lower chance of reproducing crashes

Monitor fewer methods: Ignore methods not
likely to crash


ReCrash+ Technique
Goal: Convert a crash into a set of unit tests

1. Monitoring: maintain a shadow stack
– Contains a copy of each method argument
for methods predicted to crash
– On program crash, write the shadow stack to a file

2. Test generation: create many unit tests
For each stack frame, create one unit test:
– Invoke the method using arguments from the shadow stack
– If the test does not reproduce the crash, discard the test

Slide adapted from: http://www.slideshare.net/hunkim/recrash-
making-crashes-reproducible-by-preserving-object-states

crash
defect prediction


From Defect to Crash
1. The programmer creates a
defect – an error in the code.

2. When executed the defect
creates an infection – an
error in the state.

3. The infection propagates.

4. The infection causes a crash.

Slide adapted from companion materials to Why Programs Fail, 2nd Edition.
A Guide to Systematic Debugging, by Andreas Zeller, Morgan Kauffman.


Approach

Identify crashed methods Gene


Approach

Generate features from Bytecode


Approach

features from Bytecode Build model


Step 1: Identify Crashed Methods

infoZilla
Bug report
infoZilla image by Nicolas
© Microsoft Corporation Bettenburg

Step 2: Generate Features

Bytecode

Control flow graph
(basic blocks)

Step 2: Generate Features


Step 3: Build Classifier


Experiments
1. Evaluating crash prediction
– Within-project classification
– Cross-project classification
– Significant features (see paper)
– Impact of “throws” statements (see paper)

2. Reproducing crashes with ReCrash+


Evaluating Crash Prediction
• Within-project classification:
ten-fold cross validation
• Cross-project validation:
train on one project and test on the other
• Baseline: complexity metrics
Size of Method (in Bytes), Number of Conditional Statements,
Number of Scalar Locals, Number of Vector Locals, Length of Local
Identifiers, McCabe Complexity, Data Structure Complexity, Nesting
Level Complexity, Halstead complexity measures


Within-Project Classification


Cross-Project Classification


Reproducing Crashes
• Train classifier using the ECLIPSE corpus
• Classify methods from a different project
called SVNKit.
– 2,347 methods of which 27% were classified
as crash-prone
• Apply ReCrash+: monitor only those
methods predicted to be crash-prone
– Three crashes from original ReCrash paper


Reproducing Crashes
All 3 crashes from SVNKit were successfully
reproduced by ReCrash+.

Runtime overhead decreased:


Reproducing Crashes
Only a subset
of methods had
to be monitored:


Conclusion
• Monitoring crash-prone methods reduced the
overhead significantly at almost no cost.
• Opportunity for capture and replay tools to
reduce overhead with prediction models.
• Value of project’s history for the identification
of crash-prone methods.
• Potential value of Bytecode features for
prediction models.


Predicting Method Crashes with Bytecode Operations

More Related Content

What's hot (19)

Viewers also liked (7)

Similar to Predicting Method Crashes with Bytecode Operations (20)

More from Thomas Zimmermann (20)

Recently uploaded (20)

Predicting Method Crashes with Bytecode Operations