(Enter summary)
Abstract: Software robustness has significant impact on system availability. Unfortunately, finding software bugs is a very challenging task because many bugs are hard to reproduce. While debugging a program, it would be very useful to rollback a crashed program to a previous execution point and deterministically re-execute the "buggy" code region. However, most previous work on rollback and replay support was designed to survive hardware or operating system failures, and is therefore too heavyweight for ... (Update)
Cited by: More
BugNet: Continuously Recording Program Execution for.. - Narayanasamy, Pokam.. (2005)
(Correct)
Active bibliography (related documents): More All
1.0: AccMon: Automatically Detecting Memory-related.. - Zhou, Liu, Fei.. (2004)
(Correct)
0.6: Non-Intrusive on-the-Fly Data Race Detection Using.. - Ronsse, De Bosschere (2000)
(Correct)
0.5: iWatcher: Efficient Architectural Support for Software.. - Zhou, Qin, Liu, Zhou.. (2004)
(Correct)
Similar documents based on text: More All
0.3: Argus - A distributed network-intrusion detection system - Singh, Kandula (2001)
(Correct)
0.2: Peer-To-Peer Backup for Personal Area Networks - Loo, LaMarca, Borriello (2003)
(Correct)
0.2: Unknown - Copyright By Sankalp
(Correct)
BibTeX entry: (Update)
S. M. Srinivasan, S. Kandula, C. R. Andrews, and Y. Zhou. Flashback: A lightweight extension for rollback and deterministic replay for software debugging. In USENIX Annual Technical Conference, General Track, pages 29--44, 2004. http://citeseer.comp.nus.edu.sg/654773.html More
@misc{ srinivasan04flashback,
author = "S. Srinivasan and S. Kandula and C. Andrews and Y. Zhou",
title = "Flashback: A lightweight extension for rollback and deterministic replay
for software debugging",
text = "S. M. Srinivasan, S. Kandula, C. R. Andrews, and Y. Zhou. Flashback: A
lightweight extension for rollback and deterministic replay for software
debugging. In USENIX Annual Technical Conference, General Track, pages 29--44,
2004.",
year = "2004",
url = "citeseer.comp.nus.edu.sg/654773.html" }
Citations (may not include all citations):
217
Optimistic recovery in distributed systems (context) - Strom, Yemini - 1985
180
A survey of rollback-recovery protocols in message-passing s..
- Elnozahy, Alvisi et al. - 2002
175
Dealing with disaster: Surviving misbehaved kernel extension..
- Seltzer, Endo et al. - 1996
174
Distributed programming in argus (context) - Liskov - 1988
156
Recovery in distributed systems using optimistic message log..
- Johnson, Zwaenepoel - 1988
140
Eraser: A dynamic data race detector for multithreaded progr..
- Savage, Burrows et al. - 1997
140
Extended static checking for java
- Flanagan, Leino et al. - 2002
98
Purify: Fast detection of memory leaks and access errors (context) - Hastings, Joyce - 1992
98
A message system supporting fault tolerance (context) - Borg, Baumbach et al. - 1983
80
Lightweight recoverable virtual memory
- Satyanarayanan, Mashburn et al. - 1993
77
Type-based race detection for Java
- Flanagan, Freund - 2000
69
Proactive recovery in a byzantinefault -tolerant system
- Castro, Liskov - 2000
68
ACM Transactions on Computer Systems (context) - Borg, Blau et al. - 1989
65
The ParaScope Parallel Programming Environment
- Cooper, Hall et al. - 1993
58
Tracking down software bugs using automatic anomaly detectio..
- Hangal, Lam - 2002
56
Checkpointing and its applications
- Wang, Huang et al. - 1995
54
Intrusion detection via static analysis
- Wagner, Dean - 2001
43
Recovery management in quicksilver (context) - Haskin, Malachi et al. - 1988
43
Ownership types for safe programming: Preventing data races ..
- Boyapati, Lee et al. - 2002
42
Warlock: A static data race analysis tool (context) - Sterling - 1993
41
Igor: A system for program debugging via reversible executio.. (context) - Feldman, Brown - 1989
41
BASE: Using abstraction to improve fault tolerance
- Rodrigues, Castro et al. - 2001
36
Software rejuvenation: analysis (context) - Huang, Kintala et al. - 1995
34
Free transactions with Rio Vista (context) - Lowell, Chen - 1997
32
Deterministic replay of java multithreaded applications
- Choi, Srinivasan - 1998
31
Object race detection
- Praun, Gross - 2001
29
Optimal tracing and replay for debugging shared-memory paral..
- Netzer - 1993
29
Revirt: enabling intrusion analysis through virtualmachine l.. (context) - Dunlap, Kind et al. - 2002
28
Recovery-oriented computing (context) - Patterson - 2002
27
Detecting Data Races on Weak Memory Systems
- Adve, Hill et al. - 1991
27
Race Frontier: Reproducing Data Races in Parallel-Program De.. (context) - Choi, Min - 1991
26
Completely asynchronous optimistic recovery with minimal rol..
- Smith, Johnson et al. - 1995
23
Debugging via run-time type checking
- Loginov, Yong et al. - 2001
22
Improving security using extensible lightweight static analy..
- Evans, Larochelle - 2002
21
main memory storage system (context) - Wu, Zwaenepoel et al. - 1994
20
Clip: a checkpointing tool for message-passing parallel prog..
- Chen, Plank et al. - 1997
18
Replay for concurrent nondeterministic shared-memory applica.. (context) - Russinovich, Cogswell - 1996
18
Exploring failure transparency and limits of generic recover.. (context) - Lowell, Chandra et al. - 2000
17
Supporting nondeterministic execution in fault-tolerant syst.. (context) - Slye, Elnozahy - 1996
17
Checkpointing memoryresident databases (context) - Salem, Garcia-Molina - 1987
15
RecPlay Fully Integrated Practical RecordReplay System (context) - Bosschere, Integrated et al. - 1999
14
Detecting Violations of Sequential Consistency
- Gharachorloo, gibbons - 1991
12
Why optimistic message logging has not been used in telecomm..
- Huang, Wang - 1995
11
Debugging concurrent processes: A case study (context) - Stone - 1988
10
Lclint: A tool for using speci cations to check code (context) - Evans, Guttag et al. - 1994
10
Fast cluster failover using virtual memory-mapped communicat..
- Zhou, Chen et al. - 1999
9
CCured: typesafe retro tting of legacy code
- Necula, McPeak et al. - 2002
9
Crashonly software
- Candea, Fox - 2003
9
low-overhead recovery for general applications (context) - Chen, Lowell et al. - 1998
9
A system and language for building system-speci c (context) - Hallem, Chelf et al. - 2002
8
Progressive retry for software failure recovery in messagepa.. (context) - Wang, Huang et al. - 1997
8
Integrating checkpointing with transaction processing
- Wang, Chung et al. - 1997
7
Enhancing software reliability with speculative threads
- Oplinger, Lam - 2002
7
Reenact: using thread-level speculation mechanisms to debug .. (context) - Prvulovic, Torrellas - 2003
6
Model checking java programs using java path nder (context) - Havelund, Pressburger - 1998
6
Software errors cost u (context) - of, Technlogy et al. - 2002
6
Concurrent real-time checkpoint for parallel programs (context) - Li, Naughton et al. - 1990
5
IEEE Transactions on Parallel and Distributed Systems (context) - Plank, Li et al. - 1998
5
Bugs as inconsistent behavior: A general approach to inferri.. (context) - Engler, Chen et al. - 2001
5
A rst step towards automated detection of buer overrun vuln.. (context) - Wagner, Foster et al. - 2000
5
Synchronization without contention
- Mellor-Crummey, Scott - 1991
4
Fine grained software rejuvenation models (context) - Bobbio, Sereno - 1998
4
Webstone: The rst generation in http server benchmarking (context) - Trent, Sake - 1995
3
Data replication strategies for fault tolerance and availabi..
- Amza, Cox et al. - 2000
3
A transactional memory service in an extensible operating sy..
- Saito, Bershad - 1998
3
Hardware and software support for ecient exception handling (context) - Thekkath, Levy - 1994
3
The Rio le cache: Surviving operating systems crashes (context) - Chen, Ng et al. - 1996
2
Blueprints for high availablity (context) - Marcus, Stern - 2000
2
detection of access anomalies (context) - Schonberg - 1989
2
An ecient checkpointing method for multicomputers with wormh.. (context) - Li, Naughton et al. - 1991
2
Reducing recovery time in a small recursively restartable sy..
- Candea - 2002
2
in implementing causal message logging protocols (context) - Marzullo - 1996
1
Low-latency concurrent checkpoint for parallel programs (context) - Li, Naughton et al. - 1994
1
How to recover eciently and asynchronously when optimism fai.. (context) - Damani, Garg - 1996
1
Elimination of Java array bounds checks in the presence of i.. (context) - Luj, Gurd et al. - 2002
1
A Protocol-Centric Approach to on-the-Fly Race Detection
- Perkovic, Keleher - 2000
1
An Ecient Cache-based Access Anomaly Detection Scheme (context) - Min, Choi - 1991
1
Using redundancies to nd errors (context) - Xie, Engler - 2002
1
URL: http: //developer (context) - Corporation
1
Using model checking to debug network interface rmware
- Kumar, Li - 2002
1
ReEnact: Using ThreadLevel Speculation to Debug Software; An.. (context) - Prvulovic, Torrellas - 2003
http://syscalltrack.sourceforge.net/how.html
Documents on the same site (http://www-faculty.cs.uiuc.edu/~yyzhou/publication.htm): More
Memory Management for Networked Servers - Zhou (2000)
(Correct)
Second-Level Buffer Cache Management - Yuanyuan Zhou Member
(Correct)
Experiences with VI Communication for Database Storage - Yuanyuan Zhou Angelos (2002)
(Correct)
Online articles have much greater impact More about CiteSeer.IST at NUS Add search form to your site Submit documents Feedback
CiteSeer.IST at NUS - Copyright Penn State and NEC. Hosted by the School of Computing, National University of Singapore.