Abstract: Crash-only programs crash safely and recover quickly. There is only one way to stop such software (by crashing it) and only one way to bring it up (by initiating recovery). Crash-only systems are built from crash-only components, and the use of transparent component-level retries hides intra-system component crashes from end users. In this paper we advocate a crash-only design for Internet systems, showing that it can lead to more reliable, predictable code and faster, more effective...
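The idea in the abstract can be illustrated with a minimal Python sketch. It is not taken from the paper: the names CrashOnlyComponent, ComponentCrashed, call_with_retry, and the dict used as a durable store are all hypothetical, and the retry policy is an assumption chosen for brevity. The component has a single stop path (crash) and a single start path (recover), and the caller-side wrapper retries so that a crash stays hidden from the end user.

```python
class ComponentCrashed(Exception):
    """Raised when a call reaches a component that is down (hypothetical)."""


class CrashOnlyComponent:
    """Hypothetical component: crash() is the only stop, recover() the only start."""

    def __init__(self, durable_store):
        # Assumption: durable_store stands in for state that survives a crash.
        self.durable_store = durable_store
        self.up = False
        self.recover()  # even the first start goes through recovery

    def crash(self):
        # There is no clean-shutdown path; stopping is always a crash.
        self.up = False

    def recover(self):
        # Rebuild volatile state from durable state only.
        self.counter = self.durable_store.get("counter", 0)
        self.up = True

    def increment(self):
        if not self.up:
            raise ComponentCrashed("component is down")
        self.counter += 1
        self.durable_store["counter"] = self.counter  # persist before replying
        return self.counter


def call_with_retry(component, method_name, max_retries=3):
    """Retry a call, recovering the component after each observed crash."""
    for _ in range(max_retries + 1):
        try:
            return getattr(component, method_name)()
        except ComponentCrashed:
            component.recover()
    raise ComponentCrashed("component did not recover after retries")


store = {}
component = CrashOnlyComponent(store)
first = call_with_retry(component, "increment")
component.crash()  # simulate a failure between requests
second = call_with_retry(component, "increment")
```

In this sketch the caller never sees the crash: the second call fails once internally, triggers recovery from the durable store, and then succeeds on retry.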
Cited by:
Combining Statistical Monitoring and Predictable Recovery for Self-Management
Building a Reactive Immune System for Software Services - Sidiroglou, Locasto.. (2004)
Microreboot - A Technique for Cheap Recovery - Candea, Kawamoto, Fujiki.. (2004)
Similar documents (at the sentence level):
19.3%: Improving Availability with Recursive Micro-Reboots: A.. - Candea, Cutler, Fox (2003)
Active bibliography (related documents):
0.8: Session State: Beyond Soft State - Benjamin Ling Emre (2004)
0.4: A Survey of Fault-Tolerance and Fault-Recovery Techniques in.. - Treaster (2005)
0.3: JAGR: An Autonomous Self-Recovering Application Server - Candea, Kiciman, Zhang.. (2003)
Similar documents based on text:
0.5: USENIX Association - Hotos Ix The
0.5: Cassyopia: Compiler Assisted System Optimization - Rajagopalan, Debray.. (2003)
0.5: Exploiting the Synergy between Peer-to-Peer and Mobile Ad Hoc .. - Hu, Das, Pucha (2003)
Related documents from co-citation:
5: Lessons from giant-scale services - Brewer
4: Finding surprising patterns in a time series database in linear time and space - Keogh, Lonardi et al. - 2002
4: Session state: Beyond soft state - Ling, Kiciman et al. - 2004
BibTeX entry:
G. Candea and A. Fox. Crash-only software. In Proc. 9th Workshop on Hot Topics in Operating Systems, Lihue, Hawaii, 2003. http://citeseer.comp.nus.edu.sg/665494.html
@inproceedings{candea03crash,
  author    = "George Candea and Armando Fox",
  title     = "Crash-Only Software",
  booktitle = "Proceedings of the 9th Workshop on Hot Topics in Operating Systems (HotOS IX)",
  year      = "2003",
  month     = "May",
  address   = "Lihue, HI",
  url       = "http://citeseer.comp.nus.edu.sg/665494.html"
}
Citations (may not include all citations):
901: Transaction processing: concepts and techniques - Gray, Reuter - 1993
235: Practical Byzantine fault tolerance - Castro, Liskov - 1999
123: Leases: An efficient fault-tolerant mechanism for distributed.. - Gray, Cheriton - 1989
84: distributed data structures for Internet service constructio.. - Gribble, Brewer et al. - 2000
83: The design of the Postgres storage system - Stonebraker - 1987
79: Why do computers stop and what can be done about it - Gray - 1986
62: Scale and performance in the Denali isolation kernel - Whitaker, Shaw et al. - 2002
45: Recursive restartability: Turning the reboot sledgehammer in.. - Candea, Fox - 2001
38: An empirical study of operating systems errors - Chou, Yang et al. - 2001
36: Software rejuvenation: Analysis - Huang, Kintala et al. - 1995
30: Berkeley DB - Olson, Bostic et al. - 1999
27: Measuring system and software reliability using an automated.. - Murphy, Gent - 1995
13: Multitasking without compromise: A virtual machine evolution - Czajkowski, Daynes - 2001
13: A methodology for detection and estimation of software aging - Garg, Moorsel et al. - 1998
9: Fail-stutter fault tolerance - Arpaci-Dusseau, Arpaci-Dusseau - 2001
7: Fast-Start: Quick fault recovery in Oracle - Lahiri, Ganesh et al. - 2001
6: JAGR: An autonomous self-recovering application server - Candea, Keyani et al. - 2003
6: Perfect failure detection in timed asynchronous systems - Fetzer - 2003
6: Using fault model enforcement to improve availability - Nagaraja, Bianchini et al. - 2002
4: System reliability and availability drivers of Tru64 UNIX - Murphy, Davies - 1999
4: Personal communication - Pal - 2002
3: Improving availability with recursive micro-reboots: A soft-.. - Candea, Cutler et al. - 2003
2: self-healing session state management layer - Ling, Fox et al. - 2003
2: Sustainable infrastructures: How IT services can address the.. - Adams, Igou et al. - 2001
1: Application isolation API specification - Soper, Donald et al. - 2002
1: Decoupled storage: State with stateless-like properties - Huang, Fox - 2003