Crash-Only Software (2003)  (Make Corrections)  (9 citations)
George Candea, Armando Fox
Proceedings of the 9th Workshop on Hot Topics in Operating Systems (HotOS IX)

 @ NUS   Home/Search   Context   Related

 
View or download:
usenix.org/events/hotos03/...candea.pdf
Cached:  PS.gz  PS  PDF  Image  Update  Help

From:  usenix.org/events/hotos0...candea (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Crash-only programs crash safely and recover quickly. There is only one way to stop such software---by crashing it---and only one way to bring it up---by initiating recovery. Crash-only systems are built from crash-only components, and the use of transparent component-level retries hides intra-system component crashes from end users. In this paper we advocate a crash-only design for Internet systems, showing that it can lead to more reliable, predictable code and faster, more effective... (Update)

Cited by:   More
Combining Statistical Monitoring and Predictable - Recovery For Self-Management   (Correct)
Building a Reactive Immune System for Software Services - Sidiroglou, Locasto.. (2004)   (Correct)
Microreboot - A Technique for Cheap Recovery - Candea, Kawamoto, Fujiki.. (2004)   (Correct)

Similar documents (at the sentence level):
19.3%:   Improving Availability with Recursive Micro-Reboots: A.. - Candea, Cutler, Fox (2003)   (Correct)

Active bibliography (related documents):   More   All
0.8:   Session State: Beyond Soft State - Benjamin Ling Emre (2004)   (Correct)
0.4:   A Survey of Fault-Tolerance and Fault-Recovery Techniques in.. - Treaster (2005)   (Correct)
0.3:   JAGR: An Autonomous Self-Recovering Application Server - Candea, Kiciman, Zhang.. (2003)   (Correct)

Similar documents based on text:   More   All
0.5:   USENIX Association - Hotos Ix The   (Correct)
0.5:   Cassyopia: Compiler Assisted System Optimization - Rajagopalan, Debray.. (2003)   (Correct)
0.5:   Exploiting the Synergy between Peer-to-Peer and Mobile Ad Hoc .. - Hu, Das, Pucha (2003)   (Correct)

Related documents from co-citation:   More   All
5:   Lessons from giant-scale services - Brewer
4:   Finding surprising patterns in a time series database in linear time and space - Keogh, Lonardi et al. - 2002
4:   Session state: Beyond soft state - Ling, Kiciman et al. - 2004

BibTeX entry:   (Update)

G. Candea and A. Fox. Crash-only software. In Proc. 9th Workshop on Hot Topics in Operating Systems, Lihue, Hawaii, 2003. http://citeseer.comp.nus.edu.sg/665494.html   More

@inproceedings{ candea03crash,
  author = "George Candea and Armando Fox",
  title = "Crash-Only Software",
  booktitle = "Proceedings of the 9th Workshop on Hot Topics in Operating Systems (HotOS IX)",
  year = "2003",
  month = "May"	 ,
  address = "Lihue, HI",
  url = "citeseer.comp.nus.edu.sg/665494.html" }
Citations (may not include all citations):
901   Transaction processing: concepts and techniques (context) - Gray, Reuter - 1993
235   Practical Byzantine fault tolerance - Castro, Liskov - 1999
123   Leases: An efficient faulttolerant mechanism for distributed.. (context) - Gray, Cheriton - 1989
84   distributed data structures for Internet service constructio.. (context) - Gribble, Brewer et al. - 2000
83   The design of the Postgres storage system - Stonebraker - 1987
79   Why do computers stop and what can be done about it - Gray - 1986
62   Scale and performance in the Denali isolation kernel - Whitaker, Shaw et al. - 2002
45   Recursive restartability: Turning the reboot sledgehammer in.. - Candea, Fox - 2001
38   An empirical study of operating systems errors - Chou, Yang et al. - 2001
36   Software rejuvenation: Analysis (context) - Huang, Kintala et al. - 1995
30   Berkeley DB (context) - Olson, Bostic et al. - 1999
27   Measuring system and software reliability using an automated.. (context) - Murphy, Gent - 1995
13   Multitasking without compromise: A virtual machine evolution (context) - Czajkowski, Daynes - 2001
13   A methodology for detection and estimation of software aging (context) - Garg, Moorsel et al. - 1998
9   Fail-stutter fault tolerance (context) - Arpaci-Dusseau, Arpaci-Dusseau - 2001
7   Fast-Start: Quick fault recovery in Oracle (context) - Lahiri, Ganesh et al. - 2001
6   JAGR: An autonomous self-recovering application server - Candea, Keyani et al. - 2003
6   Perfect failure detection in timed asynchronous systems - Fetzer - 2003
6   Using fault model enforcement to improve availability - Nagaraja, Bianchini et al. - 2002
4   System reliability and availability drivers of Tru64 UNIX (context) - Murphy, Davies - 1999
4   Personal communication (context) - Pal - 2002
3   Improving availability with recursive micro-reboots: A soft-.. - Candea, Cutler et al. - 2003
2   selfhealing session state management layer (context) - Ling, Fox et al. - 2003
2   Sustainable infrastructures: How IT services can address the.. (context) - Adams, Igou et al. - 2001
1   Application isolation API specification (context) - Soper, Donald et al. - 2002
1   Decoupled storage: State with stateless-like properties (context) - Huang, Fox - 2003



The graph only includes citing articles where the year of publication is known.


Online articles have much greater impact   More about CiteSeer.IST at NUS   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST at NUS - Copyright Penn State and NEC. Hosted by the School of Computing, National University of Singapore.