Fault-Tolerant Parallel Computation

Fault-Tolerant Parallel Computation presents recent advances in algorithmic ways of introducing fault-tolerance in multiprocessors under the constraint of preserving efficiency. The difficulty associated with combining fault-tolerance and efficiency is that the two have conflicting means: fault-tole...

Full description

Bibliographic Details
Main Authors: Kanellakis, Paris Christos, Shvartsman, Alex Allister (Author)
Format: eBook
Language:English
Published: New York, NY Springer US 1997, 1997
Edition:1st ed. 1997
Series:The Springer International Series in Engineering and Computer Science
Subjects:
Online Access:
Collection: Springer Book Archives -2004 - Collection details see MPG.ReNa
LEADER 03147nmm a2200325 u 4500
001 EB000632067
003 EBX01000000000000001348804
005 00000000000000.0
007 cr|||||||||||||||||||||
008 140122 ||| eng
020 |a 9781475752106 
100 1 |a Kanellakis, Paris Christos 
245 0 0 |a Fault-Tolerant Parallel Computation  |h Elektronische Ressource  |c by Paris Christos Kanellakis, Alex Allister Shvartsman 
250 |a 1st ed. 1997 
260 |a New York, NY  |b Springer US  |c 1997, 1997 
300 |a XXIX, 183 p  |b online resource 
505 0 |a 1 Introduction -- 2 Models for Robust Computation -- 3 The Write-All Problem: Algorithms -- 4 Lower Bounds, Snapshots and Approximation -- 5 Fault-Tolerant Simulations -- 6 Shared Memory Randomized Algorithms and Distributed Models and Algorithms -- Bibliography and References -- Author Index 
653 |a Computer science 
653 |a Processor Architectures 
653 |a Microprocessors 
653 |a Theory of Computation 
653 |a Computer architecture 
700 1 |a Shvartsman, Alex Allister  |e [author] 
041 0 7 |a eng  |2 ISO 639-2 
989 |b SBA  |a Springer Book Archives -2004 
490 0 |a The Springer International Series in Engineering and Computer Science 
028 5 0 |a 10.1007/978-1-4757-5210-6 
856 4 0 |u https://doi.org/10.1007/978-1-4757-5210-6?nosfx=y  |x Verlag  |3 Volltext 
082 0 |a 004.0151 
520 |a Fault-Tolerant Parallel Computation presents recent advances in algorithmic ways of introducing fault-tolerance in multiprocessors under the constraint of preserving efficiency. The difficulty associated with combining fault-tolerance and efficiency is that the two have conflicting means: fault-tolerance is achieved by introducing redundancy, while efficiency is achieved by removing redundancy. This monograph demonstrates how in certain models of parallel computation it is possible to combine efficiency and fault-tolerance and shows how it is possible to develop efficient algorithms without concern for fault-tolerance, and then correctly and efficiently execute these algorithms on parallel machines whose processors are subject to arbitrary dynamic fail-stop errors. The efficient algorithmic approaches to multiprocessor fault-tolerance presented in this monograph make a contribution towards bridging the gap between the abstract models of parallel computation and realizable parallel architectures. Fault-Tolerant Parallel Computation presents the state of the art in algorithmic approaches to fault-tolerance in efficient parallel algorithms. The monograph synthesizes work that was presented in recent symposia and published in refereed journals by the authors and other leading researchers. This is the first text that takes the reader on the grand tour of this new field summarizing major results and identifying hard open problems. This monograph will be of interest to academic and industrial researchers and graduate students working in the areas of fault-tolerance, algorithms and parallel computation and may also be used as a text in a graduate course on parallel algorithmic techniques and fault-tolerance