Reliability modeling of large fault-tolerant systems

Suri, N. and Hugue, M.M. and Walter, C.J. (1992) Reliability modeling of large fault-tolerant systems. In: Digest of Papers. FTCS-22: The Twenty-Second International Symposium on Fault-Tolerant Computing. IEEE, pp. 212-220. ISBN 0818628758

Full text not available from this repository.

Abstract

A cluster-based ultrareliable architecture is presented, offering synchronization and system functionality comparable to that of fully connected systems, with reduced system overhead. A reliability model considering the distribution of concurrent faults across the system clusters is shown to increase the accuracy of reliability and system fault-tolerance estimates. The hybrid fault model, which classifies faults based on their behavior, further improves reliability estimates and enhances the fault handling capability of each cluster. Linear growth in cluster reliability with respect to cluster size is possible, as are refinements in the convergence and consistency algorithms for synchronization. © 1992 IEEE.

Item Type:
Contribution in Book/Report/Proceedings
Subjects:
?? CLUSTER COMPUTINGFAULT TOLERANT COMPUTER SYSTEMSSYSTEMS ANALYSISCONNECTED SYSTEMSCONSISTENCY ALGORITHMSFAULT TOLERANT SYSTEMSREDUCED SYSTEMSRELIABILITY ESTIMATESRELIABILITY MODELSYSTEM FAULT TOLERANCESYSTEM FUNCTIONALITYFAULT TOLERANCE ??
ID Code:
137422
Deposited By:
Deposited On:
07 Oct 2019 13:00
Refereed?:
Yes
Published?:
Published
Last Modified:
16 Sep 2023 03:18