A similar resource auto-discovery based adaptive fault-tolerance method for embedded distributed system

Kailong Zhang, Ke Liang, Xingshe Zhou, Kaibo Wang, Xiao Wu, Zhiyi Yang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Because of the resource constraints and high reliability requirement of Embedded Distributed System (EDS), some new fault-tolerance means, which are different from the traditional hardware-redundancy ones, should be studied. In this article, a fault-tolerance method that based on similar resources and related technologies are proposed and discussed. First, several mathematical models of key elements, such as computing nodes, similar nodes and tasks, are constructed. Then, the similarity computation methods and evaluation criteria are evinced by two different views: tasks and resources. Supported by theories above, numerous methods, such as similar nodes auto-discovery (SNAD) and its optimization one (oSNAD), redundant tasks auto-deployment, and reconfiguration policies of fault tasks and nodes are highlighted respectively. Simulation results show that these approaches and schemes can improve the adaptive fault-tolerance abilities of complicated embedded distributed systems.

Original languageEnglish
Title of host publication2007 International Conference on Parallel Processing Workshops, ICPPW
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages21
Number of pages1
ISBN (Print)0769529348, 9780769529349
DOIs
StatePublished - 2007
Event2007 International Conference on Parallel Processing Workshops, ICPPW 2007 - Xian, China
Duration: 10 Sep 200714 Sep 2007

Publication series

NameProceedings of the International Conference on Parallel Processing Workshops
ISSN (Print)1530-2016

Conference

Conference2007 International Conference on Parallel Processing Workshops, ICPPW 2007
Country/TerritoryChina
CityXian
Period10/09/0714/09/07

Fingerprint

Dive into the research topics of 'A similar resource auto-discovery based adaptive fault-tolerance method for embedded distributed system'. Together they form a unique fingerprint.

Cite this