Advanced

Control-Quality Optimization for Distributed Embedded Systems with Adaptive Fault Tolerance

Samii, Soheil; Bordoloi, Unmesh D.; Eles, Petru; Peng, Zebo and Cervin, Anton LU (2012) 24th Euromicro Conference on Real-Time Systems In 24th Euromicro Conference on Real-Time Systems (ECRTS), 2012 p.68-77
Abstract
In this paper, we propose a design framework for distributed embedded control systems that ensures reliable execution and high quality of control even if some computation nodes fail. When a node fails, the configuration of the underlying distributed system changes and the system must adapt to this new situation by activating tasks at operational nodes. The task mapping as well as schedules and control laws that are customized for the new configuration influence the control quality and must, therefore, be optimized. The number of possible configurations due to faults is exponential in the number of nodes in the system. This design-space complexity leads to unaffordable design time and large memory requirements to store information related... (More)
In this paper, we propose a design framework for distributed embedded control systems that ensures reliable execution and high quality of control even if some computation nodes fail. When a node fails, the configuration of the underlying distributed system changes and the system must adapt to this new situation by activating tasks at operational nodes. The task mapping as well as schedules and control laws that are customized for the new configuration influence the control quality and must, therefore, be optimized. The number of possible configurations due to faults is exponential in the number of nodes in the system. This design-space complexity leads to unaffordable design time and large memory requirements to store information related to mappings, schedules, and controllers. We demonstrate that it is sufficient to synthesize solutions for a small number of base and minimal configurations to achieve fault tolerance with an inherent minimum level of control quality. We also propose an algorithm to further improve control quality with a priority-based search of the set of configurations and trade-offs between task migration and replication. (Less)
Please use this url to cite or link to this publication:
author
organization
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
in
24th Euromicro Conference on Real-Time Systems (ECRTS), 2012
pages
68 - 77
publisher
IEEE--Institute of Electrical and Electronics Engineers Inc.
conference name
24th Euromicro Conference on Real-Time Systems
external identifiers
  • scopus:84866443073
ISSN
1068-3070
ISBN
978-1-4673-2032-0
DOI
10.1109/ECRTS.2012.40
project
ELLIIT-sched
language
English
LU publication?
yes
id
e4027baa-885a-47db-bd76-bd796eb58a68 (old id 2968601)
date added to LUP
2012-08-24 11:28:36
date last changed
2017-01-01 05:35:20
@inproceedings{e4027baa-885a-47db-bd76-bd796eb58a68,
  abstract     = {In this paper, we propose a design framework for distributed embedded control systems that ensures reliable execution and high quality of control even if some computation nodes fail. When a node fails, the configuration of the underlying distributed system changes and the system must adapt to this new situation by activating tasks at operational nodes. The task mapping as well as schedules and control laws that are customized for the new configuration influence the control quality and must, therefore, be optimized. The number of possible configurations due to faults is exponential in the number of nodes in the system. This design-space complexity leads to unaffordable design time and large memory requirements to store information related to mappings, schedules, and controllers. We demonstrate that it is sufficient to synthesize solutions for a small number of base and minimal configurations to achieve fault tolerance with an inherent minimum level of control quality. We also propose an algorithm to further improve control quality with a priority-based search of the set of configurations and trade-offs between task migration and replication.},
  author       = {Samii, Soheil and Bordoloi, Unmesh D. and Eles, Petru and Peng, Zebo and Cervin, Anton},
  booktitle    = {24th Euromicro Conference on Real-Time Systems (ECRTS), 2012},
  isbn         = {978-1-4673-2032-0},
  issn         = {1068-3070},
  language     = {eng},
  pages        = {68--77},
  publisher    = {IEEE--Institute of Electrical and Electronics Engineers Inc.},
  title        = {Control-Quality Optimization for Distributed Embedded Systems with Adaptive Fault Tolerance},
  url          = {http://dx.doi.org/10.1109/ECRTS.2012.40},
  year         = {2012},
}