Back to Search
Start Over
Run-Time Recovery and Failure Analysis of Time-Triggered Traffic in Time Sensitive Networks
- Source :
- IEEE Access, 9:9465102, 91710-91722. Institute of Electrical and Electronics Engineers, IEEE Access, Vol 9, Pp 91710-91722 (2021)
- Publication Year :
- 2021
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2021.
-
Abstract
- Reliability is one of the major concerns of Time Sensitive Networking (TSN). Current systems mostly rely on static redundancy to protect functionality from permanent component failures. This greatly increases the cost of Time-Triggered (TT) flows. Instead, Software Defined Networking (SDN) enables dynamic redundancy. Disrupted traffic can be rerouted by a centralized controller to reduce the cost while maintaining reliability. This paper presents an approach to compute alternative paths at run-time and analyze their impact on reliability. We define a novel three-mode recovery scheme, which includes full functionality, reduced functionality, and emergency halt modes. Run-time recovery for TT flows is explored using Integer Linear Programming (ILP) and a heuristic algorithm. Then, a Markov chain-based design-time reliability analysis is developed to evaluate the Mean Time to Reduced Functionality Mode (MTTRF) and Mean Time to Failure (MTTF) of run-time recoverable systems. Our experiments show that run-time recovery provides better protection against multi-point failures than static redundancy. Compared with the state of the art, our proposed ILP has better routing efficiency. The proposed heuristic algorithm can perform routing and scheduling in polynomial time, but it tends to route multicast flows to longer paths than ILP. Furthermore, when applied to realistic recovery scenarios, our proposed ILP improves the MTTF by up to $2\times $ and the average execution time by up to $20\times $ than the raw ILP of the state of the art. Although less efficient with multicast flows, the heuristic algorithm achieves similar reliability as the ILP, and its worst-case recovery time is below $100ms$ on an embedded ARM processor.
- Subjects :
- times-sensitive networking
Mean time between failures
General Computer Science
Multicast
Network reliability
Computer science
Reliability (computer networking)
Distributed computing
functional safety
General Engineering
TK1-9971
Scheduling (computing)
Redundancy (engineering)
vehicle networks
run-time recovery
General Materials Science
Electrical engineering. Electronics. Nuclear engineering
Software-defined networking
Time complexity
Integer programming
Subjects
Details
- ISSN :
- 21693536
- Volume :
- 9
- Database :
- OpenAIRE
- Journal :
- IEEE Access
- Accession number :
- edsair.doi.dedup.....19fe68a339a39eab5be56ebf7f4d37fc