Back to Search Start Over

Service-Oriented Reliability Modeling and Autonomous Optimization of Reliability for Public Cloud Computing Systems.

Authors :
Meng, Sa
Luo, Liang
Qiu, Xiwei
Dai, Yuanshun
Source :
IEEE Transactions on Reliability. Jun2022, Vol. 71 Issue 2, p527-538. 12p.
Publication Year :
2022

Abstract

The ubiquitous adoption of public clouds has resulted in the bloom of big data, Internet of Things, and artificial intelligence (AI) and a great capacity for applications. However, the efficient operation of these applications is challenging due to the infrastructure heterogeneity, computing hierarchy, scale distribution, and stochastic user behaviors. Among these challenges, reliability has received intense interest in the cloud community. Most existing research on the reliability of clouds is on fault discovery or fault tolerance. However, a public cloud has a wider range of failures, which makes it difficult to analyze the association between reliability and other indicators. More accurate reliability modeling methods and advanced optimization approaches are needed to ensure the reliable operation of a public cloud. To address these challenges, this article models the reliability of cloud computing from the service perspective and reasonably divides cloud services into the request processing phase and request execution phase when facing multiple users and service types. Moreover, this article combines a variety of AI-based methods, such as autonomous scheduling mechanisms, anomaly detection, and autonomous learning improvement, which can effectively adapt to the dynamic and complex cloud service environment and ensure the service reliability of a public cloud. Numerical results clearly indicate that the proposed service reliability model and autonomous optimizations are efficient for recovering the reliability of cloud computing systems in terms of multiple AI-based failure prognostics and intelligent learning ability. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00189529
Volume :
71
Issue :
2
Database :
Academic Search Index
Journal :
IEEE Transactions on Reliability
Publication Type :
Academic Journal
Accession number :
157258521
Full Text :
https://doi.org/10.1109/TR.2022.3154651