Back to Search
Start Over
Large-scale assessment of consistency in sleep stage scoring rules among multiple sleep centers using an interpretable machine learning algorithm
- Source :
- J Clin Sleep Med
- Publication Year :
- 2021
- Publisher :
- American Academy of Sleep Medicine (AASM), 2021.
-
Abstract
- STUDY OBJECTIVES: Polysomnography is the gold standard in identifying sleep stages; however, there are discrepancies in how technicians use the standards. Because organizing meetings to evaluate this discrepancy and/or reach a consensus among multiple sleep centers is time-consuming, we developed an artificial intelligence system to efficiently evaluate the reliability and consistency of sleep scoring and hence the sleep center quality. METHODS: An interpretable machine learning algorithm was used to evaluate the interrater reliability (IRR) of sleep stage annotation among sleep centers. The artificial intelligence system was trained to learn raters from 1 hospital and was applied to patients from the same or other hospitals. The results were compared with the experts’ annotation to determine IRR. Intracenter and intercenter assessments were conducted on 679 patients without sleep apnea from 6 sleep centers in Taiwan. Centers with potential quality issues were identified by the estimated IRR. RESULTS: In the intracenter assessment, the median accuracy ranged from 80.3%–83.3%, with the exception of 1 hospital, which had an accuracy of 72.3%. In the intercenter assessment, the median accuracy ranged from 75.7%–83.3% when the 1 hospital was excluded from testing and training. The performance of the proposed method was higher for the N2, awake, and REM sleep stages than for the N1 and N3 stages. The significant IRR discrepancy of the 1 hospital suggested a quality issue. This quality issue was confirmed by the physicians in charge of the 1 hospital. CONCLUSIONS: The proposed artificial intelligence system proved effective in assessing IRR and hence the sleep center quality. CITATION: Liu G-R, Lin T-Y, Wu H-T, et al. Large-scale assessment of consistency in sleep stage scoring rules among multiple sleep centers using an interpretable machine learning algorithm. J Clin Sleep Med. 2021;17(2):159–166.
- Subjects :
- Pulmonary and Respiratory Medicine
Taiwan
Polysomnography
Machine learning
computer.software_genre
Machine Learning
03 medical and health sciences
Consistency (database systems)
0302 clinical medicine
Artificial Intelligence
parasitic diseases
Humans
Medicine
Sleep Stages
medicine.diagnostic_test
business.industry
Reproducibility of Results
Gold standard (test)
Scientific Investigations
Inter-rater reliability
030228 respiratory system
Neurology
Scale (social sciences)
Neurology (clinical)
Artificial intelligence
Sleep (system call)
Sleep
business
computer
Algorithms
030217 neurology & neurosurgery
Subjects
Details
- ISSN :
- 15509397 and 15509389
- Volume :
- 17
- Database :
- OpenAIRE
- Journal :
- Journal of Clinical Sleep Medicine
- Accession number :
- edsair.doi.dedup.....f9afddf9ede62989bf2ea71b4895cc77
- Full Text :
- https://doi.org/10.5664/jcsm.8820