Back to Search
Start Over
Chemometrics models for overcoming high between subject variability: applications in clinical metabolic profiling studies
- Source :
- Metabolomics. 10:375-385
- Publication Year :
- 2013
- Publisher :
- Springer Science and Business Media LLC, 2013.
-
Abstract
- In human metabolic profiling studies, between-subject variability is often the dominant feature and can mask the potential classifications of clinical interest. Conventional models such as principal component analysis (PCA) are usually not effective in such situations and it is therefore highly desirable to find a suitable model which is able to discover the underlying pattern hidden behind the high between-subject variability. In this study we employed two clinical metabolomics data sets as the testing grounds, in which such variability had been observed, and we demonstrate that a proper choice of chemometrics model can help to overcome this issue of high between-subject variability. Two data sets were used to represent two different types of experiment designs. The first data set was obtained from a small-scale study investigating volatile organic compounds (VOCs) collected from chronic wounds using a skin patch device and analysed by thermal desorption-gas chromatography-mass spectrometry. Five patients were recruited and for each patient three sites sampled in triplicate: healthy skin, boundary of the lesion and top of the lesion, the aim was to discriminate these three types of samples based on their VOC profile. The second data set was from a much larger study involving 35 healthy subjects, 47 patients with chronic obstructive pulmonary disease and 33 with asthma. The VOCs in the breath of each subject were collected using a mask device and analysed again by GC–MS with the aim of discriminating the three types of subjects based on breath VOC profiles. Multilevel simultaneous component analysis, multilevel partial least squares for discriminant analysis, ANOVA-PCA, and a novel simplified ANOVA-PCA model—which we have named ANOVA-Mean Centre (ANOVA-MC)—were applied on these two data sets. Significantly improved results were obtained by using these models. We also present a novel validation procedure to verify statistically the results obtained from those models.
- Subjects :
- Computer science
Endocrinology, Diabetes and Metabolism
Clinical Biochemistry
computer.software_genre
Linear discriminant analysis
Biochemistry
Skin patch
Data set
Chemometrics
Breath gas analysis
ANOVA–simultaneous component analysis
Principal component analysis
Partial least squares regression
Data mining
computer
Subjects
Details
- ISSN :
- 15733890 and 15733882
- Volume :
- 10
- Database :
- OpenAIRE
- Journal :
- Metabolomics
- Accession number :
- edsair.doi...........0063a214108b6e12d0410731ea6d3487
- Full Text :
- https://doi.org/10.1007/s11306-013-0616-8