1. Global peak alignment for comprehensive two-dimensional gas chromatography mass spectrometry using point matching algorithms
- Author
-
Seongho Kim, Hengguang Li, Elisabeth I. Heath, Xiang Zhang, and Beichuan Deng
- Subjects
Smith–Waterman algorithm ,Chromatography ,Matching (graph theory) ,010401 analytical chemistry ,Temperature ,Point set registration ,Signal Processing, Computer-Assisted ,02 engineering and technology ,01 natural sciences ,Biochemistry ,Projection (linear algebra) ,Gas Chromatography-Mass Spectrometry ,Article ,0104 chemical sciences ,Computer Science Applications ,Transformation (function) ,Feature (computer vision) ,Data structure alignment ,0202 electrical engineering, electronic engineering, information engineering ,Two-dimensional gas ,020201 artificial intelligence & image processing ,Molecular Biology ,Algorithm ,Algorithms ,Mathematics - Abstract
Comprehensive two-dimensional gas chromatography coupled with mass spectrometry (GC[Formula: see text][Formula: see text][Formula: see text]GC-MS) has been used to analyze multiple samples in a metabolomics study. However, due to some uncontrollable experimental conditions, such as the differences in temperature or pressure, matrix effects on samples and stationary phase degradation, there is always a shift of retention times in the two GC columns between samples. In order to correct the retention time shifts in GC[Formula: see text][Formula: see text][Formula: see text]GC-MS, the peak alignment is a crucial data analysis step to recognize the peaks generated by the same metabolite in different samples. Two approaches have been developed for GC[Formula: see text][Formula: see text][Formula: see text]GC-MS data alignment: profile alignment and peak matching alignment. However, these existing alignment methods are all based on a local alignment, resulting that a peak may not be correctly aligned in a dense chromatographic region where many peaks are present in a small region. False alignment will result in false discovery in the downstream statistical analysis. We, therefore, develop a global comparison-based peak alignment method using point matching algorithm (PMA-PA) for both homogeneous and heterogeneous data. The developed algorithm PMA-PA first extracts feature points (peaks) in the chromatography and then searches globally the matching peaks in the consecutive chromatography by adopting the projection of rigid and nonrigid transformation. PMA-PA is further applied to two real experimental data sets, showing that PMA-PA is a promising peak alignment algorithm for both homogenous and heterogeneous data in terms of [Formula: see text]1 score, although it uses only peak location information.
- Published
- 2016