Back to Search
Start Over
Synthetic Biological Signals Machine-Generated by GPT-2 Improve the Classification of EEG and EMG Through Data Augmentation
- Source :
- IEEE Robotics and Automation Letters. 6:3498-3504
- Publication Year :
- 2021
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2021.
-
Abstract
- Synthetic data augmentation is of paramount importance for machine learning classification, particularly for biological data, which tend to be high dimensional and with a scarcity of training samples. The applications of robotic control and augmentation in disabled and able-bodied subjects still rely mainly on subject-specific analyses. Those can rarely be generalised to the whole population and appear to over complicate simple action recognition such as grasp and release (standard actions in robotic prosthetics and manipulators). We show for the first time that multiple GPT-2 models can machine-generate synthetic biological signals (EMG and EEG) and improve real data classification. Models trained solely on GPT-2 generated EEG data can classify a real EEG dataset at 74.71% accuracy and models trained on GPT-2 EMG data can classify real EMG data at 78.24% accuracy. Synthetic and calibration data are then introduced within each cross validation fold when benchmarking EEG and EMG models. Results show algorithms are improved when either or both additional data are used. A Random Forest achieves a mean 95.81% (1.46) classification accuracy of EEG data, which increases to 96.69% (1.12) when synthetic GPT-2 EEG signals are introduced during training. Similarly, the Random Forest classifying EMG data increases from 93.62% (0.8) to 93.9% (0.59) when training data is augmented by synthetic EMG signals. Additionally, as predicted, augmentation with synthetic biological signals also increases the classification accuracy of data from new subjects that were not observed during training. A Robotiq 2F-85 Gripper was finally used for real-time gesture-based control, with synthetic EMG data augmentation remarkably improving gesture recognition accuracy, from 68.29% to 89.5%.
- Subjects :
- Control and Optimization
Computer science
Data classification
Population
Biomedical Engineering
02 engineering and technology
Synthetic data
Cross-validation
Data modeling
03 medical and health sciences
0302 clinical medicine
Artificial Intelligence
0202 electrical engineering, electronic engineering, information engineering
education
Biological data
education.field_of_study
Training set
business.industry
Mechanical Engineering
Pattern recognition
Computer Science Applications
Random forest
Human-Computer Interaction
Statistical classification
Control and Systems Engineering
Gesture recognition
020201 artificial intelligence & image processing
Computer Vision and Pattern Recognition
Artificial intelligence
business
030217 neurology & neurosurgery
Subjects
Details
- ISSN :
- 23773774 and 23773766
- Volume :
- 6
- Database :
- OpenAIRE
- Journal :
- IEEE Robotics and Automation Letters
- Accession number :
- edsair.doi.dedup.....2b1e93499310e0ca8c90d7809a9a903a
- Full Text :
- https://doi.org/10.1109/lra.2021.3056355