Back to Search
Start Over
Semi-supervised Feature Learning For Improving Writer Identification
- Source :
- Information Sciences (Volume 482, May 2019, Pages 156-170)
- Publication Year :
- 2018
-
Abstract
- Data augmentation is usually used by supervised learning approaches for offline writer identification, but such approaches require extra training data and potentially lead to overfitting errors. In this study, a semi-supervised feature learning pipeline was proposed to improve the performance of writer identification by training with extra unlabeled data and the original labeled data simultaneously. Specifically, we proposed a weighted label smoothing regularization (WLSR) method for data augmentation, which assigned the weighted uniform label distribution to the extra unlabeled data. The WLSR method could regularize the convolutional neural network (CNN) baseline to allow more discriminative features to be learned to represent the properties of different writing styles. The experimental results on well-known benchmark datasets (ICDAR2013 and CVL) showed that our proposed semi-supervised feature learning approach could significantly improve the baseline measurement and perform competitively with existing writer identification approaches. Our findings provide new insights into offline write identification.<br />Comment: This manuscript is submitting to Information Science
- Subjects :
- Computer Science - Machine Learning
Statistics - Machine Learning
Subjects
Details
- Database :
- arXiv
- Journal :
- Information Sciences (Volume 482, May 2019, Pages 156-170)
- Publication Type :
- Report
- Accession number :
- edsarx.1807.05490
- Document Type :
- Working Paper
- Full Text :
- https://doi.org/10.1016/j.ins.2019.01.024