Back to Search
Start Over
Blind bleed-through removal for scanned historical document image with conditional random fields
- Source :
- IEEE Transactions on Image Processing. 25:5702-5712
- Publication Year :
- 2016
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2016.
-
Abstract
- Scanned images of historical documents often suffer from bleed-through, which refers to the ink on one side seeping through the paper and appearing on the other side. In this paper, a new conditional random field (CRF)-based method is proposed to remove the bleed-through from the scanned images of historical images. The proposed method only requires the scanned image of one side, referred as a blind method. In general, the scanned historical document image is composed of three components: foreground, bleed-through, and background. By assuming Gaussian distributions of the three components, the proposed method establishes conditional probability distribution (CPD) models of the three components first. The parameters of the component CPD models are estimated based on an initial segmentation of the input image. Then, CRFs are used to capture the relations between observed pixels in the scanned image and the corresponding labels as well as the spatial relation between the adjacent labels. The belief propagation algorithm is used to calculate the probabilities of different labels for each pixel. Once the labeling is completed by choosing the most possible label for each pixel, the bleed-through component is removed from the input historical image by a random-filling inpainting algorithm. Experimental results on the real data set show that the proposed method preserves the foreground component very well and removes the bleed-through effectively.
- Subjects :
- Conditional random field
Pixel
business.industry
Inpainting
020207 software engineering
Pattern recognition
02 engineering and technology
Image segmentation
Conditional probability distribution
Belief propagation
Computer Graphics and Computer-Aided Design
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Computer vision
Artificial intelligence
Hidden Markov model
business
Software
Historical document
Mathematics
Subjects
Details
- ISSN :
- 19410042 and 10577149
- Volume :
- 25
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Image Processing
- Accession number :
- edsair.doi.dedup.....2ddd8046da8f915dae94f06826c825fd