Document noise removal using sparse representations over learned dictionary

Authors:
Do Thanh-Ha, Université de Lorraine-LORIA UMR 7503, Campus scientifique - BP 239, Vandoeuvre-lès-Nancy, France
Salvatore Tabbone, Université de Lorraine-LORIA UMR 7503, Campus scientifique - BP 239, Vandoeuvre-lès-Nancy, France
Oriol Ramos Terrades, Computer Vision Centre 08193 Bellaterra (Cerdanyola), Barcelona, Spain

DocEng '13: Proceedings of the 2013 ACM symposium on Document engineering, September 2013, Pages 161–168
https://doi.org/10.1145/2494266.2494281

Published: 10 September 2013

ABSTRACT

In this paper, we propose an algorithm for denoising document images using sparse representations. From a training set, the algorithm learns the main characteristics of the documents as well as the kind of noise they contain. To this end, we propose to model the noise energy using the normalized cross-correlation between pairs of noisy and noise-free documents. Experimental results on several datasets demonstrate the robustness of our method compared with state-of-the-art methods.
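
The abstract names two building blocks: the normalized cross-correlation between a noisy and a noise-free document, and a sparse representation over a dictionary. The sketch below illustrates both in a minimal form, under stated assumptions. The function names `normalized_cross_correlation` and `omp` are hypothetical, the sparse coder is a plain orthogonal matching pursuit chosen for illustration, and the way the paper turns the correlation into a noise-energy model is not given in the abstract, so it is not reproduced here.

```python
import numpy as np


def normalized_cross_correlation(a, b):
    """Zero-mean normalized cross-correlation of two equally sized images.

    Returns a value in [-1, 1]; 1 means the images are identical up to
    brightness offset and positive contrast scaling.
    """
    a = np.asarray(a, dtype=float).ravel()
    b = np.asarray(b, dtype=float).ravel()
    if a.shape != b.shape:
        raise ValueError("images must have the same number of pixels")
    a = a - a.mean()
    b = b - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    if denom == 0.0:
        return 0.0  # a constant image carries no correlation information
    return float(a @ b / denom)


def omp(D, y, eps, max_atoms=None):
    """Greedy sparse coding (orthogonal matching pursuit).

    Adds dictionary atoms (columns of D, assumed unit-norm) one at a time
    until the residual norm ||y - D x||_2 drops to eps or below.
    eps plays the role of an assumed noise-energy tolerance.
    """
    D = np.asarray(D, dtype=float)
    y = np.asarray(y, dtype=float)
    n_atoms = D.shape[1]
    if max_atoms is None:
        max_atoms = n_atoms
    support = []
    coef = np.zeros(0)
    residual = y.copy()
    while np.linalg.norm(residual) > eps and len(support) < max_atoms:
        # Pick the atom most correlated with the current residual.
        k = int(np.argmax(np.abs(D.T @ residual)))
        if k in support:
            break  # no further progress possible
        support.append(k)
        # Re-fit all selected atoms jointly by least squares.
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
    x = np.zeros(n_atoms)
    if support:
        x[support] = coef
    return x
```

In a patch-based denoiser of this kind, each image patch would be coded with `omp` over a learned dictionary (the author tags mention K-SVD) and reconstructed as `D @ x`; a larger `eps` discards more of the patch as noise.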


Index Terms

1. Applied computing

Published in
DocEng '13: Proceedings of the 2013 ACM symposium on Document engineering
September 2013
582 pages
ISBN: 9781450317894
DOI: 10.1145/2494266
Conference Chair: Simone Marinai, University of Florence, Italy
Program Chair: Kim Marriott, Monash University, Australia
Copyright © 2013 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
k-svd
learned dictionary
noise suppression
normalized cross correlation
sparse representation
Qualifiers
- research-article
Acceptance Rates
DocEng '13 Paper Acceptance Rate: 16 of 50 submissions, 32%
Overall Acceptance Rate: 178 of 537 submissions, 33%