Abstract
To improve the results of automatic handwritten text recognition, information about the language is commonly included in the recognition process. A common approach represents a text line as a sequence that is processed in one direction, with language information in the form of n-grams included directly in the decoding. This approach, however, uses context on only one side to estimate a word’s probability. We therefore propose bidirectional recognition using distinct forward and backward language models. By combining decoding hypotheses from both directions, we achieve a significant increase in recognition accuracy for the off-line, writer-independent handwriting recognition task. Both language models are of the same type and can be estimated on the same corpus. Hence, the increase in recognition accuracy comes without additional training data or added language-modeling complexity.
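As a rough illustration of the statement that both language models are of the same type and can be estimated on the same corpus, the sketch below trains a forward and a backward bigram model from one toy corpus, where the backward model simply reads each line right to left. The toy corpus, the function names `train_bigram` and `cond_prob`, and the add-one smoothing are assumptions made for this example and are not taken from the paper. How the decoding hypotheses from the two directions are combined is not specified in the abstract and is not shown here.

```python
from collections import Counter

BOS, EOS = "<s>", "</s>"  # boundary markers (illustrative choice)


def train_bigram(sentences, reverse=False):
    """Collect bigram statistics from tokenized sentences.

    With reverse=True every sentence is read right to left, so the
    resulting model conditions each word on the word that FOLLOWS it
    in the original text. BOS then marks the end of the original line.
    """
    contexts, bigrams, vocab = Counter(), Counter(), set()
    for words in sentences:
        seq = list(reversed(words)) if reverse else list(words)
        seq = [BOS] + seq + [EOS]
        vocab.update(seq)
        for prev, cur in zip(seq, seq[1:]):
            contexts[prev] += 1
            bigrams[(prev, cur)] += 1
    return contexts, bigrams, vocab


def cond_prob(model, context, word):
    """Add-one smoothed estimate of P(word | context)."""
    contexts, bigrams, vocab = model
    return (bigrams[(context, word)] + 1) / (contexts[context] + len(vocab))


# Hypothetical toy corpus; a real system would use a large text corpus.
corpus = [
    "the cat sat".split(),
    "the dog sat".split(),
    "a cat ran".split(),
]

# Same corpus, same model type, opposite reading direction.
forward = train_bigram(corpus)
backward = train_bigram(corpus, reverse=True)

# Forward model: probability of "cat" given the word to its left.
p_given_left = cond_prob(forward, "the", "cat")
# Backward model: probability of "cat" given the word to its right.
p_given_right = cond_prob(backward, "sat", "cat")

print(f"P(cat | left = the)  = {p_given_left:.3f}")
print(f"P(cat | right = sat) = {p_given_right:.3f}")
```

The only difference between the two models is the order in which each line is read, which is why no second corpus or second model type is needed in this sketch.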
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Frinken, V., Fornés, A., Lladós, J., Ogier, J.-M. (2012). Bidirectional Language Model for Handwriting Recognition. In: Gimel’farb, G., et al. Structural, Syntactic, and Statistical Pattern Recognition. SSPR/SPR 2012. Lecture Notes in Computer Science, vol 7626. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34166-3_67
DOI: https://doi.org/10.1007/978-3-642-34166-3_67
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34165-6
Online ISBN: 978-3-642-34166-3
eBook Packages: Computer Science, Computer Science (R0)