DOI: 10.1145/2595188.2595199
research-article

A bimodal crowdsourcing platform for demographic historical manuscripts

Published: 19 May 2014

ABSTRACT

In this paper we present a web-based crowdsourcing application for extracting information from images of handwritten demographic documents. The proposed application integrates two points of view: semantic information for demographic research, and ground-truthing for document analysis research. Concretely, the application has a contents view, where the information is recorded into forms, and a labeling view, which provides the word labels for evaluating document analysis techniques. The crowdsourcing architecture makes it possible to accelerate information extraction (many users can work simultaneously), validate the information, and easily provide feedback to the users. Finally, we show how the proposed application can be extended to other kinds of demographic historical manuscripts.

Published in

DATeCH '14: Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage
May 2014
200 pages
ISBN: 9781450325882
DOI: 10.1145/2595188

Copyright © 2014 ACM


Publisher: Association for Computing Machinery, New York, NY, United States


Acceptance Rates

DATeCH '14 Paper Acceptance Rate: 31 of 49 submissions, 63%
Overall Acceptance Rate: 60 of 86 submissions, 70%
