Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Learning character recognition with graph-based privileged information
Department of Computer Science, Blekinge Institute of Technology, Karlskrona, Sweden.
Jönköping University, School of Engineering, JTH, Department of Computing, Jönköping AI Lab (JAIL).ORCID iD: 0000-0002-0535-1761
Department of Computer Science, Blekinge Institute of Technology, Karlskrona, Sweden.
2019 (English)In: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, IEEE, 2019, p. 1163-1168Conference paper, Published paper (Refereed)
Abstract [en]

This paper proposes a pre-training method for neural network-based character recognizers to reduce the required amount of training data, and thus the human labeling effort. The proposed method transfers knowledge about the similarities between graph representations of characters to the recognizer by training to predict the graph edit distance. We show that convolutional neural networks trained with this method outperform traditional supervised learning if only ten or less labeled images per class are available. Furthermore, we show that our approach performs up to 33% better than a graph edit distance based recognition approach, even if only one labeled image per class is available. 

Place, publisher, year, edition, pages
IEEE, 2019. p. 1163-1168
Series
Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, ISSN 1520-5363, E-ISSN 2379-2140
Keywords [en]
Character recognition, Convolutional neural networks, Graph matching, Learning using privileged information, Convolution, Graphic methods, Graph edit distance, Graph matchings, Graph representation, Labeled images, Method transfers, Pre-training, Training data
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:hj:diva-47958DOI: 10.1109/ICDAR.2019.00188Scopus ID: 2-s2.0-85079896010ISBN: 9781728128610 (print)OAI: oai:DiVA.org:hj-47958DiVA, id: diva2:1413737
Conference
15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia, 20 - 25 September 2019
Funder
Knowledge Foundation, 20140032Available from: 2020-03-11 Created: 2020-03-11 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Lavesson, Niklas

Search in DiVA

By author/editor
Lavesson, Niklas
By organisation
Jönköping AI Lab (JAIL)
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 107 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf