CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Efficient document image binarization using heterogeneous computing and parameter tuning
Blekinge Tekniska Högskola, Institutionen för datalogi och datorsystemteknik.ORCID iD: 0000-0002-2161-7371
Blekinge Tekniska Högskola, Institutionen för datalogi och datorsystemteknik.ORCID iD: 0000-0001-9947-1088
Jönköping University, School of Engineering, JTH, Computer Science and Informatics, JTH, Jönköping AI Lab (JAIL). Blekinge Tekniska Högskola, Institutionen för datalogi och datorsystemteknik.ORCID iD: 0000-0002-0535-1761
2018 (English)In: International Journal on Document Analysis and Recognition, ISSN 1433-2833, E-ISSN 1433-2825, Vol. 21, no 1-2, p. 41-58Article in journal (Refereed) Published
Abstract [en]

In the context of historical document analysis, image binarization is a first important step, which separates foreground from background, despite common image degradations, such as faded ink, stains, or bleed-through. Fast binarization has great significance when analyzing vast archives of document images, since even small inefficiencies can quickly accumulate to years of wasted execution time. Therefore, efficient binarization is especially relevant to companies and government institutions, who want to analyze their large collections of document images. The main challenge with this is to speed up the execution performance without affecting the binarization performance. We modify a state-of-the-art binarization algorithm and achieve on average a 3.5 times faster execution performance by correctly mapping this algorithm to a heterogeneous platform, consisting of a CPU and a GPU. Our proposed parameter tuning algorithm additionally improves the execution time for parameter tuning by a factor of 1.7, compared to previous parameter tuning algorithms. We see that for the chosen algorithm, machine learning-based parameter tuning improves the execution performance more than heterogeneous computing, when comparing absolute execution times. © 2018 The Author(s)

Place, publisher, year, edition, pages
Springer, 2018. Vol. 21, no 1-2, p. 41-58
Keywords [en]
Automatic parameter tuning, Heterogeneous computing, Historical documents, Image binarization, Bins, History, Image analysis, Learning systems, Document image binarization, Government institutions, Heterogeneous platforms, Parameter tuning algorithm, Parameter estimation
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hj:diva-42993DOI: 10.1007/s10032-017-0293-7ISI: 000433193500003Scopus ID: 2-s2.0-85041228615OAI: oai:DiVA.org:hj-42993DiVA, id: diva2:1288959
Available from: 2019-02-15 Created: 2019-02-15 Last updated: 2019-08-20Bibliographically approved

Open Access in DiVA

fulltext(1264 kB)216 downloads
File information
File name FULLTEXT01.pdfFile size 1264 kBChecksum SHA-512
481d903fc9c988715e1cc1a3206ab64c8c6daccfd158c4a71ab8ff5df016e59e80910d6f16b8e7af5b0dc5e93866a852a46b9e803dbbb400a5f58f97dc208a15
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Westphal, FlorianGrahn, HåkanLavesson, Niklas

Search in DiVA

By author/editor
Westphal, FlorianGrahn, HåkanLavesson, Niklas
By organisation
JTH, Jönköping AI Lab (JAIL)
In the same journal
International Journal on Document Analysis and Recognition
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 216 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 315 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf