Medical Image Character Recognition Using Attention-Based Siamese Networks for Visually Similar Characters with Low Resolution

Efosa Osagie, Wei Ji, Na Helian

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The emergence of optical character recognition (OCR) has been adopted in many domains to automate various tasks. Still, recognising visually similar characters (VSC) remains a challenging problem in the general OCR domain. Applying conventional class probability predictions by deep learning techniques may be difficult due to the limited datasets in some domains, such as medical imaging modalities. VSC recognition becomes more complicated due to the image’s low resolution and background interference. With advancements in computing power and numerical methods, techniques such as the few-shot method have been proposed to tackle the limited sample problems in training deep learning models. Still, very little work has been done regarding designing an OCR solution to deal with tiny burnt-textual data on low-resolution images with background interference while training on small samples per class. In this study, we propose an Attention-based Siamese Network to accurately recognise VSC by efficiently learning the semantic similarities between the extracted embeddings from the input images. The learned similarities and attention-focused feature extraction layer enable the proposed model to discriminate between different character classes efficiently, with only small samples available. Bayesian optimisation is used to determine optimal network parameters. We aim to set a benchmark for the performance of the Siamese network in OCR in medical image character recognition in terms of parameter size and accuracy at a determined sample size.
Original languageEnglish
Title of host publicationProceedings of the 3rd International Conference on Innovations in Computing Research (ICR’24)
EditorsKevin Daimi, Abeer Al Sadoon
PublisherSpringer Nature
Pages110–119
Number of pages10
ISBN (Electronic)978-3-031-65522-7
ISBN (Print)978-3-031-65521-0
DOIs
Publication statusPublished - 1 Aug 2024
EventThird International Conference on Innovations in Computing Research (ICR’24) - Athens, Greece
Duration: 12 Aug 202414 Aug 2024
https://iicser.org/icr24/

Publication series

NameLecture Notes in Networks and Systems
PublisherSpringer
Volume1058 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

ConferenceThird International Conference on Innovations in Computing Research (ICR’24)
Abbreviated titleICR 2024
Country/TerritoryGreece
CityAthens
Period12/08/2414/08/24
Internet address

Keywords

  • Burned-in Textual data
  • Medical Image Character Recognition
  • Siamese network
  • few-shot
  • small datasets

Cite this