TY - JOUR
T1 - Bayesian-Optimised Latent Encoding and Agent-Based Simulation for Enhanced Medical Image Character Recognition
AU - Osagie, Efosa
AU - Ji, Wei
AU - Helian, Na
PY - 2025/11/19
Y1 - 2025/11/19
N2 - This paper presents a Bayesian-optimised Conditional Variational Autoencoder (CVAE) for synthetic data augmentation, embedded within an agent-based simulation framework. The CVAE systematically refines latent-space representations, generating high-quality synthetic character images that enhance dataset diversity and reduce the risk of overfitting. Bayesian optimisation ensures optimal latent variable selection, improving reconstruction accuracy while enabling scalable Medical Image Character Recognition (MICR) training. The proposed agent-based system introduces autonomous agents: patient agents, doctor agents, imaging device agents, and recognition agents that collaborate to simulate real-world MICR workflows. This structured pipeline enables dynamic dataset augmentation while supporting medical diagnostics and automated text extraction. Experimental evaluations demonstrate significant performance improvements, with CNN models achieving accuracy gains of +3.2%, +3.5%, and +1.79% on the public dataset and +2.41%, +6.85%, and +1.60% on the private dataset when augmented with 50, 100, and 150 synthetic images per class, respectively. This research validates the effectiveness of Bayesian-tuned latent-space encoding and a supporting agent-based data augmentation, offering a scalable, computationally efficient solution for MICR enhancement.
AB - This paper presents a Bayesian-optimised Conditional Variational Autoencoder (CVAE) for synthetic data augmentation, embedded within an agent-based simulation framework. The CVAE systematically refines latent-space representations, generating high-quality synthetic character images that enhance dataset diversity and reduce the risk of overfitting. Bayesian optimisation ensures optimal latent variable selection, improving reconstruction accuracy while enabling scalable Medical Image Character Recognition (MICR) training. The proposed agent-based system introduces autonomous agents: patient agents, doctor agents, imaging device agents, and recognition agents that collaborate to simulate real-world MICR workflows. This structured pipeline enables dynamic dataset augmentation while supporting medical diagnostics and automated text extraction. Experimental evaluations demonstrate significant performance improvements, with CNN models achieving accuracy gains of +3.2%, +3.5%, and +1.79% on the public dataset and +2.41%, +6.85%, and +1.60% on the private dataset when augmented with 50, 100, and 150 synthetic images per class, respectively. This research validates the effectiveness of Bayesian-tuned latent-space encoding and a supporting agent-based data augmentation, offering a scalable, computationally efficient solution for MICR enhancement.
U2 - 10.38124/ijsrmt.v4i11.965
DO - 10.38124/ijsrmt.v4i11.965
M3 - Article
SN - 2583-4622
VL - 4
SP - 84
EP - 94
JO - International Journal of Scientific Research and Modern Technology
JF - International Journal of Scientific Research and Modern Technology
IS - 11
ER -