In multilingual societies, the challenge of accessing printed information remains acute for visually impaired individuals and non-native English speakers. This work introduces a sophisticated and extensible framework that unites advanced Optical Character Recognition (OCR) algorithms and neural machine translation, specifically engineered for English-to-Tamil conversion. In order to elevate recognition reliability, the methodology incorporates enhanced preprocessing techniques including adaptive thresholding and robust noise reduction algorithms, critically improving performance across diverse document types. Architecturally, the system integrates recent advances in character recognition, leveraging the open-source Tesseract engine and contemporary sequence-to-sequence translation models, with modularity for expansion to additional languages and features. Experimental validation utilizes a diverse array of datasets and is substantiated by standard metrics such as BLEU for translation fidelity. Comparative analyses with recent commercial and academic benchmarks are provided, and results are presented with comprehensive statistical analysis. The entire pipeline is implemented for accessible deployment, emphasizing usability, performance, and extensibility. The findings confirm highly competitive OCR accuracy and translation reliability, with particular suitability for resource-constrained environments and social inclusion efforts.
Automated Multilingual Content Delivery for the Visually Impaired via AI-Driven Document Parsing
M. Selvaganapathy,N. Nishavithri,P. Prabakaran,R. Nithya,Kanimozhi Rajasekaran,A. B. Joice
Published 2025 in 2025 Fourth International Conference on Smart Technologies and Systems for Next Generation Computing (ICSTSN)
ABSTRACT
PUBLICATION RECORD
- Publication year
2025
- Venue
2025 Fourth International Conference on Smart Technologies and Systems for Next Generation Computing (ICSTSN)
- Publication date
2025-11-20
- Fields of study
Not labeled
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-14 of 14 references · Page 1 of 1
CITED BY
- No citing papers are available for this paper.
Showing 0-0 of 0 citing papers · Page 1 of 1