Neural Machine Translation tool from Spanish to English in the Medical Domain

Ariel Gordillo,Alvaro Uyaguari,Lucas Garces

Published 2022 in 2022 Third International Conference on Information Systems and Software Technologies (ICI2ST)

ABSTRACT

In Natural Language Processing (NLP), the scarcity of linguistic resources (labeled corpus, parallel corpus, pre-trained models, etc.) can lead to poor performance when applying machine learning models, however, this can be solved by applying cross-lingual approaches (machine translation, word alignment, multilingual embedding, etc.), which is a paradigm for transferring knowledge from one language with resources to another language with fewer resources. In the medical domain, there are also few resources in Spanish compared to English, due to economic, legal, and ethical issues. In this regard, there is little evidence of evaluation and optimization of machine translations from Spanish to English in the medical domain. For this purpose, a neural machine translation tool with an induced word alignment is generated in this research, on which different optimization parameters have been experimented with and applying various parallel corpora within the medical domain, as reference results with the corpora EMA with 15 epochs, a BLUE of 88.55 in English-Spanish and Scielo Spanish - English with 25 epochs, a BLEU of 53.74, being a differential in evaluation results to convolutional translators and even greatly outperforming the pre-trained Fairseq results.

PUBLICATION RECORD

  • Publication year

    2022

  • Venue

    2022 Third International Conference on Information Systems and Software Technologies (ICI2ST)

  • Publication date

    2022-11-01

  • Fields of study

    Not labeled

  • Identifiers
  • External record

    Open on Semantic Scholar

  • Source metadata

    Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

  • No claims are published for this paper.

CONCEPTS

  • No concepts are published for this paper.

REFERENCES

Showing 1-21 of 21 references · Page 1 of 1

CITED BY

  • No citing papers are available for this paper.

Showing 0-0 of 0 citing papers · Page 1 of 1