Enhancing Neural Arabic Machine Translation using Character-Level CNN-BILSTM and Hybrid Attention

D. E. Messaoudi, D. Nessah

Published 2024 in Engineering, Technology & Applied Science Research

ABSTRACT

Neural Machine Translation (NMT) has made significant strides in recent years, especially with the advent of deep learning, which has greatly enhanced performance across various Natural Language Processing (NLP) tasks. Despite these advances, NMT still falls short of perfect translation, facing ongoing challenges such as limited training data, handling rare words, and managing syntactic and semantic dependencies. This study introduces a multichannel character-level NMT model with hybrid attention for Arabic-English translation. The proposed approach addresses issues such as rare words and word alignment by encoding characters, incorporating Arabic word segmentation as handcrafted features, and using part-of-speech tagging in a multichannel CNN-BiLSTM encoder. The model then uses a BiLSTM decoder with hybrid attention to generate target-language sentences. The proposed model was tested on a subset of the OPUS-100 dataset, achieving promising results.
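The abstract does not spell out how the hybrid attention in the decoder is computed, but a common formulation blends content-based scores (matching the decoder state against encoder states) with location-based scores (derived from the previous step's attention weights). The sketch below illustrates that idea in NumPy; the function name `hybrid_attention` and the parameter matrices `W_c` and `W_l` are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D score vector."""
    e = np.exp(x - np.max(x))
    return e / e.sum()

def hybrid_attention(decoder_state, encoder_states, prev_weights, W_c, W_l):
    """Toy hybrid attention step (illustrative, not the paper's exact method).

    Blends a content-based score (bilinear match between the decoder
    state and each encoder state) with a location-based score (a linear
    transform of the previous step's attention weights).

    Shapes: decoder_state (d,), encoder_states (T, d),
            prev_weights (T,), W_c (d, d), W_l (T, T).
    Returns: attention weights (T,) and the context vector (d,).
    """
    # Content-based score for each source position.
    content = encoder_states @ (W_c @ decoder_state)   # (T,)
    # Location-based score from the previous attention distribution.
    location = W_l @ prev_weights                      # (T,)
    # Combine both channels and normalize into a distribution.
    weights = softmax(content + location)              # (T,)
    # Context vector: attention-weighted sum of encoder states.
    context = weights @ encoder_states                 # (d,)
    return weights, context
```

In a full model the context vector would be concatenated with the decoder's BiLSTM state to predict the next target character or token; here only the attention step itself is shown.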

PUBLICATION RECORD

  • Publication year: 2024
  • Publication date: 2024-10-09
  • Venue: Engineering, Technology & Applied Science Research
  • Source metadata: Semantic Scholar
