Pseudocode Generation from Source Code Using the BART Model

Anas Alokla,Walaa K. Gad,Waleed Nazih,M. Aref,Abdel-Badeeh M. Salem

Published 2022 in Mathematics

ABSTRACT

In the software development process, more than one developer may work on developing the same program and bugs in the program may be fixed by a different developer; therefore, understanding the source code is an important issue. Pseudocode plays an important role in solving this problem, as it helps the developer to understand the source code. Recently, transformer-based pre-trained models achieved remarkable results in machine translation, which is similar to pseudocode generation. In this paper, we propose a novel automatic pseudocode generation from the source code based on a pre-trained Bidirectional and Auto-Regressive Transformer (BART) model. We fine-tuned two pre-trained BART models (i.e., large and base) using a dataset containing source code and its equivalent pseudocode. In addition, two benchmark datasets (i.e., Django and SPoC) were used to evaluate the proposed model. The proposed model based on the BART large model outperforms other state-of-the-art models in terms of BLEU measurement by 15% and 27% for Django and SPoC datasets, respectively.

PUBLICATION RECORD

Publication year
2022
Venue
Mathematics
Publication date
2022-10-25
Fields of study
Not labeled
Identifiers
DOI 10.3390/math10213967
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

DLBT: Deep Learning-Based Transformer to Generate Pseudo-Code from Source Code
2022influential reference
Retrieval-Based Transformer Pseudocode Generation
2022influential reference
SPT-Code: Sequence-to-Sequence Pre-Training for Learning Source Code Representations
2022cited by this paper
Modeling Hierarchical Syntax Structure with Triplet Position for Source Code Summarization
2022cited by this paper
Fine-grained Pseudo-code Generation Method via Code Feature Extraction and Transformer
2021cited by this paper
GraphCodeBERT: Pre-training Code Representations with Data Flow
2020cited by this paper
Neural Machine Translation
2020cited by this paper
Language Models are Few-Shot Learners
2020cited by this paper
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
2020cited by this paper
Retrieval-based Neural Source Code Summarization
2020cited by this paper
A new approach for the vanishing gradient problem on sigmoid activation
2020cited by this paper
5分で分かる!? 有名論文ナナメ読み：Jacob Devlin et al. : BERT : Pre-training of Deep Bidirectional Transformers for Language Understanding
2020influential reference
From Code to Natural Language: Type-Aware Sketch-Based Seq2Seq Learning
2020influential reference
Learning-Based Recursive Aggregation of Abstract Syntax Trees for Code Clone Detection
2019cited by this paper
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
2019cited by this paper
Revisiting Low-Resource Neural Machine Translation: A Case Study
2019cited by this paper
SPoC: Search-based Pseudocode to Code
2019influential reference
Generation of Pseudo Code from the Python Source Code using Rule-Based Machine Translation
2019influential reference
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
2019cited by this paper
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
2019cited by this paper
Improving Language Understanding by Generative Pre-Training
2018cited by this paper
Measuring Program Comprehension: A Large-Scale Field Study with Professionals
2018cited by this paper
A Structured Review of the Validity of BLEU
2018cited by this paper
Guiding Neural Machine Translation with Retrieved Translation Pieces
2018cited by this paper
Generating Pseudo-Code from Source Code Using Deep Learning
2018cited by this paper
SMT vs NMT: A Comparison over Hindi and Bengali Simple Sentences
2018cited by this paper
Attention is All you Need
2017cited by this paper
Inductive Representation Learning on Large Graphs
2017cited by this paper
Incorporating Copying Mechanism in Sequence-to-Sequence Learning
2016cited by this paper
Language Modeling with Gated Convolutional Networks
2016cited by this paper
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2016cited by this paper
Deep Residual Learning for Image Recognition
2015influential reference
Learning to Generate Pseudo-Code from Source Code Using Statistical Machine Translation (T)
2015cited by this paper
Statistical Machine Translation
2014cited by this paper
Understanding the exploding gradient problem
2012cited by this paper
Long Short-Term Memory
1997cited by this paper
Program Comprehension During Software Maintenance and Evolution
1995cited by this paper

CITED BY

Hybrid Graph-Transformer Framework for Intelligent Pseudocode to Source Code Synthesis
2025cites this paper
HYLR-FO: Hybrid Approach Using Language Models and Rule-Based Systems for On-Device Food Ordering
2025cites this paper
Automatic Paraphrase Generation at Phrasal, and Sentence Level for Urdu Language: Data and Methods
2025cites this paper
Beyond SEO: A Transformer-Based Approach for Reinventing Web Content Optimisation
2025cites this paper
Beyond Traditional Algorithms: Leveraging LLMs for Accurate Cross-Border Entity Identification
2025cites this paper
Code Semantic Zooming
2025cites this paper
Guiding ChatGPT for Better Code Generation: An Empirical Study
2024cites this paper
Text Summarization of Batak Toba Language Using Bidirectional Auto-Regressive Transformers Model
2024cites this paper
Design of an efficient Transformer-XL model for enhanced pseudo code to Python code conversion
2024cites this paper
Generative AI and future education: a review, theoretical validation, and authors’ perspective on challenges and solutions
2024cites this paper
Transformer technology in molecular science
2024cites this paper
Generating Headlines from Article Summaries Using Transformer Models
2024cites this paper
Improving ChatGPT Prompt for Code Generation
2023influential citation