CodeAttack: Code-based Adversarial Attacks for Pre-Trained Programming Language Models

Published 2022 in AAAI Conference on Artificial Intelligence

ABSTRACT

Pre-trained programming language (PL) models (such as CodeT5, CodeBERT, and GraphCodeBERT) have the potential to automate software engineering tasks involving code understanding and code generation. However, these models operate in the natural channel of code, i.e., they are primarily concerned with the human understanding of code. They are not robust to changes in the input and are thus potentially susceptible to adversarial attacks in the natural channel. We propose CodeAttack, a simple yet effective black-box attack model that uses code structure to generate effective, efficient, and imperceptible adversarial code samples and demonstrates the vulnerabilities of state-of-the-art PL models to code-specific adversarial attacks. We evaluate the transferability of CodeAttack on several code-code (translation and repair) and code-NL (summarization) tasks across different programming languages. CodeAttack outperforms state-of-the-art adversarial NLP attack models, achieving the best overall drop in performance while being more efficient, imperceptible, consistent, and fluent. The code can be found at https://github.com/reddy-lab-code-research/CodeAttack.
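
To make the idea of a black-box, structure-aware attack concrete, the following is a minimal sketch and not the authors' implementation. It assumes a generic greedy scheme: query the victim only through a scoring function, rank identifier tokens by how much masking them lowers the score, then substitute a small number of them while leaving keywords and punctuation untouched. The names toy_score, rank_positions, greedy_attack, and the candidate list are hypothetical, and toy_score is a stand-in for a real PL model's output quality.

```python
import keyword
import re
from typing import Callable, List, Sequence

# Crude tokenizer (assumption): identifiers, integers, or single symbols.
TOKEN_RE = re.compile(r"[A-Za-z_]\w*|\d+|\S")


def tokenize(code: str) -> List[str]:
    return TOKEN_RE.findall(code)


def is_identifier(tok: str) -> bool:
    # "Code structure" here is reduced to one rule: only non-keyword
    # identifiers may be perturbed; keywords and operators stay fixed.
    return tok.isidentifier() and not keyword.iskeyword(tok)


def rank_positions(tokens: Sequence[str],
                   score: Callable[[Sequence[str]], float]) -> List[int]:
    """Order identifier positions by the score drop caused by masking them."""
    base = score(tokens)
    drops = []
    for i, tok in enumerate(tokens):
        if is_identifier(tok):
            masked = list(tokens[:i]) + ["<unk>"] + list(tokens[i + 1:])
            drops.append((base - score(masked), i))
    drops.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for _, i in drops]


def greedy_attack(tokens: Sequence[str],
                  score: Callable[[Sequence[str]], float],
                  candidates: Sequence[str],
                  max_changes: int = 2) -> List[str]:
    """Greedily replace the most influential identifiers, within a budget."""
    adv = list(tokens)
    changes = 0
    for i in rank_positions(tokens, score):
        if changes >= max_changes:
            break
        best_tok, best_score = adv[i], score(adv)
        for cand in candidates:
            trial = adv[:i] + [cand] + adv[i + 1:]
            trial_score = score(trial)
            if trial_score < best_score:
                best_tok, best_score = cand, trial_score
        if best_tok != adv[i]:
            adv[i] = best_tok
            changes += 1
    return adv


def toy_score(tokens: Sequence[str]) -> float:
    # Hypothetical victim: rewards "informative" names. A real attack
    # would query a PL model (e.g. a task metric on its output) instead.
    informative = {"add", "total"}
    return sum(t in informative for t in tokens) / len(tokens)


tokens = tokenize("def add(a, b): total = a + b; return total")
adversarial = greedy_attack(tokens, toy_score, candidates=["x", "tmp"])
print(" ".join(adversarial))
```

Note that this sketch substitutes individual token occurrences, so it does not preserve program semantics; a faithful attack would add constraints to keep the perturbed code valid and close to the original.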

PUBLICATION RECORD

Publication year
2022
Venue
AAAI Conference on Artificial Intelligence
Publication date
2022-05-31
Fields of study
Computer Science
Identifiers
DOI: 10.48550/arXiv.2206.00052
arXiv: 2206.00052
Source metadata
Semantic Scholar
