Decision SpikeFormer: Spike-Driven Transformer for Decision Making

Published 2025 in Computer Vision and Pattern Recognition

ABSTRACT

Offline reinforcement learning (RL) enables policy training solely on pre-collected data, avoiding direct environment interaction—a crucial benefit for energy-constrained embodied AI applications. Although Artificial Neural Networks (ANN)-based methods perform well in offline RL, their high computational and energy demands motivate exploration of more efficient alternatives. Spiking Neural Networks (SNNs) show promise for such tasks, given their low power consumption. In this work, we introduce DSFormer, the first spike-driven transformer model designed to tackle offline RL via sequence modeling. Unlike existing SNN transformers focused on spatial dimensions for vision tasks, we develop Temporal Spiking Self-Attention (TSSA) and Positional Spiking Self-Attention (PSSA) in DSFormer to capture the temporal and positional dependencies essential for sequence modeling in RL. Additionally, we propose Progressive Threshold-dependent Batch Normalization (PTBN), which combines the benefits of LayerNorm and BatchNorm to preserve temporal dependencies while maintaining the spiking nature of SNNs. Comprehensive results in the D4RL benchmark show DSFormer’s superiority over both SNN and ANN counterparts, achieving 78.4% energy savings, highlighting DSFormer’s advantages not only in energy efficiency but also in competitive performance. Code and models are public at project page.

PUBLICATION RECORD

Publication year
2025
Venue
Computer Vision and Pattern Recognition
Publication date
2025-04-04
Fields of study
Computer Science
Identifiers
DOI 10.1109/CVPR52734.2025.01792 arXiv 2504.03800
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

One-step Spiking Transformer with a Linear Complexity
2024cited by this paper
Directly Training Temporal Spiking Neural Network with Sparse Surrogate Gradient
2024cited by this paper
SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms
2024cited by this paper
Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning
2024cited by this paper
Spiking Convolutional Neural Networks for Text Classification
2024cited by this paper
Q-value Regularized Transformer for Offline Reinforcement Learning
2024cited by this paper
SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization
2024cited by this paper
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips
2024cited by this paper
QKFormer: Hierarchical Spiking Transformer using Q-K Attention
2024cited by this paper
Imitation Learning: A Survey of Learning Methods, Environments and Metrics
2024cited by this paper
SNN-BERT: Training-efficient Spiking Neural Networks for energy-efficient BERT
2024cited by this paper
Brain-Inspired Computing: A Systematic Survey and Future Trends
2024cited by this paper
Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection
2024cited by this paper
Enhancing the Performance of Transformer-based Spiking Neural Networks by SNN-optimized Downsampling with Precise Gradient Backpropagation
2023cited by this paper
Spikingformer: Spike-driven Residual Learning for Transformer-based Spiking Neural Network
2023cited by this paper
Offline Pre-trained Multi-agent Decision Transformer
2023cited by this paper
Toward robust and scalable deep spiking reinforcement learning
2023cited by this paper
SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks
2023cited by this paper
Critic-Guided Decision Transformer for Offline Reinforcement Learning
2023cited by this paper
SpikeBERT: A language spikformer learned from BERT with knowledge distillation
2023cited by this paper
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
2023cited by this paper
RMP-Loss: Regularizing Membrane Potential Distribution for Spiking Neural Networks
2023cited by this paper
Learnable Surrogate Gradient for Direct Training Spiking Neural Networks
2023cited by this paper
Deep Directly-Trained Spiking Neural Networks for Object Detection
2023cited by this paper
Elastic Decision Transformer
2023cited by this paper
Spike-driven Transformer
2023cited by this paper
Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets
2023cited by this paper
Large sequence models for sequential decision-making: a survey
2023cited by this paper
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning
2023cited by this paper
Prompt-Tuning Decision Transformer with Preference Ranking
2023cited by this paper
The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022
2022cited by this paper
Temporal Efficient Training of Spiking Neural Network via Gradient Re-weighting
2022cited by this paper
When does return-conditioned supervised learning work for offline reinforcement learning?
2022cited by this paper
Mildly Conservative Q-Learning for Offline Reinforcement Learning
2022cited by this paper
Spike Calibration: Fast and Accurate Conversion of Spiking Neural Network for Object Detection and Segmentation
2022cited by this paper
Spikformer: When Spiking Neural Network Meets Transformer
2022cited by this paper
MetaFormer Baselines for Vision
2022cited by this paper
How Crucial is Transformer in Decision Transformer?
2022cited by this paper
RT-1: Robotics Transformer for Real-World Control at Scale
2022cited by this paper
Offline Reinforcement Learning with Fisher Divergence Critic Regularization
2021cited by this paper
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning
2021cited by this paper
RvS: What is Essential for Offline RL via Supervised Learning?
2021cited by this paper
Quantile Filtered Imitation Learning
2021cited by this paper
Advancing Spiking Neural Networks Toward Deep Residual Learning
2021cited by this paper
Offline Reinforcement Learning with Implicit Q-Learning
2021cited by this paper
Deep Residual Learning in Spiking Neural Networks
2021cited by this paper
Offline Reinforcement Learning as One Big Sequence Modeling Problem
2021cited by this paper
Decision Transformer: Reinforcement Learning via Sequence Modeling
2021influential reference
An Attention Free Transformer
2021cited by this paper
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
2020cited by this paper
Going Deeper With Directly-Trained Larger Spiking Neural Networks
2020cited by this paper
Strategy and Benchmark for Converting Deep Q-Networks to Event-Driven Spiking Neural Networks
2020cited by this paper
Conservative Q-Learning for Offline Reinforcement Learning
2020cited by this paper
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
2020cited by this paper
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning
2020cited by this paper
SPIKING NEURAL NETWORKS
2020cited by this paper
Spiking-YOLO: Spiking Neural Network for Energy-Efficient Object Detection
2019cited by this paper
Towards spike-based machine intelligence with neuromorphic computing
2019cited by this paper
Behavior Regularized Offline Reinforcement Learning
2019cited by this paper
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
2019cited by this paper
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
2019cited by this paper
Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to Atari Breakout game
2019cited by this paper
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
2019cited by this paper
Exponentially Weighted Imitation Learning for Batched Historical Data
2018cited by this paper
Loihi: A Neuromorphic Manycore Processor with On-Chip Learning
2018cited by this paper
Off-Policy Deep Reinforcement Learning without Exploration
2018cited by this paper
Behavioral Cloning from Observation
2018cited by this paper
Spatio-Temporal Backpropagation for Training High-Performance Spiking Neural Networks
2017cited by this paper
Attention is All you Need
2017cited by this paper
Bridging the Gap Between Value and Policy Based Reinforcement Learning
2017cited by this paper
1.1 Computing's energy problem (and what we can do about it)
2014cited by this paper
Networks of Spiking Neurons: The Third Generation of Neural Network Models
1996cited by this paper

CITED BY

ASG-TDM: A Graph-Enhanced Transformer With Spiking Preprocessing and a Dendritic Head for Multi-Task Facial Analysis
2026cites this paper
Edge Intelligence with Spiking Neural Networks
2025cites this paper