RoadFocusNet: road extraction from remote sensing imagery using focused transformer and focused masked image modeling

Hao Chen,Liangzhe Yang,Qingren Jia,Wei Xiong

Published 2025 in International Journal of Digital Earth

ABSTRACT

ABSTRACT Road extraction from remote sensing (RS) imagery is crucial for urban management, traffic planning, and autonomous driving. However, extracting accurate and complete roads remains challenging due to occlusions and severe class imbalance, where non-road regions dominate. To address these challenges, we propose a novel road extraction method incorporating two key components. The first is a Focused Masked Image Modeling (FocusMIM) strategy for data augmentation, which randomly masks road-related regions to efficiently model the latent dependency between occluded and non-occluded road parts. With FocusMIM, the model's ability to infer occluded roads is obviously improved. The second is a Focused Transformer (FocusFormer), which enhances road-related feature interactions through a Transformer-based encoder with Channel Self-Attention (CSA) modules and a Transformer decoder that leverages masked attention. The CSA modules aggregate global features of RS images to enhance contextual inference and mitigate occlusions. Meanwhile, the Transformer decoder employs a single road query that attends exclusively to road features, alleviating the class imbalance issue. Comprehensive experiments on the DeepGlobe Road, Massachusetts Road, and CHN6-CUG datasets demonstrate that our method outperforms several state-of-the-art methods, achieving an IoU increase of 0.96–5.38%. These results confirm the effectiveness of FocusMIM and FocusFormer in improving road continuity and reducing background interference.

PUBLICATION RECORD

Publication year
2025
Venue
International Journal of Digital Earth
Publication date
2025-09-03
Fields of study
Computer Science, Engineering, Environmental Science
Identifiers
DOI 10.1080/17538947.2025.2549435
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

DF-DRUNet: A decoder fusion model for automatic road extraction leveraging remote sensing images and GPS trajectory data
2024cited by this paper
Semantic-Spatial Collaborative Perception Network for Remote Sensing Image Captioning
2024cited by this paper
C²Net: Road Extraction via Context Perception and Cross Spatial-Scale Feature Interaction
2024cited by this paper
Global road extraction using a pseudo-label guided framework: from benchmark dataset to cross-region semi-supervised learning
2024influential reference
Occlusion-Aware Road Extraction Network for High-Resolution Remote Sensing Imagery
2024cited by this paper
Lightweight Cross-Modal Information Measure and Propagation for Road Extraction From Remote Sensing Image and Trajectory/LiDAR
2024cited by this paper
A Multiscale and Multidirection Feature Fusion Network for Road Detection From Satellite Imagery
2024cited by this paper
RADANet: Road Augmented Deformable Attention Network for Road Extraction From Complex High-Resolution Remote-Sensing Images
2023cited by this paper
RemainNet: Explore Road Extraction from Remote Sensing Image Using Mask Image Modeling
2023influential reference
A Semantics-Geometry Framework for Road Extraction From Remote Sensing Images
2023cited by this paper
Semi-MapGen: Translation of Remote Sensing Image Into Map via Semisupervised Adversarial Learning
2023cited by this paper
SemiRoadExNet: A semi-supervised network for road extraction from remote sensing imagery via adversarial learning
2023cited by this paper
Road Extraction From Satellite Imagery by Road Context and Full-Stage Feature
2023cited by this paper
Improving Road Surface Area Extraction via Semantic Segmentation with Conditional Generative Learning for Deep Inpainting Operations
2022cited by this paper
SW-GAN: Road Extraction from Remote Sensing Imagery Using Semi-Weakly Supervised Adversarial Learning
2022cited by this paper
NIGAN: A Framework for Mountain Road Extraction Integrating Remote Sensing Road-Scene Neighborhood Probability Enhancements and Improved Conditional Generative Adversarial Network
2022cited by this paper
BDTNet: Road Extraction by Bi-Direction Transformer From Remote Sensing Images
2022cited by this paper
Road extraction in remote sensing data: A survey
2022cited by this paper
Masked Frequency Modeling for Self-Supervised Visual Pre-Training
2022cited by this paper
TransRoadNet: A Novel Road Extraction Method for Remote Sensing Images via Combining High-Level Semantic Feature and Context
2022cited by this paper
What to Hide from Your Students: Attention-Guided Masked Image Modeling
2022cited by this paper
MSACon: Mining Spatial Attention-Based Contextual Information for Road Extraction
2022cited by this paper
Masked Autoencoders Are Scalable Vision Learners
2021cited by this paper
Reconstruction Bias U-Net for Road Extraction From Optical Remote Sensing Images
2021cited by this paper
Topo-Boundary: A Benchmark Dataset on Topological Road-Boundary Detection Using Aerial Images for Autonomous Driving
2021cited by this paper
GAMSNet: Globally aware road detection network with multi-scale residual learning
2021influential reference
A Global Context-aware and Batch-independent Network for road extraction from VHR satellite imagery
2021influential reference
MST: Masked Self-Supervised Transformer for Visual Representation
2021cited by this paper
DBRANet: Road Extraction by Dual-Branch Encoder and Regional Attention Decoder
2021cited by this paper
Simple Training Strategies and Model Scaling for Object Detection
2021influential reference
Road extraction using Aerial images for future Navigation
2021cited by this paper
Masked-attention Mask Transformer for Universal Image Segmentation
2021influential reference
End-to-End Object Detection with Transformers
2020cited by this paper
MACU-Net for Semantic Segmentation of Fine-Resolution Remotely Sensed Images
2020cited by this paper
An Ensemble Wasserstein Generative Adversarial Network Method for Road Extraction From High Resolution Remote Sensing Images in Rural Areas
2020cited by this paper
PointRend: Image Segmentation As Rendering
2019cited by this paper
Road Extraction from Very High Resolution Images Using Weakly labeled OpenStreetMap Centerline
2019cited by this paper
NL-LinkNet: Toward Lighter But More Accurate Road Extraction With Nonlocal Operations
2019cited by this paper
Extraction of road features from UAV images using a novel level set segmentation approach
2019cited by this paper
Vision-Based Road-Following Using Results of Semantic Segmentation for Autonomous Navigation
2019cited by this paper
Spatial Information Inference Net: Road Extraction Using Road-Specific Contextual Information
2019cited by this paper
D-LinkNet: LinkNet with Pretrained Encoder and Dilated Convolution for High Resolution Satellite Imagery Road Extraction
2018influential reference
DeepGlobe 2018: A Challenge to Parse the Earth through Satellite Images
2018influential reference
Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks
2017cited by this paper
Road Structure Refined CNN for Road Extraction in Aerial Image
2017cited by this paper
Road Extraction by Deep Residual U-Net
2017cited by this paper
Decoupled Weight Decay Regularization
2017cited by this paper
Big Data for Remote Sensing: Challenges and Opportunities
2016cited by this paper
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
2016cited by this paper
Pyramid Scene Parsing Network
2016cited by this paper
Feature Pyramid Networks for Object Detection
2016cited by this paper
Automatic Road Extraction From Remote Sensing Images Based on a Normalized Second Derivative Map
2015cited by this paper
U-Net: Convolutional Networks for Biomedical Image Segmentation
2015influential reference
Road network extraction: a neural-dynamic framework based on deep learning and a finite state machine
2015cited by this paper
Geospatial Big Data Handling Theory and Methods: A Review and Research Challenges
2015cited by this paper
Remote sensing big data computing: Challenges and opportunities
2015cited by this paper
A Higher-Order CRF Model for Road Network Extraction
2013cited by this paper
Machine Learning for Aerial Image Labeling
2013influential reference
Semi-Automated Road Detection From High Resolution Satellite Images by Directional Morphological Enhancement and Segmentation Techniques
2012cited by this paper
Use of Salient Features for the Design of a Multistage Framework to Extract Roads From High-Resolution Multispectral Satellite Images
2011cited by this paper
Application of a Fast Linear Feature Detector to Road Extraction From Remotely Sensed Imagery
2011cited by this paper
Advanced directional mathematical morphology for the detection of the road network in very high resolution remote sensing images
2010cited by this paper

CITED BY

Mapping the unmapped: deep learning ensembles and novel loss functions for scalable road segmentation from aerial imagery
2026cites this paper
Road Extraction: An Improved U-Net Based Approach with Attention Mechanisms and Multi-Scale Fusion for Satellite Imagery
2025cites this paper