Efficient Semantic Segmentation of Remote Sensing Images Through Global-Local Feature Integration

The rapid acquisition of remote sensing information plays a significant role in the development of image semantic segmentation methods for remote sensing image interpretation applications. With the increasing variety and complexity of data recorded by satellite remote sensing images, accurately and...

Full description

Saved in:
Bibliographic Details
Main Authors: Fengyi Zhang, Xiuyu Xia
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10990155/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1839633656711217152
author Fengyi Zhang
Xiuyu Xia
author_facet Fengyi Zhang
Xiuyu Xia
author_sort Fengyi Zhang
collection DOAJ
description The rapid acquisition of remote sensing information plays a significant role in the development of image semantic segmentation methods for remote sensing image interpretation applications. With the increasing variety and complexity of data recorded by satellite remote sensing images, accurately and effectively extracting information from remote sensing images has become crucial for interpreting these images using semantic segmentation methods. However, natural scene-based semantic segmentation methods often suffer from low segmentation accuracy when applied to high-resolution remote sensing images due to the complex background, diverse scales, and irregular shapes. To address these challenges, this paper proposes an efficient remote sensing image semantic segmentation model called Multi-GLISS, which integrates global and local features. The model captures global features through consecutive downsampling and Fourier transform while preserving spatial feature learning and boundary information using convolutional residual layers. Moreover, the local wavelet block utilizes wavelet transform to extract multi-scale local features, enhancing sensitivity to local textures and shapes. Subsequently, the model gradually increases the image resolution through a series of upsampling steps, reconstructing a high-resolution version of the image. Experiments on the Vaihingen dataset show that Multi-GLISS achieves a mean F1 score of 93.29, outperforming the baseline DeepLabV3+ by 1.17 points, while providing clear boundary and structural information for accurate and detailed segmentation.
format Article
id doaj-art-3c70db811f8544e1bd6fd346e125e80c
institution Matheson Library
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-3c70db811f8544e1bd6fd346e125e80c2025-07-10T23:00:49ZengIEEEIEEE Access2169-35362025-01-011311565311566810.1109/ACCESS.2025.356780610990155Efficient Semantic Segmentation of Remote Sensing Images Through Global-Local Feature IntegrationFengyi Zhang0https://orcid.org/0009-0004-8476-3028Xiuyu Xia1College of Information Engineering, Chengdu Vocational and Technical College of Industry, Chengdu, ChinaCollege of Electronics and Information Engineering, Sichuan University, Chengdu, ChinaThe rapid acquisition of remote sensing information plays a significant role in the development of image semantic segmentation methods for remote sensing image interpretation applications. With the increasing variety and complexity of data recorded by satellite remote sensing images, accurately and effectively extracting information from remote sensing images has become crucial for interpreting these images using semantic segmentation methods. However, natural scene-based semantic segmentation methods often suffer from low segmentation accuracy when applied to high-resolution remote sensing images due to the complex background, diverse scales, and irregular shapes. To address these challenges, this paper proposes an efficient remote sensing image semantic segmentation model called Multi-GLISS, which integrates global and local features. The model captures global features through consecutive downsampling and Fourier transform while preserving spatial feature learning and boundary information using convolutional residual layers. Moreover, the local wavelet block utilizes wavelet transform to extract multi-scale local features, enhancing sensitivity to local textures and shapes. Subsequently, the model gradually increases the image resolution through a series of upsampling steps, reconstructing a high-resolution version of the image. Experiments on the Vaihingen dataset show that Multi-GLISS achieves a mean F1 score of 93.29, outperforming the baseline DeepLabV3+ by 1.17 points, while providing clear boundary and structural information for accurate and detailed segmentation.https://ieeexplore.ieee.org/document/10990155/Remote sensing image segmentationFourier domainmulti-scale convolution
spellingShingle Fengyi Zhang
Xiuyu Xia
Efficient Semantic Segmentation of Remote Sensing Images Through Global-Local Feature Integration
IEEE Access
Remote sensing image segmentation
Fourier domain
multi-scale convolution
title Efficient Semantic Segmentation of Remote Sensing Images Through Global-Local Feature Integration
title_full Efficient Semantic Segmentation of Remote Sensing Images Through Global-Local Feature Integration
title_fullStr Efficient Semantic Segmentation of Remote Sensing Images Through Global-Local Feature Integration
title_full_unstemmed Efficient Semantic Segmentation of Remote Sensing Images Through Global-Local Feature Integration
title_short Efficient Semantic Segmentation of Remote Sensing Images Through Global-Local Feature Integration
title_sort efficient semantic segmentation of remote sensing images through global local feature integration
topic Remote sensing image segmentation
Fourier domain
multi-scale convolution
url https://ieeexplore.ieee.org/document/10990155/
work_keys_str_mv AT fengyizhang efficientsemanticsegmentationofremotesensingimagesthroughgloballocalfeatureintegration
AT xiuyuxia efficientsemanticsegmentationofremotesensingimagesthroughgloballocalfeatureintegration