Enhancing semantic segmentation of Ecuadorian shrimp ponds through fine-tuned vision transformers and U-Net architectures utilizing open-source remote sensing data

Aquaculture has emerged as an important pillar of global food production, and shrimp farming plays a critical role in fulfilling the growing demand for seafood. This is especially true in Ecuador, which is recognized as one of the world's largest exporters and producers of shrimp. However, conv...

Full description

Saved in:

Bibliographic Details
Main Authors:	Daniel Jacome, Jianghao Wang, Yong Ge
Format:	Article
Language:	English
Published:	Taylor & Francis Group 2025-12-01
Series:	International Journal of Digital Earth
Subjects:	Pond segmentation remote sensing ViT U-Net fine-tuning
Online Access:	https://www.tandfonline.com/doi/10.1080/17538947.2025.2538214
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1839612834112077824
author	Daniel Jacome Jianghao Wang Yong Ge
author_facet	Daniel Jacome Jianghao Wang Yong Ge
author_sort	Daniel Jacome
collection	DOAJ
description	Aquaculture has emerged as an important pillar of global food production, and shrimp farming plays a critical role in fulfilling the growing demand for seafood. This is especially true in Ecuador, which is recognized as one of the world's largest exporters and producers of shrimp. However, conventional shrimp pond monitoring has limitations owing to the extensive scale and operational complexity. Traditional methods using low-resolution imagery and ground surveys are hampered by cloud cover, outdated maps, and insufficient temporal resolution, leading to inaccurate pond area estimations and hindering timely management. Our framework accurately segmented shrimp ponds from high-resolution satellite images. Using a fine-tuned Prithvi 100M model, we achieved a state-of-the-art mIoU of 0.970 and 0.993 accuracy, respectively. This significantly surpasses other models, such as ViT-base (mIoU = 0.878) and U-Net variants (mIoU = 0.949). The pre-training of the Prithvi 100M model allowed it to effectively capture intricate pond boundaries and subtle internal structures, resulting in highly accurate and detailed segmentation masks. Fine-tuning the encoder proved to be the most effective (mIoU = 0.991), whereas standard data augmentation negatively impacted the performance. This methodology offers a valuable tool for enhancing water resource management and promoting sustainable aquaculture practices in Ecuadorian coastal regions.
format	Article
id	doaj-art-ee21d6b9c7544d13b3500d942cf35d6a
institution	Matheson Library
issn	1753-8947 1753-8955
language	English
publishDate	2025-12-01
publisher	Taylor & Francis Group
record_format	Article
series	International Journal of Digital Earth
spelling	doaj-art-ee21d6b9c7544d13b3500d942cf35d6a2025-07-28T00:00:49ZengTaylor & Francis GroupInternational Journal of Digital Earth1753-89471753-89552025-12-0118110.1080/17538947.2025.2538214Enhancing semantic segmentation of Ecuadorian shrimp ponds through fine-tuned vision transformers and U-Net architectures utilizing open-source remote sensing dataDaniel Jacome0Jianghao Wang1Yong Ge2State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, People’s Republic of ChinaState Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, People’s Republic of ChinaState Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, People’s Republic of ChinaAquaculture has emerged as an important pillar of global food production, and shrimp farming plays a critical role in fulfilling the growing demand for seafood. This is especially true in Ecuador, which is recognized as one of the world's largest exporters and producers of shrimp. However, conventional shrimp pond monitoring has limitations owing to the extensive scale and operational complexity. Traditional methods using low-resolution imagery and ground surveys are hampered by cloud cover, outdated maps, and insufficient temporal resolution, leading to inaccurate pond area estimations and hindering timely management. Our framework accurately segmented shrimp ponds from high-resolution satellite images. Using a fine-tuned Prithvi 100M model, we achieved a state-of-the-art mIoU of 0.970 and 0.993 accuracy, respectively. This significantly surpasses other models, such as ViT-base (mIoU = 0.878) and U-Net variants (mIoU = 0.949). The pre-training of the Prithvi 100M model allowed it to effectively capture intricate pond boundaries and subtle internal structures, resulting in highly accurate and detailed segmentation masks. Fine-tuning the encoder proved to be the most effective (mIoU = 0.991), whereas standard data augmentation negatively impacted the performance. This methodology offers a valuable tool for enhancing water resource management and promoting sustainable aquaculture practices in Ecuadorian coastal regions.https://www.tandfonline.com/doi/10.1080/17538947.2025.2538214Pond segmentationremote sensingViTU-Netfine-tuning
spellingShingle	Daniel Jacome Jianghao Wang Yong Ge Enhancing semantic segmentation of Ecuadorian shrimp ponds through fine-tuned vision transformers and U-Net architectures utilizing open-source remote sensing data International Journal of Digital Earth Pond segmentation remote sensing ViT U-Net fine-tuning
title	Enhancing semantic segmentation of Ecuadorian shrimp ponds through fine-tuned vision transformers and U-Net architectures utilizing open-source remote sensing data
title_full	Enhancing semantic segmentation of Ecuadorian shrimp ponds through fine-tuned vision transformers and U-Net architectures utilizing open-source remote sensing data
title_fullStr	Enhancing semantic segmentation of Ecuadorian shrimp ponds through fine-tuned vision transformers and U-Net architectures utilizing open-source remote sensing data
title_full_unstemmed	Enhancing semantic segmentation of Ecuadorian shrimp ponds through fine-tuned vision transformers and U-Net architectures utilizing open-source remote sensing data
title_short	Enhancing semantic segmentation of Ecuadorian shrimp ponds through fine-tuned vision transformers and U-Net architectures utilizing open-source remote sensing data
title_sort	enhancing semantic segmentation of ecuadorian shrimp ponds through fine tuned vision transformers and u net architectures utilizing open source remote sensing data
topic	Pond segmentation remote sensing ViT U-Net fine-tuning
url	https://www.tandfonline.com/doi/10.1080/17538947.2025.2538214
work_keys_str_mv	AT danieljacome enhancingsemanticsegmentationofecuadorianshrimppondsthroughfinetunedvisiontransformersandunetarchitecturesutilizingopensourceremotesensingdata AT jianghaowang enhancingsemanticsegmentationofecuadorianshrimppondsthroughfinetunedvisiontransformersandunetarchitecturesutilizingopensourceremotesensingdata AT yongge enhancingsemanticsegmentationofecuadorianshrimppondsthroughfinetunedvisiontransformersandunetarchitecturesutilizingopensourceremotesensingdata

Enhancing semantic segmentation of Ecuadorian shrimp ponds through fine-tuned vision transformers and U-Net architectures utilizing open-source remote sensing data

Similar Items