Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning

The exploitation of deep-sea polymetallic nodules has attracted global attention. To mitigate its impact on deep-sea ecosystems, accurate identification of benthic megafauna is essential for developing science-based mining strategies. Deep learning has emerged as an promising approach in biological...

Full description

Saved in:
Bibliographic Details
Main Authors: Guofan Long, Wei Song, Xiangchun Liu, Ziyao Fang, Jinqi An, Kun Liu, Yaqin Huang, Xuebao He
Format: Article
Language:English
Published: Elsevier 2025-12-01
Series:Ecological Informatics
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1574954125003280
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1839630876426633216
author Guofan Long
Wei Song
Xiangchun Liu
Ziyao Fang
Jinqi An
Kun Liu
Yaqin Huang
Xuebao He
author_facet Guofan Long
Wei Song
Xiangchun Liu
Ziyao Fang
Jinqi An
Kun Liu
Yaqin Huang
Xuebao He
author_sort Guofan Long
collection DOAJ
description The exploitation of deep-sea polymetallic nodules has attracted global attention. To mitigate its impact on deep-sea ecosystems, accurate identification of benthic megafauna is essential for developing science-based mining strategies. Deep learning has emerged as an promising approach in biological identification, offering significantly improved efficiency and reduced reliance on taxonomic expertise. However, challenges such as similar species characteristics, occlusion, and species abundance imbalance hinder accurate recognition of deep-sea benthic megafauna. To address these issues, the present study proposes Benthic Megafauna-YOLO (BM-YOLO). Its backbone integrates deformable convolutions, attention mechanisms, and ResNet structures to improve feature extraction and reduce background interference. Additionally, a transformer-enhanced PAFPN incorporates global context, improving occluded target recognition. To address the challenge of low-abundance species, we introduce a varifocal head that enhances the focus of the model on rare species. To address the lack of relevant datasets, we constructed the Pacific Ocean Benthic Megafauna dataset using images collected during the China Oceanic 45 and 50 surveys. Notably, BM-YOLO achieved AP50 scores exceeding 90 % and 95 % for six and four species, respectively. Furthermore, the model achieved the mAP50 and mAP50:95 values of 85.9 % and 56.2 %, respectively, improving on YOLOv8s by 2.6 % and 4.6 % and YOLOv11s by 0.7 % and 1.5 %. On the public underwater dataset DUO, BM-YOLO achieved mAP50 and mAP50:95 values of 82.4 % and 62.7 %, respectively, outperforming YOLOv10s by 2.0 % and 2.9 %. These results highlight the potential of BM-YOLO in automated deep-sea species identification. We also identified key factors influencing species recognition accuracy through experimental analysis.
format Article
id doaj-art-7f6da561d81a47ec8a4db0f95932b68f
institution Matheson Library
issn 1574-9541
language English
publishDate 2025-12-01
publisher Elsevier
record_format Article
series Ecological Informatics
spelling doaj-art-7f6da561d81a47ec8a4db0f95932b68f2025-07-13T04:53:41ZengElsevierEcological Informatics1574-95412025-12-0190103319Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learningGuofan Long0Wei Song1Xiangchun Liu2Ziyao Fang3Jinqi An4Kun Liu5Yaqin Huang6Xuebao He7School of Information Engineering, Minzu University of China, Beijing 100081, China; Laboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, ChinaSchool of Information Engineering, Minzu University of China, Beijing 100081, China; Key Laboratory of Ethnic Language Intelligent Analysis and Security Governance of MOE, Minzu University of China, Beijing 100081, China; Corresponding author at: School of Information Engineering, Minzu University of China, Beijing 100081, China.School of Information Engineering, Minzu University of China, Beijing 100081, ChinaSchool of Information Engineering, Minzu University of China, Beijing 100081, China; Laboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, ChinaLaboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, ChinaLaboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, ChinaLaboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, ChinaLaboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, China; Corresponding author at: Laboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, China.The exploitation of deep-sea polymetallic nodules has attracted global attention. To mitigate its impact on deep-sea ecosystems, accurate identification of benthic megafauna is essential for developing science-based mining strategies. Deep learning has emerged as an promising approach in biological identification, offering significantly improved efficiency and reduced reliance on taxonomic expertise. However, challenges such as similar species characteristics, occlusion, and species abundance imbalance hinder accurate recognition of deep-sea benthic megafauna. To address these issues, the present study proposes Benthic Megafauna-YOLO (BM-YOLO). Its backbone integrates deformable convolutions, attention mechanisms, and ResNet structures to improve feature extraction and reduce background interference. Additionally, a transformer-enhanced PAFPN incorporates global context, improving occluded target recognition. To address the challenge of low-abundance species, we introduce a varifocal head that enhances the focus of the model on rare species. To address the lack of relevant datasets, we constructed the Pacific Ocean Benthic Megafauna dataset using images collected during the China Oceanic 45 and 50 surveys. Notably, BM-YOLO achieved AP50 scores exceeding 90 % and 95 % for six and four species, respectively. Furthermore, the model achieved the mAP50 and mAP50:95 values of 85.9 % and 56.2 %, respectively, improving on YOLOv8s by 2.6 % and 4.6 % and YOLOv11s by 0.7 % and 1.5 %. On the public underwater dataset DUO, BM-YOLO achieved mAP50 and mAP50:95 values of 82.4 % and 62.7 %, respectively, outperforming YOLOv10s by 2.0 % and 2.9 %. These results highlight the potential of BM-YOLO in automated deep-sea species identification. We also identified key factors influencing species recognition accuracy through experimental analysis.http://www.sciencedirect.com/science/article/pii/S1574954125003280Deep-sea benthic megafauna recognitionDeep-sea ecosystem monitoringMarine organism recognition datasetsOcean miningUnderwater object detection and computer vision
spellingShingle Guofan Long
Wei Song
Xiangchun Liu
Ziyao Fang
Jinqi An
Kun Liu
Yaqin Huang
Xuebao He
Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning
Ecological Informatics
Deep-sea benthic megafauna recognition
Deep-sea ecosystem monitoring
Marine organism recognition datasets
Ocean mining
Underwater object detection and computer vision
title Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning
title_full Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning
title_fullStr Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning
title_full_unstemmed Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning
title_short Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning
title_sort automated recognition of deep sea benthic megafauna in polymetallic nodule mining areas based on deep learning
topic Deep-sea benthic megafauna recognition
Deep-sea ecosystem monitoring
Marine organism recognition datasets
Ocean mining
Underwater object detection and computer vision
url http://www.sciencedirect.com/science/article/pii/S1574954125003280
work_keys_str_mv AT guofanlong automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning
AT weisong automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning
AT xiangchunliu automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning
AT ziyaofang automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning
AT jinqian automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning
AT kunliu automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning
AT yaqinhuang automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning
AT xuebaohe automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning