Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning
The exploitation of deep-sea polymetallic nodules has attracted global attention. To mitigate its impact on deep-sea ecosystems, accurate identification of benthic megafauna is essential for developing science-based mining strategies. Deep learning has emerged as an promising approach in biological...
Saved in:
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2025-12-01
|
Series: | Ecological Informatics |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S1574954125003280 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1839630876426633216 |
---|---|
author | Guofan Long Wei Song Xiangchun Liu Ziyao Fang Jinqi An Kun Liu Yaqin Huang Xuebao He |
author_facet | Guofan Long Wei Song Xiangchun Liu Ziyao Fang Jinqi An Kun Liu Yaqin Huang Xuebao He |
author_sort | Guofan Long |
collection | DOAJ |
description | The exploitation of deep-sea polymetallic nodules has attracted global attention. To mitigate its impact on deep-sea ecosystems, accurate identification of benthic megafauna is essential for developing science-based mining strategies. Deep learning has emerged as an promising approach in biological identification, offering significantly improved efficiency and reduced reliance on taxonomic expertise. However, challenges such as similar species characteristics, occlusion, and species abundance imbalance hinder accurate recognition of deep-sea benthic megafauna. To address these issues, the present study proposes Benthic Megafauna-YOLO (BM-YOLO). Its backbone integrates deformable convolutions, attention mechanisms, and ResNet structures to improve feature extraction and reduce background interference. Additionally, a transformer-enhanced PAFPN incorporates global context, improving occluded target recognition. To address the challenge of low-abundance species, we introduce a varifocal head that enhances the focus of the model on rare species. To address the lack of relevant datasets, we constructed the Pacific Ocean Benthic Megafauna dataset using images collected during the China Oceanic 45 and 50 surveys. Notably, BM-YOLO achieved AP50 scores exceeding 90 % and 95 % for six and four species, respectively. Furthermore, the model achieved the mAP50 and mAP50:95 values of 85.9 % and 56.2 %, respectively, improving on YOLOv8s by 2.6 % and 4.6 % and YOLOv11s by 0.7 % and 1.5 %. On the public underwater dataset DUO, BM-YOLO achieved mAP50 and mAP50:95 values of 82.4 % and 62.7 %, respectively, outperforming YOLOv10s by 2.0 % and 2.9 %. These results highlight the potential of BM-YOLO in automated deep-sea species identification. We also identified key factors influencing species recognition accuracy through experimental analysis. |
format | Article |
id | doaj-art-7f6da561d81a47ec8a4db0f95932b68f |
institution | Matheson Library |
issn | 1574-9541 |
language | English |
publishDate | 2025-12-01 |
publisher | Elsevier |
record_format | Article |
series | Ecological Informatics |
spelling | doaj-art-7f6da561d81a47ec8a4db0f95932b68f2025-07-13T04:53:41ZengElsevierEcological Informatics1574-95412025-12-0190103319Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learningGuofan Long0Wei Song1Xiangchun Liu2Ziyao Fang3Jinqi An4Kun Liu5Yaqin Huang6Xuebao He7School of Information Engineering, Minzu University of China, Beijing 100081, China; Laboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, ChinaSchool of Information Engineering, Minzu University of China, Beijing 100081, China; Key Laboratory of Ethnic Language Intelligent Analysis and Security Governance of MOE, Minzu University of China, Beijing 100081, China; Corresponding author at: School of Information Engineering, Minzu University of China, Beijing 100081, China.School of Information Engineering, Minzu University of China, Beijing 100081, ChinaSchool of Information Engineering, Minzu University of China, Beijing 100081, China; Laboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, ChinaLaboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, ChinaLaboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, ChinaLaboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, ChinaLaboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, China; Corresponding author at: Laboratory of Marine Biodiversity, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, China.The exploitation of deep-sea polymetallic nodules has attracted global attention. To mitigate its impact on deep-sea ecosystems, accurate identification of benthic megafauna is essential for developing science-based mining strategies. Deep learning has emerged as an promising approach in biological identification, offering significantly improved efficiency and reduced reliance on taxonomic expertise. However, challenges such as similar species characteristics, occlusion, and species abundance imbalance hinder accurate recognition of deep-sea benthic megafauna. To address these issues, the present study proposes Benthic Megafauna-YOLO (BM-YOLO). Its backbone integrates deformable convolutions, attention mechanisms, and ResNet structures to improve feature extraction and reduce background interference. Additionally, a transformer-enhanced PAFPN incorporates global context, improving occluded target recognition. To address the challenge of low-abundance species, we introduce a varifocal head that enhances the focus of the model on rare species. To address the lack of relevant datasets, we constructed the Pacific Ocean Benthic Megafauna dataset using images collected during the China Oceanic 45 and 50 surveys. Notably, BM-YOLO achieved AP50 scores exceeding 90 % and 95 % for six and four species, respectively. Furthermore, the model achieved the mAP50 and mAP50:95 values of 85.9 % and 56.2 %, respectively, improving on YOLOv8s by 2.6 % and 4.6 % and YOLOv11s by 0.7 % and 1.5 %. On the public underwater dataset DUO, BM-YOLO achieved mAP50 and mAP50:95 values of 82.4 % and 62.7 %, respectively, outperforming YOLOv10s by 2.0 % and 2.9 %. These results highlight the potential of BM-YOLO in automated deep-sea species identification. We also identified key factors influencing species recognition accuracy through experimental analysis.http://www.sciencedirect.com/science/article/pii/S1574954125003280Deep-sea benthic megafauna recognitionDeep-sea ecosystem monitoringMarine organism recognition datasetsOcean miningUnderwater object detection and computer vision |
spellingShingle | Guofan Long Wei Song Xiangchun Liu Ziyao Fang Jinqi An Kun Liu Yaqin Huang Xuebao He Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning Ecological Informatics Deep-sea benthic megafauna recognition Deep-sea ecosystem monitoring Marine organism recognition datasets Ocean mining Underwater object detection and computer vision |
title | Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning |
title_full | Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning |
title_fullStr | Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning |
title_full_unstemmed | Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning |
title_short | Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning |
title_sort | automated recognition of deep sea benthic megafauna in polymetallic nodule mining areas based on deep learning |
topic | Deep-sea benthic megafauna recognition Deep-sea ecosystem monitoring Marine organism recognition datasets Ocean mining Underwater object detection and computer vision |
url | http://www.sciencedirect.com/science/article/pii/S1574954125003280 |
work_keys_str_mv | AT guofanlong automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning AT weisong automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning AT xiangchunliu automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning AT ziyaofang automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning AT jinqian automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning AT kunliu automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning AT yaqinhuang automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning AT xuebaohe automatedrecognitionofdeepseabenthicmegafaunainpolymetallicnoduleminingareasbasedondeeplearning |