<i>Macao-ebird</i>: A Curated Dataset for Artificial-Intelligence-Powered Bird Surveillance and Conservation in Macao

Artificial intelligence (AI) currently exhibits considerable potential within the realm of biodiversity conservation. However, high-quality regionally customized datasets remain scarce, particularly within urban environments. The existing large-scale bird image datasets often lack a dedicated focus...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiaoyuan Huang, Silvia Mirri, Su-Kit Tang
Format: Article
Language:English
Published: MDPI AG 2025-05-01
Series:Data
Subjects:
Online Access:https://www.mdpi.com/2306-5729/10/6/84
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Artificial intelligence (AI) currently exhibits considerable potential within the realm of biodiversity conservation. However, high-quality regionally customized datasets remain scarce, particularly within urban environments. The existing large-scale bird image datasets often lack a dedicated focus on endangered species endemic to specific geographic regions, as well as a nuanced consideration of the complex interplay between urban and natural environmental contexts. Therefore, this paper introduces <i>Macao-ebird</i>, a novel dataset designed to advance AI-driven recognition and conservation of endangered bird species in Macao. The dataset comprises two subsets: (1) <i>Macao-ebird-cls</i>, a classification dataset with 7341 images covering 24 bird species, emphasizing endangered and vulnerable species native to Macao; and (2) <i>Macao-ebird-det</i>, an object detection dataset generated through AI-agent-assisted labeling using grounding DETR with improved denoising anchor boxes (DINO), significantly reducing manual annotation effort while maintaining high-quality bounding-box annotations. We validate the dataset’s utility through baseline experiments with the You Only Look Once (YOLO) v8–v12 series, achieving a mean average precision (mAP50) of up to 0.984. <i>Macao-ebird</i> addresses critical gaps in the existing datasets by focusing on region-specific endangered species and complex urban–natural environments, providing a benchmark for AI applications in avian conservation.
ISSN:2306-5729