Small Object Tracking in LiDAR Point Clouds: Learning the Target-Awareness Prototype and Fine-Grained Search Region

Light Detection and Ranging (LiDAR) point clouds are an essential perception modality for artificial intelligence systems like autonomous driving and robotics, where the ubiquity of small objects in real-world scenarios substantially challenges the visual tracking of small targets amidst the vastnes...

Full description

Saved in:
Bibliographic Details
Main Authors: Shengjing Tian, Yinan Han, Xiantong Zhao, Xiuping Liu
Format: Article
Language:English
Published: MDPI AG 2025-06-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/25/12/3633
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1839652713879568384
author Shengjing Tian
Yinan Han
Xiantong Zhao
Xiuping Liu
author_facet Shengjing Tian
Yinan Han
Xiantong Zhao
Xiuping Liu
author_sort Shengjing Tian
collection DOAJ
description Light Detection and Ranging (LiDAR) point clouds are an essential perception modality for artificial intelligence systems like autonomous driving and robotics, where the ubiquity of small objects in real-world scenarios substantially challenges the visual tracking of small targets amidst the vastness of point cloud data. Current methods predominantly focus on developing universal frameworks for general object categories, often sidelining the persistent difficulties associated with small objects. These challenges stem from a scarcity of foreground points and a low tolerance for disturbances. To this end, we propose a deep neural network framework that trains a Siamese network for feature extraction and innovatively incorporates two pivotal modules: the target-awareness prototype mining (TAPM) module and the regional grid subdivision (RGS) module. The TAPM module utilizes the reconstruction mechanism of the masked auto-encoder to distill prototypes within the feature space, thereby enhancing the salience of foreground points and aiding in the precise localization of small objects. To heighten the tolerance of disturbances in feature maps, the RGS module is devised to retrieve detailed features of the search area, capitalizing on Vision Transformer and pixel shuffle technologies. Furthermore, beyond standard experimental configurations, we have meticulously crafted scaling experiments to assess the robustness of various trackers when dealing with small objects. Comprehensive evaluations show our method achieves a mean Success of 64.9% and 60.4% under original and scaled settings, outperforming benchmarks by +3.6% and +5.4%, respectively.
format Article
id doaj-art-571937a2c83d4f2d98e2f35a8b5b01e5
institution Matheson Library
issn 1424-8220
language English
publishDate 2025-06-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj-art-571937a2c83d4f2d98e2f35a8b5b01e52025-06-25T14:25:19ZengMDPI AGSensors1424-82202025-06-012512363310.3390/s25123633Small Object Tracking in LiDAR Point Clouds: Learning the Target-Awareness Prototype and Fine-Grained Search RegionShengjing Tian0Yinan Han1Xiantong Zhao2Xiuping Liu3School of Economics and Management, China University of Mining and Technology, Xuzhou 221116, ChinaDUT-BSU Joint Institute, Dalian University of Technology, Dalian 116024, ChinaSchool of Mathematical Sciences, Dalian University of Technology, Dalian 116024, ChinaDUT-BSU Joint Institute, Dalian University of Technology, Dalian 116024, ChinaLight Detection and Ranging (LiDAR) point clouds are an essential perception modality for artificial intelligence systems like autonomous driving and robotics, where the ubiquity of small objects in real-world scenarios substantially challenges the visual tracking of small targets amidst the vastness of point cloud data. Current methods predominantly focus on developing universal frameworks for general object categories, often sidelining the persistent difficulties associated with small objects. These challenges stem from a scarcity of foreground points and a low tolerance for disturbances. To this end, we propose a deep neural network framework that trains a Siamese network for feature extraction and innovatively incorporates two pivotal modules: the target-awareness prototype mining (TAPM) module and the regional grid subdivision (RGS) module. The TAPM module utilizes the reconstruction mechanism of the masked auto-encoder to distill prototypes within the feature space, thereby enhancing the salience of foreground points and aiding in the precise localization of small objects. To heighten the tolerance of disturbances in feature maps, the RGS module is devised to retrieve detailed features of the search area, capitalizing on Vision Transformer and pixel shuffle technologies. Furthermore, beyond standard experimental configurations, we have meticulously crafted scaling experiments to assess the robustness of various trackers when dealing with small objects. Comprehensive evaluations show our method achieves a mean Success of 64.9% and 60.4% under original and scaled settings, outperforming benchmarks by +3.6% and +5.4%, respectively.https://www.mdpi.com/1424-8220/25/12/3633point cloudsLiDARsmall objectsvisual trackingdeep neural network
spellingShingle Shengjing Tian
Yinan Han
Xiantong Zhao
Xiuping Liu
Small Object Tracking in LiDAR Point Clouds: Learning the Target-Awareness Prototype and Fine-Grained Search Region
Sensors
point clouds
LiDAR
small objects
visual tracking
deep neural network
title Small Object Tracking in LiDAR Point Clouds: Learning the Target-Awareness Prototype and Fine-Grained Search Region
title_full Small Object Tracking in LiDAR Point Clouds: Learning the Target-Awareness Prototype and Fine-Grained Search Region
title_fullStr Small Object Tracking in LiDAR Point Clouds: Learning the Target-Awareness Prototype and Fine-Grained Search Region
title_full_unstemmed Small Object Tracking in LiDAR Point Clouds: Learning the Target-Awareness Prototype and Fine-Grained Search Region
title_short Small Object Tracking in LiDAR Point Clouds: Learning the Target-Awareness Prototype and Fine-Grained Search Region
title_sort small object tracking in lidar point clouds learning the target awareness prototype and fine grained search region
topic point clouds
LiDAR
small objects
visual tracking
deep neural network
url https://www.mdpi.com/1424-8220/25/12/3633
work_keys_str_mv AT shengjingtian smallobjecttrackinginlidarpointcloudslearningthetargetawarenessprototypeandfinegrainedsearchregion
AT yinanhan smallobjecttrackinginlidarpointcloudslearningthetargetawarenessprototypeandfinegrainedsearchregion
AT xiantongzhao smallobjecttrackinginlidarpointcloudslearningthetargetawarenessprototypeandfinegrainedsearchregion
AT xiupingliu smallobjecttrackinginlidarpointcloudslearningthetargetawarenessprototypeandfinegrainedsearchregion