DOI: 10.3390/app16136286 ISSN: 2076-3417

Distance-Aware DBSCAN–STM Pipeline with Centralized Point Augmentation for LiDAR-Based Pedestrian Candidate Generation

Jihwan Yeom, Jinman Kim, Joongjin Kook

This paper presents a non-learning-based, seed-dependent, semi-automatic pedestrian candidate generation pipeline for LiDAR point clouds. The proposed method is designed to support 3D annotation workflows by reducing irrelevant candidate clusters while improving the reliability of pedestrian candidate selection under distance-dependent point sparsity. The pipeline integrates distance-aware DBSCAN clustering, Single Template Matching (STM), and Centralized Point Augmentation (CPA). First, LiDAR points within the camera field of view are preprocessed, and pedestrian candidate clusters are generated using DBSCAN parameters configured according to distance intervals. Ground-snapping-based bounding-box refinement and height-based filtering are then applied to improve geometric consistency and reduce non-pedestrian candidates. In the second stage, STM compares PCA-aligned projected silhouettes of candidate clusters with a seed pedestrian template to suppress false positives. To address silhouette instability caused by sparse mid-range pedestrian points, CPA adds centroid-contracted points in the projection-relevant plane before template matching. Experiments on pedestrian-containing frames from the KITTI dataset show that STM improves precision from 27.6% to 60.5% and increases the F1-score from 36.8% to 51.4% compared with the initial DBSCAN-based candidate generation stage. The final CPA configuration improves recall from 44.7% to 46.7% and the overall F1-score from 51.4% to 52.1%, while revealing a precision–recall trade-off. Supplementary IoU analysis shows that the final DBSCAN–STM–CPA configuration maintains meaningful spatial overlap with pedestrian ground-truth boxes, achieving 88.9% at 3D IoU ≥ 0.10 and 81.6% at BEV IoU ≥ 0.25. Runtime analysis further shows that height-based filtering reduces the average per-frame processing time from 151.5 ms to 125.1 ms, while the final CPA configuration introduces only a small overhead, resulting in 126.2 ms per frame. These results demonstrate that the proposed DBSCAN–STM–CPA pipeline can provide reliable pedestrian candidates for semi-automatic 3D labeling without requiring class-specific detector training.

More from our Archive