Self-Supervised 3D Human Localization using Canonical Pose

Kim, Tae Hyung; Park, In Kyu

doi:10.1109/ICEIC69189.2026.11386354

상세 보기

Self-Supervised 3D Human Localization using Canonical Pose

Kim, Tae Hyung;
Park, In Kyu

Citations

SCOPUS

0

초록

Estimating absolute 3D human location from a single RGB image is inherently ambiguous due to inter-subject height variation and pose-induced scale changes. We present a self-supervised canonicalizer that learns, from 2D keypoints only, to (i) map an input pose to a canonical (near-upright) reference and (ii) predict a pose correction scalar that compensates for scale collapse caused by crouching and yaw rotation. Placed before MonoLoco++ as a preprocessing stage, our module stabilizes depth regression by acting as an input normalization mechanism. We benchmark on KITTI and report ALE (Average Localization Error). On KITTI, the All ALE improves from 0.74 m to 0.72 m. Similar gains are observed when training on nuScenes and evaluating on KITTI. The method requires no 3D ground truth and preserves real-time performance. © 2026 IEEE.

키워드

3D human localization; human pose; selfsupervision

제목: Self-Supervised 3D Human Localization using Canonical Pose

저자: Kim, Tae Hyung; Park, In Kyu

DOI: 10.1109/ICEIC69189.2026.11386354

발행일: 2026

유형: Conference paper

저널명: 2026 International Conference on Electronics, Information, and Communication, ICEIC 2026