Better Prior Knowledge Improves Human-Pose-Based Extrinsic Camera Calibration

Moliner, Olivier; Huang, Sangxia; Åström, Kalle (2021-05-05). Better Prior Knowledge Improves Human-Pose-Based Extrinsic Camera Calibration 2020 25th International Conference on Pattern Recognition (ICPR), 4758 - 4765. 2020 25th International Conference on Pattern Recognition. Milan, Italy: IEEE - Institute of Electrical and Electronics Engineers Inc.
Download:
DOI:
Conference Proceeding/Paper | Published | English
Authors:
Moliner, Olivier ; Huang, Sangxia ; Åström, Kalle
Department:
Mathematics (Faculty of Engineering)
Mathematical Imaging Group
eSSENCE: The e-Science Collaboration
ELLIIT: the Linköping-Lund initiative on IT and mobile communication
Project:
WASP: Wallenberg AI, Autonomous Systems and Software Program at Lund University
Research Group:
Mathematical Imaging Group
Abstract:
Accurate extrinsic calibration of wide baseline multi-camera systems enables better understanding of 3D scenes for many applications and is of great practical importance. Classical Structure-from-Motion calibration methods require special calibration equipment so that accurate point correspondences can be detected between different views. In addition, an operator with some training is usually needed to ensure that data is collected in a way that leads to good calibration accuracy. This limits the ease of adoption of such technologies. Recently, methods have been proposed to use human pose estimation models to establish point correspondences, thus removing the need for any special equipment. The challenge with this approach is that human pose estimation algorithms typically produce much less accurate feature points compared to classical patch-based methods. Another problem is that ambient human motion might not be optimal for calibration. We build upon prior works and introduce several novel ideas to improve the accuracy of human-pose-based extrinsic calibration. Our first contribution is a robust reprojection loss based on a better understanding of the sources of pose estimation error. Our second contribution is a 3D human pose likelihood model learned from motion capture data. We demonstrate significant improvements in calibration accuracy by evaluating our method on four publicly available datasets.

Accurate extrinsic calibration of wide baseline multi-camera systems enables better understanding of 3D scenes for many applications and is of great practical importance. Classical Structure-from-Motion calibration methods require special calibration equipment so that accurate point correspondences can be detected between different views. In addition, an operator with some training is usually needed to ensure that data is collected in a way that leads to good calibration accuracy. This limits the ease of adoption of such technologies. Recently, methods have been proposed to use human pose estimation models to establish point correspondences, thus removing the need for any special equipment. The challenge with this approach is that human pose estimation algorithms typically produce much less accurate feature points compared to classical patch-based methods. Another problem is that ambient human motion might not be optimal for calibration. We build upon prior works and introduce several novel ideas to improve the accuracy of human-pose-based extrinsic calibration. Our first contribution is a robust reprojection loss based on a better understanding of the sources of pose estimation error. Our second contribution is a 3D human pose likelihood model learned from motion capture data. We demonstrate significant improvements in calibration accuracy by evaluating our method on four publicly available datasets.
ISBN:
978-1-7281-8808-9
ISSN:
1051-4651
LUP-ID:
a353c156-7910-483a-960c-abf85cf70ed4 | Link: https://lup.lub.lu.se/record/a353c156-7910-483a-960c-abf85cf70ed4 | Statistics

Cite this