Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Better Prior Knowledge Improves Human-Pose-Based Extrinsic Camera Calibration

Moliner, Olivier LU orcid ; Huang, Sangxia and Åström, Kalle LU orcid (2021) 2020 25th International Conference on Pattern Recognition In International Conference on Pattern Recognition p.4758-4765
Abstract
Accurate extrinsic calibration of wide baseline multi-camera systems enables better understanding of 3D scenes for many applications and is of great practical importance. Classical Structure-from-Motion calibration methods require special calibration equipment so that accurate point correspondences can be detected between different views. In addition, an operator with some training is usually needed to ensure that data is collected in a way that leads to good calibration accuracy. This limits the ease of adoption of such technologies. Recently, methods have been proposed to use human pose estimation models to establish point correspondences, thus removing the need for any special equipment. The challenge with this approach is that human... (More)
Accurate extrinsic calibration of wide baseline multi-camera systems enables better understanding of 3D scenes for many applications and is of great practical importance. Classical Structure-from-Motion calibration methods require special calibration equipment so that accurate point correspondences can be detected between different views. In addition, an operator with some training is usually needed to ensure that data is collected in a way that leads to good calibration accuracy. This limits the ease of adoption of such technologies. Recently, methods have been proposed to use human pose estimation models to establish point correspondences, thus removing the need for any special equipment. The challenge with this approach is that human pose estimation algorithms typically produce much less accurate feature points compared to classical patch-based methods. Another problem is that ambient human motion might not be optimal for calibration. We build upon prior works and introduce several novel ideas to improve the accuracy of human-pose-based extrinsic calibration. Our first contribution is a robust reprojection loss based on a better understanding of the sources of pose estimation error. Our second contribution is a 3D human pose likelihood model learned from motion capture data. We demonstrate significant improvements in calibration accuracy by evaluating our method on four publicly available datasets. (Less)
Abstract (Swedish)
Accurate extrinsic calibration of wide baseline multi-camera systems enables better understanding of 3D scenes for many applications and is of great practical importance. Classical Structure-from-Motion calibration methods require special calibration equipment so that accurate point correspondences can be detected between different views. In addition, an operator with some training is usually needed to ensure that data is collected in a way that leads to good calibration accuracy. This limits the ease of adoption of such technologies. Recently, methods have been proposed to use human pose estimation models to establish point correspondences, thus removing the need for any special equipment. The challenge with this approach is that human... (More)
Accurate extrinsic calibration of wide baseline multi-camera systems enables better understanding of 3D scenes for many applications and is of great practical importance. Classical Structure-from-Motion calibration methods require special calibration equipment so that accurate point correspondences can be detected between different views. In addition, an operator with some training is usually needed to ensure that data is collected in a way that leads to good calibration accuracy. This limits the ease of adoption of such technologies. Recently, methods have been proposed to use human pose estimation models to establish point correspondences, thus removing the need for any special equipment. The challenge with this approach is that human pose estimation algorithms typically produce much less accurate feature points compared to classical patch-based methods. Another problem is that ambient human motion might not be optimal for calibration. We build upon prior works and introduce several novel ideas to improve the accuracy of human-pose-based extrinsic calibration. Our first contribution is a robust reprojection loss based on a better understanding of the sources of pose estimation error. Our second contribution is a 3D human pose likelihood model learned from motion capture data. We demonstrate significant improvements in calibration accuracy by evaluating our method on four publicly available datasets. (Less)
Please use this url to cite or link to this publication:
author
; and
organization
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
host publication
2020 25th International Conference on Pattern Recognition (ICPR)
series title
International Conference on Pattern Recognition
pages
8 pages
publisher
IEEE - Institute of Electrical and Electronics Engineers Inc.
conference name
2020 25th International Conference on Pattern Recognition
conference location
Milan, Italy
conference dates
2021-01-10 - 2021-01-15
external identifiers
  • scopus:85110456147
ISSN
1051-4651
ISBN
978-1-7281-8808-9
DOI
10.1109/ICPR48806.2021.9411927
project
WASP: Wallenberg AI, Autonomous Systems and Software Program at Lund University
language
English
LU publication?
yes
id
a353c156-7910-483a-960c-abf85cf70ed4
date added to LUP
2021-05-27 11:13:32
date last changed
2023-09-12 20:25:44
@inproceedings{a353c156-7910-483a-960c-abf85cf70ed4,
  abstract     = {{Accurate extrinsic calibration of wide baseline multi-camera systems enables better understanding of 3D scenes for many applications and is of great practical importance. Classical Structure-from-Motion calibration methods require special calibration equipment so that accurate point correspondences can be detected between different views. In addition, an operator with some training is usually needed to ensure that data is collected in a way that leads to good calibration accuracy. This limits the ease of adoption of such technologies. Recently, methods have been proposed to use human pose estimation models to establish point correspondences, thus removing the need for any special equipment. The challenge with this approach is that human pose estimation algorithms typically produce much less accurate feature points compared to classical patch-based methods. Another problem is that ambient human motion might not be optimal for calibration. We build upon prior works and introduce several novel ideas to improve the accuracy of human-pose-based extrinsic calibration. Our first contribution is a robust reprojection loss based on a better understanding of the sources of pose estimation error. Our second contribution is a 3D human pose likelihood model learned from motion capture data. We demonstrate significant improvements in calibration accuracy by evaluating our method on four publicly available datasets.}},
  author       = {{Moliner, Olivier and Huang, Sangxia and Åström, Kalle}},
  booktitle    = {{2020 25th International Conference on Pattern Recognition (ICPR)}},
  isbn         = {{978-1-7281-8808-9}},
  issn         = {{1051-4651}},
  language     = {{eng}},
  month        = {{05}},
  pages        = {{4758--4765}},
  publisher    = {{IEEE - Institute of Electrical and Electronics Engineers Inc.}},
  series       = {{International Conference on Pattern Recognition}},
  title        = {{Better Prior Knowledge Improves Human-Pose-Based Extrinsic Camera Calibration}},
  url          = {{http://dx.doi.org/10.1109/ICPR48806.2021.9411927}},
  doi          = {{10.1109/ICPR48806.2021.9411927}},
  year         = {{2021}},
}