Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Semantic Synthesis of Pedestrian Locomotion

Priisalu, Maria LU ; Paduraru, Ciprian ; Pirinen, Aleksis LU and Sminchisescu, Cristian LU (2021) 15th Asian Conference on Computer Vision, ACCV 2020 In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 12623 LNCS. p.470-487
Abstract

We present a model for generating 3d articulated pedestrian locomotion in urban scenarios, with synthesis capabilities informed by the 3d scene semantics and geometry. We reformulate pedestrian trajectory forecasting as a structured reinforcement learning (RL) problem. This allows us to naturally combine prior knowledge on collision avoidance, 3d human motion capture and the motion of pedestrians as observed e.g. in Cityscapes, Waymo or simulation environments like Carla. Our proposed RL-based model allows pedestrians to accelerate and slow down to avoid imminent danger (e.g. cars), while obeying human dynamics learnt from in-lab motion capture datasets. Specifically, we propose a hierarchical model consisting of a semantic trajectory... (More)

We present a model for generating 3d articulated pedestrian locomotion in urban scenarios, with synthesis capabilities informed by the 3d scene semantics and geometry. We reformulate pedestrian trajectory forecasting as a structured reinforcement learning (RL) problem. This allows us to naturally combine prior knowledge on collision avoidance, 3d human motion capture and the motion of pedestrians as observed e.g. in Cityscapes, Waymo or simulation environments like Carla. Our proposed RL-based model allows pedestrians to accelerate and slow down to avoid imminent danger (e.g. cars), while obeying human dynamics learnt from in-lab motion capture datasets. Specifically, we propose a hierarchical model consisting of a semantic trajectory policy network that provides a distribution over possible movements, and a human locomotion network that generates 3d human poses in each step. The RL-formulation allows the model to learn even from states that are seldom exhibited in the dataset, utilizing all of the available prior and scene information. Extensive evaluations using both real and simulated data illustrate that the proposed model is on par with recent models such as S-GAN, ST-GAT and S-STGCNN in pedestrian forecasting, while outperforming these in collision avoidance. We also show that our model can be used to plan goal reaching trajectories in urban scenes with dynamic actors.

(Less)
Please use this url to cite or link to this publication:
author
; ; and
organization
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
host publication
Computer Vision – ACCV 2020 - 15th Asian Conference on Computer Vision, 2020, Revised Selected Papers
series title
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
editor
Ishikawa, Hiroshi ; Liu, Cheng-Lin ; Pajdla, Tomas and Shi, Jianbo
volume
12623 LNCS
pages
18 pages
publisher
Springer Science and Business Media B.V.
conference name
15th Asian Conference on Computer Vision, ACCV 2020
conference location
Virtual, Online
conference dates
2020-11-30 - 2020-12-04
external identifiers
  • scopus:85103292089
ISSN
0302-9743
1611-3349
ISBN
9783030695316
DOI
10.1007/978-3-030-69532-3_29
project
Modelling Pedestrians in Autonomous Vehicle Testing
language
English
LU publication?
yes
id
0d2faa60-34ef-4c43-a4f7-718a6f695001
date added to LUP
2021-04-08 12:32:45
date last changed
2025-04-04 15:04:45
@inproceedings{0d2faa60-34ef-4c43-a4f7-718a6f695001,
  abstract     = {{<p>We present a model for generating 3d articulated pedestrian locomotion in urban scenarios, with synthesis capabilities informed by the 3d scene semantics and geometry. We reformulate pedestrian trajectory forecasting as a structured reinforcement learning (RL) problem. This allows us to naturally combine prior knowledge on collision avoidance, 3d human motion capture and the motion of pedestrians as observed e.g. in Cityscapes, Waymo or simulation environments like Carla. Our proposed RL-based model allows pedestrians to accelerate and slow down to avoid imminent danger (e.g. cars), while obeying human dynamics learnt from in-lab motion capture datasets. Specifically, we propose a hierarchical model consisting of a semantic trajectory policy network that provides a distribution over possible movements, and a human locomotion network that generates 3d human poses in each step. The RL-formulation allows the model to learn even from states that are seldom exhibited in the dataset, utilizing all of the available prior and scene information. Extensive evaluations using both real and simulated data illustrate that the proposed model is on par with recent models such as S-GAN, ST-GAT and S-STGCNN in pedestrian forecasting, while outperforming these in collision avoidance. We also show that our model can be used to plan goal reaching trajectories in urban scenes with dynamic actors.</p>}},
  author       = {{Priisalu, Maria and Paduraru, Ciprian and Pirinen, Aleksis and Sminchisescu, Cristian}},
  booktitle    = {{Computer Vision – ACCV 2020 - 15th Asian Conference on Computer Vision, 2020, Revised Selected Papers}},
  editor       = {{Ishikawa, Hiroshi and Liu, Cheng-Lin and Pajdla, Tomas and Shi, Jianbo}},
  isbn         = {{9783030695316}},
  issn         = {{0302-9743}},
  language     = {{eng}},
  pages        = {{470--487}},
  publisher    = {{Springer Science and Business Media B.V.}},
  series       = {{Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)}},
  title        = {{Semantic Synthesis of Pedestrian Locomotion}},
  url          = {{http://dx.doi.org/10.1007/978-3-030-69532-3_29}},
  doi          = {{10.1007/978-3-030-69532-3_29}},
  volume       = {{12623 LNCS}},
  year         = {{2021}},
}