3D Associative Embedding: Multi-View 3D Human Pose Estimation in Crowded Scenes

摘要

Most of the existing multi-view multi-person 3D human pose estimation methods predict the location of each joint of one target person following a top-down paradigm after finding his region. However, these works neglect the interference of others’ joints in the region. When the scene is crowded and the target person is surrounded by others, the information of his joints tends to be disturbed which results in significant errors in 3D results. To overcome this problem, this paper takes advantage of a bottom-up method in 2D pose estimation. We incorporate the Associative Embedding method into 3D pose estimation and propose a Voxel Hourglass Network to predict 3D heatmaps along with 3D tag-maps. As a result, the adverse effects from surrounding persons can be eliminated through the difference between tags. Moreover, we design a three-stage coarse-to-fine framework which can effectively reduce the quantization error. The size of the search space drops at each stage while the resolution increases. We test our method on the CMU Panoptic dataset where it outperforms the related top-down methods.

出版物
In Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things
Zhiyi Zhu(朱治亦)
硕士(2020-2023)

简略介绍

Sheng Liu(刘晟)
Sheng Liu(刘晟)
硕博连读(2021-)

简略介绍

Jianghai Shuai(帅江海)
Jianghai Shuai(帅江海)
硕士(2022-)

简略介绍

Sidan Du(都思丹)
Sidan Du(都思丹)
教授

简略介绍

Yang Li(李杨)
Yang Li(李杨)
副教授

简略介绍