[task] 3D Pose Estimation (in progress)

Definition

3D human pose estimation(HPE) task는 human body keypoint의 coordinate를 3d space에서 predict하는 task이다. 2d HPE task와 같이 joint의 spatial location으로 pose를 나타낸다. 또한 single-person과 multi-person estimation으로 나눌 수 있다.

Single Person 3D Pose Estimation

single person 3d pose estimation은 주로 두 가지 방법으로 진행된다:

direct estimation
2d to 3d lifting method

Multi-person 3D Pose Estimation in Videos

multi-person 3d pose estimation은 top-down 방식과 bottom-up 방식으로 나뉜다. bottom-up 방식은 estimation을 먼저 하고 association을 하는 방식이고, top-down 방식은 detection 후 estimation을 하는 방식이다.

top-down에서는 every person을 detect하고 bounding box 안에서 모든 keypoint를 찾는 방식이다. 이는 bounding box를 찾는데 robust하다는 특징이 있다.

bottom-up 방식에서는 scale variation을 다루는데 더 robust하다.

References

Footnotes

'DL·ML > Paper' 카테고리의 다른 글

MotionBERT (ICCV 2023) (0)	2024.03.26
Grounded SAM (0)	2024.03.25
VARS(SoccerNet) (0)	2024.03.22
HQ-SAM (0)	2024.03.20
3D vision, PointNet (0)	2024.03.19

Definition

Single Person 3D Pose Estimation

Multi-person 3D Pose Estimation in Videos

References

Footnotes

'DL·ML > Paper' 카테고리의 다른 글

티스토리툴바