학술논문

Cofopose: Conditional 2D Pose Estimation with Transformers.

Document Type

Article

Author

Aidoo, Evans; Wang, Xun; Liu, Zhenguang; Tenagyei, Edwin Kwadwo; Owusu-Agyemang, Kwabena; Kodjiku, Seth Larweh; Ejianya, Victor Nonso; Aggrey, Esther Stacy E. B.

Source

Sensors (14248220). Sep2022, Vol. 22 Issue 18, p6821-N.PAG. 17p.

Subject

*Artificial vision
*Computer vision
*Artificial intelligence

Language

ISSN

1424-8220

Abstract

Human pose estimation has long been a fundamental problem in computer vision and artificial intelligence. Prominent among the 2D human pose estimation (HPE) methods are the regression-based approaches, which have been proven to achieve excellent results. However, the ground-truth labels are usually inherently ambiguous in challenging cases such as motion blur, occlusions, and truncation, leading to poor performance measurement and lower levels of accuracy. In this paper, we propose Cofopose, which is a two-stage approach consisting of a person and keypoint detection transformers for 2D human pose estimation. Cofopose is composed of conditional cross-attention, a conditional DEtection TRansformer (conditional DETR), and an encoder-decoder in the transformer framework; this allows it to achieve person and keypoint detection. In a significant departure from other approaches, we use conditional cross-attention and fine-tune conditional DETR for our person detection, and encoder-decoders in the transformers for our keypoint detection. Cofopose was extensively evaluated using two benchmark datasets, MS COCO and MPII, achieving an improved performance with significant margins over the existing state-of-the-art frameworks. [ABSTRACT FROM AUTHOR]

Online Access

EBSCOHost PDF Full Text (Gale Academic Onefile) Full Text (ProQuest Central) JCR 저널정보 Scopus Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송