Abstract
Hand detection and pose estimation are prominent problems in computer vision. They have applications in augmented and virtual reality, human-robot interaction, and gesture recognition which can be incorporated into controlling various interfaces, such as those used in assistive technology. The hand detection problem involves three sub-problems, i.e. hand localisation, hand classification, and pose estimation. Different hand detection methods approach this problem in multiple stages. However, there is a scope to train an end-to-end network that addresses these three problems at once. In this paper, we contribute to hand detection, classification, and pose estimation by first modifying the FreiHAND dataset to ensure both left and right hand images, along with their annotations, are present for training. Then, we train the YOLOv8-pose networks from nano to extra-large sizes to perform a comparative study of the performance of each network. Further, we perform quantitative and qualitative analysis on three public hand datasets that shows the strengths and limitations of YOLOv8-pose networks. Our experiments on training YOLOv8-pose networks from nano to extra-large sizes showed that the mean average precision score increases with the network size. We also conclude that the ratio of hand size to the image size in training affects the confidence score and classification during inference detection.
Original language | English |
---|---|
Title of host publication | 2024 IEEE Conference on Engineering Informatics, ICEI 2024 |
Place of Publication | Australia |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Number of pages | 6 |
ISBN (Electronic) | 979-8-3315-0577-6 |
DOIs | |
Publication status | Published - 12 Mar 2025 |
Event | The IEEE Conference on Engineering Informatics 2024 - Swinburne University of Technology, Melbourne , Australia Duration: 20 Nov 2024 → 21 Nov 2024 https://attend.ieee.org/engineering-informatics-2023/ |
Publication series
Name | 2024 IEEE Conference on Engineering Informatics, ICEI 2024 |
---|
Conference
Conference | The IEEE Conference on Engineering Informatics 2024 |
---|---|
Country/Territory | Australia |
City | Melbourne |
Period | 20/11/24 → 21/11/24 |
Internet address |
Keywords
- YOLO
- deep learning
- hand detection
- hand pose estimation