Abstract
Ambient assisted living (AAL) systems aim to improve the safety, comfort, and quality of life of the people they support, with particular attention to prolonging personal independence in the later stages of life. Human activity recognition (HAR) plays a crucial role in enabling AAL systems to recognise and understand human actions. Multi-view human activity recognition (MV-HAR) techniques are particularly useful for AAL systems because they use information from multiple sensors to capture different perspectives of human activities, which can improve the robustness and accuracy of activity recognition.
In this work, we propose a lightweight activity recognition pipeline that uses skeleton data from multiple perspectives to combine the advantages of both approaches and thereby enhance an assistive robot's perception of human activity. The pipeline covers data sampling, the choice of input data type, and the representation and classification methods. For classification, we use a modified version of the classic LeNet model (M-LeNet) and a Vision Transformer (ViT). Experimental evaluation on a multi-perspective dataset of human activities in the home (RH-HAR-SK) compares the performance of these two models and indicates that combining camera views can improve recognition accuracy. Furthermore, our pipeline provides a more efficient and scalable solution in the AAL context, where bandwidth and computing resources are often limited.
Original language | English |
---|---|
Title of host publication | ACHI 2023: The Sixteenth International Conference on Advances in Computer-Human Interactions |
Place of Publication | Venice, Italy |
Publisher | IARIA |
ISBN (Electronic) | 978-1-68558-078-0 |
Publication status | Published - 28 Apr 2023 |
Event | ACHI 2023: The Sixteenth International Conference on Advances in Computer-Human Interactions - Venice, Italy. Duration: 24 Apr 2023 → 28 Apr 2023. Conference number: 16. https://www.iaria.org/conferences2023/ACHI23.html |
Conference
Conference | ACHI 2023: The Sixteenth International Conference on Advances in Computer-Human Interactions |
---|---|
Abbreviated title | ACHI 2023 |
Country/Territory | Italy |
City | Venice |
Period | 24/04/23 → 28/04/23 |
Internet address | https://www.iaria.org/conferences2023/ACHI23.html |