Abstract
With the recent increased development of deep neural networks and dataset capabilities, the Human Action Recognition (HAR) domain is growing rapidly in terms of both the available datasets and deep models. Despite this, there are some lacks at datasets specifically covering the Robotics field and Human-Robot interaction. We prepare and introduce a new multi-view dataset to address this. The Robot House Multi-View dataset (RHM) contains four views: Front, Back, Ceiling, and Robot Views. There are 14 classes with 6701 video clips for each view, making a total of 26804 video clips for the four views. The lengths of the video clips are between 1 to 5 seconds. The videos with the same number and the same classes are synchronized in different views.
In the second part of this paper, we consider how single streams afford activity recognition using established state-of-the-art models. We then assess the affordance for each of the views based on information theoretic modelling and mutual information concept. Furthermore, we benchmark the performance of different views, thus establishing the strengths and weaknesses of each view relevant to their information content and performance of the benchmark. Our results lead us to conclude that multi-view and multi-stream activity recognition has the added potential to improve activity recognition results.
In the second part of this paper, we consider how single streams afford activity recognition using established state-of-the-art models. We then assess the affordance for each of the views based on information theoretic modelling and mutual information concept. Furthermore, we benchmark the performance of different views, thus establishing the strengths and weaknesses of each view relevant to their information content and performance of the benchmark. Our results lead us to conclude that multi-view and multi-stream activity recognition has the added potential to improve activity recognition results.
Original language | English |
---|---|
Title of host publication | ACHI 2023: The Sixteenth International Conference on Advances in Computer-Human Interactions |
Place of Publication | Venice, Italy |
Publisher | IARIA |
Number of pages | 7 |
ISBN (Electronic) | 978-1-68558-078-0 |
Publication status | Published - 28 Apr 2023 |
Event | ACHI 2023: The Sixteenth International Conference on Advances in Computer-Human Interactions - Venice, Italy Duration: 24 Apr 2023 → 28 Apr 2023 Conference number: 16 https://www.iaria.org/conferences2023/ACHI23.html |
Conference
Conference | ACHI 2023: The Sixteenth International Conference on Advances in Computer-Human Interactions |
---|---|
Abbreviated title | ACHI 2023 |
Country/Territory | Italy |
City | Venice |
Period | 24/04/23 → 28/04/23 |
Internet address |