Fine Grained Action Recognition of Skateboarding Tricks

Frederik Calsius, Mirela Popa, Alexia Briassouli

Research output: Contribution to conferencePaperAcademic

Abstract

In the field of machine learning, it is common practice to use benchmark datasets to prove the working of a method. The domain of action recognition in videos often uses datasets like Kinet-ics, Something-Something, UCF-101 and HMDB-51 to report results. Considering the properties of the datasets, there are no datasets that focus solely on very short clips (2 to 3 seconds), and on highly-similar fine-grained actions within one specific domain. This paper researches how current state-of-the-art action recognition methods perform on a dataset that consists of highly similar, fine-grained actions. To do so, a dataset of skateboarding tricks was created. The performed analysis highlights both benefits and limitations of state-of-the-art methods, while proposing future research directions in the activity recognition domain. The conducted research shows that the best results are obtained by fusing RGB data with OpenPose data for the Temporal Shift Module.

Conference

Conference15. International Conference on Computer Vision and Image Processing- Paris | World Academy of Science, Engineering and Technology
Abbreviated titleICCVIP 2021
Country/TerritoryFrance
CityParis
Period30/12/2131/12/21
Internet address

Cite this