We introduce a multi-label video recognition task focused on football referee decisions. This task involves multi-task classification of multi-view videos. You must assign two labels for each multi-view action: the first label determines whether an action is a foul, along with its corresponding severity, and the second label identifies the type of action.
We introduce a multi-label video recognition task focused on football referee decisions. This task involves multi-task classification of multi-view videos. You must assign two labels for each multi-view action: the first label determines whether an action is a foul, along with its corresponding severity, and the second label identifies the type of action.
First label: {No Offence, Offence + No Card, Offence + Yellow Card, Offence + Red Card} Second label: {Standing Tackle, Tackle, Holding, Pushing, Challenge, Dive, High Leg, Elbowing}
The data contains over 3,000 multi-view videos for training, validation, and testing and 273 actions for the challenge set.
The metric for evaluating your model is the mean of the two balanced accuracies for the two tasks.
@inproceedings{held2023vars,
title={VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views},
author={Held, Jan and Cioppa, Anthony and Giancola, Silvio and Hamdi, Abdullah and Ghanem, Bernard and Van Droogenbroeck, Marc},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={5085--5096},
year={2023}
}