Abstract: The aim of the violence recognition task is to determine whether a video contains violent behavior. Given that violent behavior is often accompanied by visual and audio anomalies, multimodal ...
Abstract: Affective Video Facial Analysis (AVFA) is important for advancing emotion-aware AI, yet the persistent data scarcity in AVFA presents challenges. Recently, the self-supervised learning (SSL) ...
Abstract: Weakly-supervised audio-visual video parsing (WS-AVVP) aims to localize the temporal extents of audio, visual and audio-visual event instances as well as identify the corresponding event ...