Understanding Video Transcoding

Online Video Understanding: OVBench and VideoChat-Online

Abstract: Multimodal Large Language Models (MLLMs) have significantly progressed in offline video understanding. However, applying these models to real-world scenarios, such as autonomous driving and ...

IEEE

BOLT: Boost Large Vision-Language Model Without Training for Long-Form Video Understanding

Abstract: Large video-language models (VLMs) have demonstrated promising progress in various video understanding tasks. However, their effectiveness in long-form video analysis is constrained by ...
