Learning JavaScript Video

Live: Learning Video LLM with Streaming Speech Transcription at Scale

Abstract: Recent video large language models (Video LLMs) often depend on costly human annotations or proprietary APIs (e.g., GPT-4o) to produce training data, which limits their training at scale. In ...

IEEE

Deep Learning-Based Object Tracking in Satellite Videos: A comprehensive survey with a new dataset

Abstract: As a fundamental task for research in satellite videos (SVs), object tracking is used to track the target of interest in traffic evaluation, military security, and so forth. The current ...

Ars Technica’s Top 20 video games of 2025

A mix of expected sequels and out-of-nowhere indie gems made 2025 a joy.
