Abstract: Recent video large language models (Video LLMs) often depend on costly human annotations or proprietary APIs (e.g., GPT-4o) to produce training data, which limits their training at scale. In ...
Abstract: As a fundamental task for research in satellite videos (SVs), object tracking is used to track the target of interest in traffic evaluation, military security, and so forth. The current ...
A mix of expected sequels and out-of-nowhere indie gems made 2025 a joy.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results