MiPower Software Modeling Videotutorial

Pusa: Thousands Timesteps Video Diffusion Model

Text-to-Video, Image-to-Video, Start-End Frames, Video Completion, Video Extension, Video Transition, and more. Below are some showcases for Pusa-Wan2.2-V1. Please refer to Pusa V1.0 README for ...

GitHub

FAR: Frame Autoregressive Model for Both Short- and Long-Context Video Modeling

🔥 FAR leverages clean visual context without additional image-to-video fine-tuning: Unconditional pretraining on UCF-101 achieves state-of-the-art results in both video generation (context frame = 0) ...

IEEE

Transformer-Based Model for Monocular Visual Odometry: A Video Understanding Approach

Abstract: Estimating the camera’s pose given images from a single camera is a traditional task in mobile robots and autonomous vehicles. This problem is called monocular visual odometry and often ...

TechCrunch

Meta is developing a new image and video model for a 2026 release, report says

It’s all hands on deck at Meta as the company develops new AI models under its superintelligence lab, led by Scale AI co-founder Alexandr Wang. The company is now working on an image and video model ...

Wall Street Journal

Meta Is Developing a New AI Image and Video Model Code-Named ‘Mango’

AI tools like Google’s Veo 3 and Runway can now create strikingly realistic video. WSJ’s Joanna Stern and Jarrard Cole put them to the test in a film made almost entirely with AI. Watch the film and ...

TechCrunch

Luma releases a new AI model that lets users generate a video from a start and end frame

Luma, the a16z-backed AI video and 3D model company, released a new model called Ray3 Modify that allows users to modify existing footage by providing character reference images that preserve the ...

about.fb

Our New SAM Audio Model Transforms Audio Editing

SAM Audio is the first unified AI model that can segment sound from complex audio mixtures using text, visual, and time span prompts. This technology has the potential to transform audio and video ...

IEEE

MMA: Video Reconstruction for Spike Camera Based on Multiscale Temporal Modeling and Fine-Grained Attention

Abstract: This paper presents a Multiscale Temporal Correlation Learning with the Mamba-Fused Attention Model (MMA), an efficient and effective method for reconstructing a video clip from a spike ...
