Visual Objects Programming

Magma: A Foundation Model for Multimodal AI Agents

We present Magma, a foundation model that serves multimodal AI agentic tasks in both the digital and physical worlds. Magma is a significant extension of vision-language (VL) models in that it not ...

IEEE

AIVIO: Closed-Loop, Object-Relative Navigation of UAVs With AI-Aided Visual Inertial Odometry

Abstract: Object-relative mobile robot navigation is essential for a variety of tasks, e.g. autonomous critical infrastructure inspection, but requires the capability to extract semantic information ...

IEEE

Camouflaged Object Detection via Complementary Information-Selected Network Based on Visual and Semantic Separation

Abstract: Camouflaged object detection (COD) is a promising yet challenging task that aims to segment objects concealed within intricate surroundings, a capability crucial for modern industrial ...
