HTML CSS Slider Image and Text

Iterative Adversarial Attack on Image-Guided Story Ending Generation

Abstract: Multimodal learning involves developing models that can integrate information from various sources like images and texts. In this field, multimodal text generation is a crucial aspect that ...

IEEE

AMITA: Attribute-Guided Masked Image-Text Alignment for Multi-Label Image Representation

Abstract: Multi-label image classification, which involves recognizing multiple objects within a single image, is a fundamental task in computer vision. Recently, Visual-Language Models (VLMs) have ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Iterative Adversarial Attack on Image-Guided Story Ending Generation

AMITA: Attribute-Guided Masked Image-Text Alignment for Multi-Label Image Representation

Trending now