ImgEdit is a large-scale, high-quality image-editing dataset comprising 1.2 million carefully curated edit pairs, which contain both novel and complex single-turn edits, as well as challenging ...
Abstract: In hazy weather with high humidity, insulators in overhead transmission lines are prone to flashover. However, fog reduces the sensitivity of image detection, which is a commonly used method ...
Abstract: This article introduces a task named visual grounding of remote sensing ship (VGRSS) images. The goal of VGRSS is to locate ship objects in remote sensing images guided by natural language.
The Git-10M dataset is a global-scale dataset, consisting of 10.5 million image-text pairs with geographical locations and resolution information. You can skip the following steps if you have higher ...