Abstract: Transformer-based object detection models usually adopt an encoding-decoding architecture that mainly combines self-attention (SA) and multilayer perceptron (MLP). Although this architecture ...
Abstract: This letter focuses on leveraging the object information in images to improve the performance of the U-Net based change detector. Change detection is fundamental to many computer vision ...
New VOS project: Putting the Object Back into Video Object Segmentation: https://github.com/hkchengrex/Cutie We frame Video Object Segmentation (VOS), first and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results