Abstract: Vision Transformers (ViTs) have demonstrated exceptional performance in various vision tasks. However, they tend to underperform on smaller datasets due to their inherent lack of inductive ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results