Do computer vision foundation models learn the low-level characteristics of the human visual system?
Abstract: Computer vision foundation models, such as DINO or OpenCLIP, are trained in a self-supervised manner on large image datasets. Analogously, substantial evidence suggests that the human visual ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results