Abstract: This paper shows that masked autoencoders (MAE) are scalable self-supervised learners for computer vision. Our MAE approach is simple: we mask random patches of the input image and ...
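The masking step named in the abstract can be illustrated with a minimal sketch. It covers only the selection of random patches to hide, since the excerpt is cut off before describing what follows. The function name `random_patch_mask`, the 14 x 14 patch grid, and the mask ratio of 0.75 are illustrative assumptions and are not stated in the excerpt.

```python
import random


def random_patch_mask(grid_h, grid_w, mask_ratio, seed=None):
    """Split the patches of a grid_h x grid_w patch grid into visible and masked sets.

    Hypothetical helper for illustration; patches are identified by their
    row-major index in the grid.
    """
    rng = random.Random(seed)
    num_patches = grid_h * grid_w
    num_masked = int(num_patches * mask_ratio)

    # Shuffle all patch indices, then hide the first num_masked of them.
    order = list(range(num_patches))
    rng.shuffle(order)
    masked = sorted(order[:num_masked])
    visible = sorted(order[num_masked:])
    return visible, masked


# Assumed values: a 14 x 14 grid of patches and a mask ratio of 0.75.
visible, masked = random_patch_mask(14, 14, mask_ratio=0.75, seed=0)
print(len(visible), len(masked))
```

Sampling without replacement through a shuffle guarantees that every patch ends up in exactly one of the two sets, so the visible and masked indices never overlap.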