Machine learningDeep Learning, Self-Supervised Learning

Masked Autoencoders

Masked Autoencoders (MAE) is a self-supervised learning approach introduced by He et al. in 2021 that masks random patches of an image and trains a model to reconstruct the missing content. Adapting the masked language modeling paradigm from NLP to vision, MAE learns rich visual representations by solving a challenging reconstruction task without requiring labels.

Open in MethodMindSoonVideoSoon

Read the full method

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. He, K., Chen, X., Xie, S., Li, Y., Dollár, P., & Girshick, R. (2022). Masked autoencoders are scalable vision learners. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 16000-16009). DOI: 10.1109/CVPR52688.2022.01553

Related methods

Referenced by

ScholarGateMasked Autoencoders (Masked Autoencoders are Scalable Vision Learners). Retrieved 2026-06-04 from https://scholargate.app/en/deep-learning/masked-autoencoders