方法对比

并排查看您选择的方法；存在差异的行会高亮显示。

	掩码自编码器 ×	Swin Transformer ×
领域	深度学习	深度学习
方法族	Machine learning	Machine learning
起源年份	2021	2021
提出者≠	Kaiming He	Ze Liu
类型	Neural network architecture	Neural network architecture
开创性文献≠	He, K., Chen, X., Xie, S., Li, Y., Dollár, P., & Girshick, R. (2022). Masked autoencoders are scalable vision learners. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 16000-16009). DOI ↗	Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., & Guo, B. (2021). Swin Transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 10012-10022). DOI ↗
别名	MAE, Vision MAE	Swin, Hierarchical Vision Transformer
相关	4	4
摘要≠	Masked Autoencoders (MAE) is a self-supervised learning approach introduced by He et al. in 2021 that masks random patches of an image and trains a model to reconstruct the missing content. Adapting the masked language modeling paradigm from NLP to vision, MAE learns rich visual representations by solving a challenging reconstruction task without requiring labels.	The Swin Transformer is a hierarchical vision transformer introduced by Liu et al. in 2021 that uses shifted window attention to achieve computational efficiency while maintaining strong performance on computer vision tasks. Unlike the original Vision Transformer which applies global self-attention, Swin uses local window-based attention with periodic shifting to balance expressiveness and efficiency.
ScholarGate数据集 ↗	v1 1 来源 PUBLISHED	v1 1 来源 PUBLISHED

前往搜索 → 下载幻灯片