
Swin Transformer

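Swin Transformer is a hierarchical vision transformer introduced by Liu et al. in 2021. It uses shifted-window attention to achieve computational efficiency while maintaining strong performance on computer vision tasks. Unlike the original Vision Transformer, which applies global self-attention across the whole image, Swin restricts attention to local windows and periodically shifts those windows, balancing expressiveness against efficiency.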
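The window-then-shift idea can be illustrated with a minimal NumPy sketch. It is a sketch under stated assumptions, not the authors' implementation: the function names, the 8x8 feature map, the window size of 4, and the half-window shift are illustrative choices, and the learned query/key/value projections, the relative position bias, the attention mask normally applied to shifted windows, and the hierarchical patch merging are all omitted.

```python
import numpy as np


def window_partition(x, window_size):
    """Split a (H, W, C) feature map into non-overlapping square windows.

    Returns an array of shape (num_windows, window_size * window_size, C).
    """
    H, W, C = x.shape
    assert H % window_size == 0 and W % window_size == 0
    x = x.reshape(H // window_size, window_size, W // window_size, window_size, C)
    x = x.transpose(0, 2, 1, 3, 4)  # group the two window-grid axes together
    return x.reshape(-1, window_size * window_size, C)


def shifted_window_partition(x, window_size):
    """Cyclically shift the map by half a window, then partition it.

    Assumption: a half-window cyclic shift; tokens that wrap around the
    border are NOT masked here, which a full implementation would do.
    """
    shift = window_size // 2
    rolled = np.roll(x, (-shift, -shift), axis=(0, 1))
    return window_partition(rolled, window_size)


def window_attention(windows):
    """Softmax self-attention computed independently inside each window.

    Simplification: the tokens themselves act as queries, keys and values.
    """
    d = windows.shape[-1]
    scores = windows @ windows.transpose(0, 2, 1) / np.sqrt(d)
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ windows


rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 8, 4))          # hypothetical 8x8 map, 4 channels

regular = window_partition(feat, 4)            # windows for one layer
shifted = shifted_window_partition(feat, 4)    # windows for the next layer
out = window_attention(regular)

# Number of query-key pairs scored: global attention vs. windowed attention.
pairs_global = (8 * 8) ** 2
pairs_windowed = regular.shape[0] * regular.shape[1] ** 2
print(pairs_global, pairs_windowed)
```

In this toy setting, windowed attention scores 1024 token pairs where global attention would score 4096, and the gap widens as the image grows because the window size stays fixed. Shifting the grid between layers lets tokens that sat in different windows attend to each other in the next layer.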


Source

Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., & Guo, B. (2021). Swin Transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 10012-10022). DOI: 10.1109/ICCV48922.2021.00986

How to cite this page

ScholarGate. (2026, June 3). Shifted Window Transformer for Vision. ScholarGate. https://scholargate.app/ko/deep-learning/swin-transformer

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

DETR (Detection Transformer): Deep Learning
Masked Autoencoders: Deep Learning
Vision Mamba: Deep Learning
Vision Transformer: Deep Learning


Entries that reference this method

DETR (Detection Transformer), Few-Shot Object Detection, Masked Autoencoders, Segment Anything Model, SimCLR, Spatial-Temporal Graph Convolutional Network, Vision Mamba
