Σύγκριση μεθόδων
Εξετάστε τις επιλεγμένες μεθόδους δίπλα-δίπλα· οι γραμμές που διαφέρουν επισημαίνονται.
| GAN προσαρμογής πεδίου× | Προσαρμοστικός Μετασχηματιστής Όρασης (Domain-Adaptive Vision Transformer)× | |
|---|---|---|
| Πεδίο | Βαθιά Μάθηση | Βαθιά Μάθηση |
| Οικογένεια | Machine learning | Machine learning |
| Έτος προέλευσης≠ | 2016–2017 | 2021–2023 |
| Δημιουργός≠ | Ganin et al. (DANN); Zhu et al. (CycleGAN) | Multiple groups (Yang et al., 2023; Xu et al., 2021; Ma et al., 2022) |
| Τύπος≠ | Generative adversarial model with domain adaptation | Domain adaptation + Vision Transformer ensemble |
| Θεμελιώδης πηγή≠ | Ganin, Y., Ustunova, E., Ajakan, H., Germain, P., Larochelle, H., Laviolette, F., Marchand, M., & Lempitsky, V. (2016). Domain-adversarial training of neural networks. Journal of Machine Learning Research, 17(59), 1–35. link ↗ | Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., ... & Houlsby, N. (2021). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. International Conference on Learning Representations (ICLR). link ↗ |
| Εναλλακτικές ονομασίες | DA-GAN, domain adaptation GAN, adversarial domain adaptation, domain-adaptive generative adversarial network | DA-ViT, Domain Adaptation with Vision Transformer, ViT with Domain Adaptation, Domain-Adaptive ViT |
| Συναφείς≠ | 6 | 5 |
| Σύνοψη≠ | A Domain-Adaptive GAN combines generative adversarial learning with domain adaptation to bridge the distribution gap between a labeled source domain and an unlabeled or sparsely labeled target domain. By training a generator and discriminator adversarially, the model learns domain-invariant representations or translated samples, enabling a classifier or detector trained on source data to generalize effectively to the target domain without requiring abundant target labels. | Domain-Adaptive Vision Transformer (DA-ViT) applies domain adaptation techniques — such as adversarial alignment, self-training, or attention-level bridging — on top of a pretrained Vision Transformer backbone to transfer visual knowledge from a labeled source domain to an unlabeled or lightly labeled target domain, reducing the distribution shift that limits standard ViT fine-tuning. |
| ScholarGateΣύνολο δεδομένων ↗ |
|
|