Comparer des méthodes
Examinez les méthodes sélectionnées côte à côte ; les lignes qui diffèrent sont mises en évidence.
| Variational Autoencoder affiné× | Modèle de diffusion affiné× | |
|---|---|---|
| Domaine | Apprentissage profond | Apprentissage profond |
| Famille | Machine learning | Machine learning |
| Année d'origine≠ | 2014 (VAE); fine-tuning practice from 2015 onward | 2020–2023 |
| Auteur d'origine≠ | Kingma, D. P. & Welling, M. (VAE); fine-tuning strategy from transfer learning literature | Ho, J., Jain, A., Abbeel, P. (base DDPM); Ruiz et al. (DreamBooth fine-tuning paradigm) |
| Type≠ | Generative model with fine-tuning | Generative model (fine-tuned via subject-specific or domain-specific data) |
| Source fondatrice≠ | Kingma, D. P., & Welling, M. (2014). Auto-Encoding Variational Bayes. In Proceedings of the 2nd International Conference on Learning Representations (ICLR 2014). link ↗ | Ruiz, N., Li, Y., Jampani, V., Pritch, Y., Rubinstein, M., & Aberman, K. (2023). DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 22500–22510. DOI ↗ |
| Alias | fine-tuned VAE, domain-adapted VAE, transfer-learned VAE, adapted variational autoencoder | DDPM fine-tuning, diffusion model adaptation, personalized diffusion model, subject-driven diffusion fine-tuning |
| Apparentées≠ | 6 | 5 |
| Résumé≠ | A Fine-Tuned Variational Autoencoder begins with a VAE pre-trained on a large source dataset and then continues training on a smaller target-domain dataset. This approach adapts the learned latent representation and generative capacity to new data, preserving general structure while specializing to the target distribution — yielding better results than training from scratch when labeled or large target data is scarce. | A fine-tuned diffusion model adapts a large pretrained denoising diffusion model — such as Stable Diffusion or DALL-E — to a specific subject, style, or domain by continuing training on a small curated dataset. Techniques such as DreamBooth, textual inversion, and LoRA make this adaptation feasible on consumer hardware while preserving general generative capability. |
| ScholarGateJeu de données ↗ |
|
|