A Self-supervised Gaussian Mixture Model (SS-GMM) combines self-supervised representation learning with a probabilistic Gaussian mixture prior to discover meaningful clusters in unlabeled or partially labeled data. The model first learns rich embeddings through pretext tasks and then fits a GMM to those embeddings. This two-stage design can yield cluster quality that a standard GMM fit directly to raw features rarely reaches, especially on complex image, text, or biological data.
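The two-stage idea described above (learn embeddings with a pretext task, then fit a GMM to them) can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption made for illustration: the pretext task is a toy denoising reconstruction through a linear bottleneck, the GMM uses diagonal covariances fit with plain EM, the data are synthetic, and the function names `train_denoising_encoder` and `fit_gmm` are hypothetical. It is not the method of any particular paper.

```python
import numpy as np

def train_denoising_encoder(X, dim, epochs=300, lr=0.05, noise=0.5, seed=0):
    """Toy pretext task (assumption): reconstruct X from a noisy copy
    through a linear bottleneck, trained by full-batch gradient descent."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W_enc = rng.normal(scale=0.1, size=(d, dim))
    W_dec = rng.normal(scale=0.1, size=(dim, d))
    for _ in range(epochs):
        X_noisy = X + rng.normal(scale=noise, size=X.shape)
        Z = X_noisy @ W_enc                  # embeddings of the corrupted input
        err = Z @ W_dec - X                  # reconstruction error vs. clean input
        grad_dec = Z.T @ err / n
        grad_enc = X_noisy.T @ (err @ W_dec.T) / n
        W_dec -= lr * grad_dec
        W_enc -= lr * grad_enc
    return W_enc

def fit_gmm(Z, k, iters=50, seed=0):
    """EM for a diagonal-covariance Gaussian mixture on embeddings Z."""
    rng = np.random.default_rng(seed)
    n, d = Z.shape
    means = Z[rng.choice(n, size=k, replace=False)]
    variances = np.tile(Z.var(axis=0), (k, 1)) + 1e-6
    weights = np.full(k, 1.0 / k)
    for _ in range(iters):
        # E-step: log of (mixture weight * Gaussian density), per point and component
        diff2 = (Z[:, None, :] - means) ** 2
        log_p = -0.5 * (diff2 / variances + np.log(2 * np.pi * variances)).sum(axis=2)
        log_p += np.log(weights)
        log_p -= log_p.max(axis=1, keepdims=True)   # numerical stability
        resp = np.exp(log_p)
        resp /= resp.sum(axis=1, keepdims=True)     # responsibilities
        # M-step: re-estimate weights, means, and variances
        nk = resp.sum(axis=0) + 1e-9
        weights = nk / n
        means = (resp.T @ Z) / nk[:, None]
        variances = (resp.T @ Z**2) / nk[:, None] - means**2 + 1e-6
    return weights, means, variances, resp

# Synthetic data (assumption): two groups that differ in the first two of five features.
rng = np.random.default_rng(0)
labels_true = rng.integers(0, 2, size=400)
signal = np.where(labels_true[:, None] == 1, 2.0, -2.0) * np.array([1.0, 1.0, 0.0, 0.0, 0.0])
X = signal + rng.normal(size=(400, 5))
X = (X - X.mean(axis=0)) / X.std(axis=0)            # standardize features

W_enc = train_denoising_encoder(X, dim=2)           # stage 1: pretext task
Z = X @ W_enc                                       # learned embeddings
weights, means, variances, resp = fit_gmm(Z, k=2)   # stage 2: GMM on embeddings
clusters = resp.argmax(axis=1)                      # hard cluster assignments
```

Here `resp` holds the soft cluster memberships for each point and `clusters` the most likely component. In practice the linear encoder would be replaced by a deep network trained on a richer pretext task, and the mixture could use full covariances or be trained jointly with the encoder.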
Zhai, X., Oliver, A., Kolesnikov, A., & Beyer, L. (2019). S4L: Self-supervised semi-supervised learning. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 1476–1485.