Fine-Tuned Topic Modeling
Fine-Tuned Topic Modeling adapts pre-trained language models, such as BERT or Sentence-BERT, to discover latent topics in document collections. Unlike classical bag-of-words methods such as LDA (a probabilistic model) and NMF (a matrix factorization), it uses contextual embeddings and can optionally fine-tune the backbone on a domain-specific corpus. This tends to yield more coherent and semantically meaningful topics, especially for short texts or specialized domains.
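To make the topic-extraction step concrete, here is a minimal sketch of the class-based TF-IDF weighting described by Grootendorst (2022), where a term t in cluster c is scored as tf(t, c) · log(1 + A / tf(t)), with A the average number of words per cluster and tf(t) the term's frequency across all clusters. It is an illustration under stated assumptions, not the BERTopic implementation: the function name `ctfidf_top_words`, the tiny stopword set, the toy corpus, and the hard-coded cluster labels (which stand in for the result of clustering document embeddings) are all hypothetical.

```python
import math
from collections import Counter

# Hypothetical minimal stopword list; a real pipeline would use a fuller one.
STOPWORDS = {"the", "a", "in"}


def ctfidf_top_words(docs, labels, top_n=3):
    """Return the top_n highest-scoring words per cluster using c-TF-IDF."""
    # Merge all documents of a cluster into a single bag of words.
    class_counts = {}
    for doc, label in zip(docs, labels):
        words = [w for w in doc.lower().split() if w not in STOPWORDS]
        class_counts.setdefault(label, Counter()).update(words)

    # Frequency of each word across all clusters.
    total_counts = Counter()
    for counts in class_counts.values():
        total_counts.update(counts)

    # A: average number of (non-stopword) words per cluster.
    avg_words = sum(total_counts.values()) / len(class_counts)

    topics = {}
    for label, counts in class_counts.items():
        scores = {
            word: tf * math.log(1 + avg_words / total_counts[word])
            for word, tf in counts.items()
        }
        # Sort by score (descending), then alphabetically to break ties.
        ranked = sorted(scores, key=lambda w: (-scores[w], w))
        topics[label] = ranked[:top_n]
    return topics


# Toy corpus (assumption): labels stand in for clusters of embeddings.
docs = [
    "the striker scored a goal in the match",
    "the goalkeeper saved the goal late in the match",
    "the central bank raised the interest rate",
    "the bank cut the interest rate again",
]
labels = [0, 0, 1, 1]

topics = ctfidf_top_words(docs, labels, top_n=2)
print(topics)
```

In a full pipeline the labels would come from clustering sentence embeddings rather than being written by hand. Because the logarithmic factor shrinks as a word's overall frequency grows, words that are common to every cluster receive less weight than words concentrated in one cluster.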
Sources
- Bianchi, F., Terragni, S., Hovy, D., Nozza, D., & Fersini, E. (2021). Cross-lingual Contextualized Topic Models with Zero-shot Learning. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, 1676–1683. DOI: 10.18653/v1/2021.eacl-main.143
- Grootendorst, M. (2022). BERTopic: Neural topic modeling with a class-based TF-IDF procedure. arXiv preprint arXiv:2203.05794.