Compare methods
Examine the selected methods side by side; rows that differ are highlighted.
| | Multilingual topic modeling | NMF topic model |
|---|---|---|
| Domain | Deep learning | Deep learning |
| Family | Machine learning | Machine learning |
| Origin year ≠ | 2009 | 1999 |
| Original author ≠ | Mimno, D., Wallach, H. M., et al. | Lee, D. D. & Seung, H. S. |
| Type ≠ | Probabilistic topic model (multilingual extension) | Matrix factorization / unsupervised topic model |
| Foundational source ≠ | Mimno, D., Wallach, H. M., Naradowsky, J., Smith, D. A., & McCallum, A. (2009). Polylingual topic models. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 880–889. ACL. | Lee, D. D., & Seung, H. S. (1999). Learning the parts of objects by non-negative matrix factorization. Nature, 401(6755), 788–791. |
| Aliases | cross-lingual topic model, polylingual LDA, multilingual LDA, MLTM | NMF, Non-negative Matrix Factorization, NMF for Topic Modeling, NNMF Topic Model |
| Related methods ≠ | 5 | 4 |
| Summary ≠ | Multilingual topic modeling extends probabilistic topic models such as LDA to corpora spanning two or more languages, inferring shared latent topics across language boundaries. By tying topic distributions across languages, it enables cross-lingual document analysis, comparable topic discovery, and information retrieval without requiring full parallel corpora. | Non-negative Matrix Factorization (NMF) is an unsupervised matrix decomposition method that discovers latent topics in a text corpus by factoring a document-term matrix into two non-negative matrices, one encoding topic-word weights and the other document-topic weights. The non-negativity constraint yields parts-based, additive representations that tend to produce clean, interpretable topics. |
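
The NMF summary in the table describes factoring a document-term matrix into a document-topic matrix and a topic-word matrix. As a rough illustration only, the sketch below applies the well-known multiplicative update rules for the squared-error objective to a tiny invented corpus. The vocabulary, the counts, and the helper names (`matmul`, `transpose`, `nmf`, `frobenius_error`) are all assumptions made for this example and are not taken from the compared sources; production work would normally rely on an established library instead.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols_b = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols_b] for row in a]


def transpose(m):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]


def nmf(v, n_topics, n_iter=200, seed=0, eps=1e-9):
    """Approximate V (docs x terms) as W (docs x topics) times H (topics x terms).

    Uses multiplicative updates, which keep every entry non-negative
    as long as the starting values are non-negative.
    """
    rng = random.Random(seed)
    n_docs, n_terms = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(n_topics)] for _ in range(n_docs)]
    h = [[rng.random() + 0.1 for _ in range(n_terms)] for _ in range(n_topics)]
    for _ in range(n_iter):
        # H <- H * (W^T V) / (W^T W H), element-wise
        wt = transpose(w)
        num = matmul(wt, v)
        den = matmul(matmul(wt, w), h)
        h = [[h[k][j] * num[k][j] / (den[k][j] + eps) for j in range(n_terms)]
             for k in range(n_topics)]
        # W <- W * (V H^T) / (W H H^T), element-wise
        ht = transpose(h)
        num = matmul(v, ht)
        den = matmul(w, matmul(h, ht))
        w = [[w[i][k] * num[i][k] / (den[i][k] + eps) for k in range(n_topics)]
             for i in range(n_docs)]
    return w, h


def frobenius_error(v, w, h):
    """Frobenius norm of V - WH, i.e. the reconstruction error."""
    approx = matmul(w, h)
    return sum((v[i][j] - approx[i][j]) ** 2
               for i in range(len(v)) for j in range(len(v[0]))) ** 0.5


# Invented toy corpus: rows are documents, columns are term counts.
vocab = ["goal", "match", "team", "vote", "election", "party"]
docs = [
    [3, 2, 2, 0, 0, 0],
    [2, 3, 1, 0, 0, 0],
    [0, 0, 0, 3, 2, 2],
    [0, 0, 0, 1, 3, 2],
]

w, h = nmf(docs, n_topics=2)
for k, row in enumerate(h):
    top = sorted(range(len(vocab)), key=lambda j: row[j], reverse=True)[:3]
    print(f"topic {k}:", [vocab[j] for j in top])
print("reconstruction error:", round(frobenius_error(docs, w, h), 3))
```

Each row of `h` holds the topic-word weights and each row of `w` the document-topic weights. On a corpus this cleanly separated, the two topics would be expected to split along the two vocabularies, but the result depends on the random initialization and the number of iterations.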