Confronta i metodi
Esamina i metodi selezionati fianco a fianco; le righe che differiscono sono evidenziate.
| FastText× | Naive Bayes× | Word2Vec× | |
|---|---|---|---|
| Campo≠ | Apprendimento profondo | Apprendimento automatico | Text mining |
| Famiglia≠ | Machine learning | Machine learning | Process / pipeline |
| Anno di origine≠ | 2016 | 1997 | 2013 |
| Ideatore≠ | Joulin, A.; Bojanowski, P.; Grave, E.; Mikolov, T. (Facebook AI Research) | Mitchell, T. M. (textbook treatment) | Tomas Mikolov et al. |
| Tipo≠ | Subword embedding model and linear text classifier | Probabilistic classifier (Bayes' theorem with conditional independence) | Neural word-embedding model |
| Fonte seminale≠ | Joulin, A., Grave, E., Bojanowski, P. & Mikolov, T. (2017). Bag of Tricks for Efficient Text Classification. In Proceedings of EACL 2017, Short Papers, pp. 427–431. ACL. DOI ↗ | Mitchell, T. M. (1997). Machine Learning. McGraw-Hill. ISBN: 978-0070428072 | Mikolov, T., Chen, K., Corrado, G. & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. link ↗ |
| Alias≠ | fastText, fast text, subword embedding, character n-gram embedding | Naive Bayes Sınıflandırıcı, naive bayes classifier, simple Bayes, Gaussian Naive Bayes | word embeddings, skip-gram, continuous bag-of-words, Word2Vec Kelime Gömülmeleri |
| Correlati≠ | 2 | 4 | 4 |
| Sintesi≠ | FastText is a word embedding and text classification framework developed by Facebook AI Research (Joulin, Bojanowski, Grave, and Mikolov, 2016–2017) that represents each word as the sum of its character n-gram vectors, allowing it to construct meaningful representations for unseen and morphologically rich words and to perform near state-of-the-art text classification orders of magnitude faster than deep neural network alternatives. | Naive Bayes is a fast probabilistic classifier that applies Bayes' theorem while assuming that the features are conditionally independent given the class — a method given its standard machine-learning treatment in Tom Mitchell's 1997 textbook Machine Learning. Despite this simplifying ('naive') assumption, it is quick to train and often surprisingly accurate. | Word2Vec is a neural word-embedding technique introduced by Mikolov and colleagues in 2013 that maps each word in a text corpus to a dense numeric vector. Words that appear in similar contexts end up close together in the vector space, so the embeddings capture semantic similarity that can be measured arithmetically. |
| ScholarGateInsieme di dati ↗ |
|
|
|