ScholarGate

Compare methods

View the selected methods side by side; rows that differ are highlighted.

| Field | FastText | Naive Bayes | Word2Vec |
|---|---|---|---|
| Domain | Deep learning | Machine learning | Text mining |
| Family | Machine learning | Machine learning | Process / pipeline |
| Year introduced | 2016 | 1997 | 2013 |
| Creator | Joulin, A.; Bojanowski, P.; Grave, E.; Mikolov, T. (Facebook AI Research) | Mitchell, T. M. (textbook treatment) | Tomas Mikolov et al. |
| Type | Subword embedding model and linear text classifier | Probabilistic classifier (Bayes' theorem with conditional independence) | Neural word-embedding model |
| Original source | Joulin, A., Grave, E., Bojanowski, P. & Mikolov, T. (2017). Bag of Tricks for Efficient Text Classification. In Proceedings of EACL 2017, Short Papers, pp. 427–431. ACL. | Mitchell, T. M. (1997). Machine Learning. McGraw-Hill. ISBN: 978-0070428072 | Mikolov, T., Chen, K., Corrado, G. & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. |
| Other names | fastText, fast text, subword embedding, character n-gram embedding | Naive Bayes Sınıflandırıcı, naive bayes classifier, simple Bayes, Gaussian Naive Bayes | word embeddings, skip-gram, continuous bag-of-words, Word2Vec Kelime Gömülmeleri |
| Related | 2 | 4 | 4 |
| Summary | FastText is a word-embedding and text-classification framework developed by Facebook AI Research (Joulin, Bojanowski, Grave, and Mikolov, 2016–2017). It represents each word as the sum of its character n-gram vectors, which lets it build meaningful representations for unseen and morphologically rich words and perform near state-of-the-art text classification orders of magnitude faster than deep neural network alternatives. | Naive Bayes is a fast probabilistic classifier that applies Bayes' theorem while assuming that the features are conditionally independent given the class. The method received its standard machine-learning treatment in Tom Mitchell's 1997 textbook Machine Learning. Despite this simplifying ("naive") assumption, it is quick to train and often surprisingly accurate. | Word2Vec is a neural word-embedding technique introduced by Mikolov and colleagues in 2013 that maps each word in a text corpus to a dense numeric vector. Words that appear in similar contexts end up close together in the vector space, so the embeddings capture semantic similarity that can be measured arithmetically. |
| ScholarGate dataset | v1, 3 sources, PUBLISHED | v1, 1 source, PUBLISHED | v1, 1 source, PUBLISHED |
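
The FastText summary describes a word as the sum of its character n-gram vectors. The following is a minimal sketch of that idea, not the fastText library itself. The names `char_ngrams`, `toy_vector`, and `word_vector` are hypothetical, and the hash-derived vectors are an assumption standing in for embeddings that a real model would learn. The whole-word vector that fastText also includes is omitted for brevity.

```python
import hashlib


def char_ngrams(word, n_min=3, n_max=6):
    """Return the character n-grams of `word`, wrapped in '<' and '>' boundary markers."""
    wrapped = f"<{word}>"
    return [
        wrapped[i:i + n]
        for n in range(n_min, n_max + 1)
        for i in range(len(wrapped) - n + 1)
    ]


def toy_vector(gram, dim=4):
    """Hypothetical stand-in for a learned n-gram embedding (assumption, not trained)."""
    digest = hashlib.sha256(gram.encode("utf-8")).digest()
    # Map the first `dim` hash bytes to floats in roughly [-1, 1).
    return [(b - 128) / 128 for b in digest[:dim]]


def word_vector(word, dim=4):
    """Represent a word as the component-wise sum of its n-gram vectors."""
    vec = [0.0] * dim
    for gram in char_ngrams(word):
        for k, x in enumerate(toy_vector(gram, dim)):
            vec[k] += x
    return vec


# An unseen word still gets a vector, because its n-grams overlap with known words.
shared = set(char_ngrams("playing")) & set(char_ngrams("replaying"))
print(sorted(shared)[:5])
print(word_vector("replaying"))
```

Because "playing" and "replaying" share many n-grams, their summed vectors share many terms, which is the mechanism the summary credits for handling unseen and morphologically rich words.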

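The Naive Bayes entry above rests on Bayes' theorem plus the assumption that features are conditionally independent given the class. A minimal sketch of a multinomial variant for text follows. The function names, the four-document toy corpus, and the use of add-one (Laplace) smoothing are illustrative assumptions, not details taken from the page.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(docs):
    """docs: list of (tokens, label). Returns class counts, per-class word counts, vocabulary."""
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in docs:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def predict(tokens, class_counts, word_counts, vocab):
    """Pick the class maximising log P(class) + sum of log P(token | class)."""
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_docs in class_counts.items():
        score = math.log(n_docs / total_docs)  # log prior
        total_words = sum(word_counts[label].values())
        for tok in tokens:
            if tok not in vocab:
                continue  # ignore words never seen in training
            # Add-one smoothing keeps unseen (token, class) pairs from zeroing the product.
            score += math.log((word_counts[label][tok] + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy corpus, for illustration only.
training = [
    ("free prize now".split(), "spam"),
    ("win free money".split(), "spam"),
    ("meeting at noon".split(), "ham"),
    ("lunch meeting tomorrow".split(), "ham"),
]
model = train_naive_bayes(training)
print(predict("free money".split(), *model))
print(predict("meeting tomorrow".split(), *model))
```

Summing log-probabilities token by token is exactly where the independence assumption enters: each word contributes to the score without regard to the other words in the document.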

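The Word2Vec entry above says that semantic similarity between embeddings can be measured arithmetically. Cosine similarity is one common way to do that. In the sketch below, the three-dimensional vectors are hypothetical values chosen for illustration, not output from a trained Word2Vec model.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: dot(a, b) / (|a| * |b|)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical toy embeddings (assumption): real vectors typically have 100+ dimensions.
embeddings = {
    "king": [0.8, 0.6, 0.1],
    "queen": [0.7, 0.7, 0.1],
    "apple": [0.1, 0.0, 0.9],
}

sim_royal = cosine_similarity(embeddings["king"], embeddings["queen"])
sim_fruit = cosine_similarity(embeddings["king"], embeddings["apple"])
print(f"king vs queen: {sim_royal:.3f}")
print(f"king vs apple: {sim_fruit:.3f}")
```

With these toy values, the "king" and "queen" vectors point in nearly the same direction, so their similarity is much higher than that of "king" and "apple".
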
ScholarGate. Compare methods: FastText · Naive Bayes · Word2Vec. Retrieved 2026-06-17 from https://scholargate.app/cs/compare