ScholarGate
Asszisztens

Módszerek összehasonlítása

Tekintse át a kiválasztott módszereket egymás mellett; az eltérő sorok kiemelve jelennek meg.

BERT-beágyazások×Doc2Vec×Word2Vec×
TudományterületSzövegbányászatSzövegbányászatSzövegbányászat
MódszercsaládProcess / pipelineProcess / pipelineProcess / pipeline
Keletkezés éve201920142013
MegalkotóDevlin, Chang, Lee & Toutanova (Google AI)Quoc V. Le & Tomas MikolovTomas Mikolov et al.
TípusContextual transformer text-representation methodDocument-embedding representation learningNeural word-embedding model
AlapműDevlin, J., Chang, M.-W., Lee, K. & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. NAACL-HLT, 4171-4186. DOI ↗Le, Q. V. & Mikolov, T. (2014). Distributed Representations of Sentences and Documents. Proceedings of the 31st International Conference on Machine Learning (ICML), 1188-1196. link ↗Mikolov, T., Chen, K., Corrado, G. & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. link ↗
Alternatív nevekcontextual embeddings, transformer embeddings, BERT Tabanlı Metin Gömülmeleriparagraph vector, document embeddings, Doc2Vec Belge Gömülmeleriword embeddings, skip-gram, continuous bag-of-words, Word2Vec Kelime Gömülmeleri
Kapcsolódó444
ÖsszefoglalóBERT-based text embeddings, introduced by Devlin and colleagues at Google AI in 2019, turn text into context-sensitive dense vectors using a bidirectional Transformer encoder. Because the meaning of a word shifts with its context, BERT produces richer representations than static methods such as Word2Vec or topic models like LDA.Doc2Vec, also known as Paragraph Vector, is a representation-learning method introduced by Le and Mikolov (2014) that maps whole documents to fixed-length dense vectors. These vectors place similar documents close together in space, supporting document comparison and classification.Word2Vec is a neural word-embedding technique introduced by Mikolov and colleagues in 2013 that maps each word in a text corpus to a dense numeric vector. Words that appear in similar contexts end up close together in the vector space, so the embeddings capture semantic similarity that can be measured arithmetically.
ScholarGateAdatkészlet
  1. v1
  2. 2 Források
  3. PUBLISHED
  1. v1
  2. 1 Források
  3. PUBLISHED
  1. v1
  2. 1 Források
  3. PUBLISHED

Ugrás a kereséshez Diák letöltése

ScholarGateMódszerek összehasonlítása: BERT Embeddings · Doc2Vec · Word2Vec. Letöltve 2026-06-18, forrás: https://scholargate.app/hu/compare