ScholarGate
助手

方法对比

并排查看您选择的方法;存在差异的行会高亮显示。

BERT 嵌入×Doc2Vec×GloVe 词嵌入×Word2Vec×
领域文本挖掘文本挖掘文本挖掘文本挖掘
方法族Process / pipelineProcess / pipelineProcess / pipelineProcess / pipeline
起源年份2019201420142013
提出者Devlin, Chang, Lee & Toutanova (Google AI)Quoc V. Le & Tomas MikolovPennington, Socher & ManningTomas Mikolov et al.
类型Contextual transformer text-representation methodDocument-embedding representation learningStatic word-embedding modelNeural word-embedding model
开创性文献Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. NAACL-HLT, 4171-4186. DOI ↗Le, Q. V. & Mikolov, T. (2014). Distributed Representations of Sentences and Documents. Proceedings of the 31st International Conference on Machine Learning (ICML), 1188-1196. link ↗Pennington, J., Socher, R. & Manning, C. D. (2014). GloVe: Global Vectors for Word Representation. EMNLP. DOI ↗Mikolov, T., Chen, K., Corrado, G. & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. link ↗
别名contextual embeddings, transformer embeddings, BERT Tabanlı Metin Gömülmeleriparagraph vector, document embeddings, Doc2Vec Belge GömülmeleriGloVe, global vectors, GloVe Kelime Gömülmeleriword embeddings, skip-gram, continuous bag-of-words, Word2Vec Kelime Gömülmeleri
相关4434
摘要BERT-based text embeddings, introduced by Devlin and colleagues at Google AI in 2019, turn text into context-sensitive dense vectors using a bidirectional Transformer encoder. Because the meaning of a word shifts with its context, BERT produces richer representations than static methods such as Word2Vec or topic models like LDA.Doc2Vec, also known as Paragraph Vector, is a representation-learning method introduced by Le and Mikolov (2014) that maps whole documents to fixed-length dense vectors. These vectors place similar documents close together in space, supporting document comparison and classification.GloVe (Global Vectors for Word Representation) is a static word-embedding model introduced by Pennington, Socher and Manning (2014) that learns word vectors directly from global word-word co-occurrence statistics gathered across an entire corpus. The resulting vectors place semantically related words close together and perform strongly on semantic analogy tasks.Word2Vec is a neural word-embedding technique introduced by Mikolov and colleagues in 2013 that maps each word in a text corpus to a dense numeric vector. Words that appear in similar contexts end up close together in the vector space, so the embeddings capture semantic similarity that can be measured arithmetically.
ScholarGate数据集
  1. v1
  2. 2 来源
  3. PUBLISHED
  1. v1
  2. 1 来源
  3. PUBLISHED
  1. v1
  2. 1 来源
  3. PUBLISHED
  1. v1
  2. 1 来源
  3. PUBLISHED

前往搜索 下载幻灯片

ScholarGate方法对比: BERT Embeddings · Doc2Vec · GloVe Embeddings · Word2Vec. 于 2026-06-18 检索自 https://scholargate.app/zh/compare