Machine learningDeep learning / NLP / CV
Self-supervised Word2Vec
Word2Vec is a shallow neural network model introduced by Mikolov et al. (2013) that learns dense vector representations of words from large unlabeled text corpora using self-supervised objectives. By training a model to predict surrounding context words (Skip-gram) or a target word from its context (CBOW), it captures rich semantic and syntactic regularities in continuous vector space without any manual annotation.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. In Proceedings of the International Conference on Learning Representations (ICLR 2013). link ↗
- Mikolov, T., Sutskever, I., Chen, K., Corrado, G., & Dean, J. (2013). Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems (NeurIPS 2013), 26. link ↗