Machine learning

Transformer (NLP)

The Transformer is an attention-based deep learning model, introduced by Vaswani and colleagues in 2017, that performs text classification, named-entity recognition, and language modelling by letting every token in a sequence attend directly to every other token. It replaced earlier recurrent designs with a self-attention mechanism that processes whole sequences in parallel.

Open in MethodMindSoonVideoSoon

Read the full method

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. Vaswani, A. et al. (2017). Attention Is All You Need. NeurIPS. link

Related methods

Referenced by

ScholarGateTransformer (Transformer Model for Natural Language Processing). Retrieved 2026-06-04 from https://scholargate.app/en/deep-learning/transformer-nlp