Transformer

Definition: The Transformer is the neural-network architecture behind modern large language models (LLMs), built around the 'attention' mechanism.

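Introduced in 2017 in the paper "Attention Is All You Need", it processes a sequence by letting each element weigh how relevant every other element is to it, which is what allows it to handle long-range context. It's the 'T' in GPT (Generative Pre-trained Transformer).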
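To make the idea of weighting concrete, here is a minimal sketch of scaled dot-product self-attention in plain Python. It is a simplification, not a full Transformer layer: it assumes the queries, keys and values are the input vectors themselves, whereas a real model applies learned projections and uses multiple heads and positional information. The function names and the toy `tokens` data are made up for illustration.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(vectors):
    """Simplified self-attention (assumption: no learned projections).

    Each vector acts as its own query, key and value.
    Returns the mixed output vectors and the attention weights.
    """
    dim = len(vectors[0])
    outputs, all_weights = [], []
    for query in vectors:
        # Similarity of this element to every element, scaled by sqrt(dim).
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in vectors
        ]
        weights = softmax(scores)
        # Output is the weighted average of all value vectors.
        mixed = [
            sum(w * v[i] for w, v in zip(weights, vectors))
            for i in range(dim)
        ]
        outputs.append(mixed)
        all_weights.append(weights)
    return outputs, all_weights


# Hypothetical toy input: three "tokens" as 2-dimensional vectors.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = self_attention(tokens)
```

Each row of `weights` says how much one element attends to every element in the sequence. In this toy input the first token gives more weight to the third token (which overlaps with it) than to the second (which does not).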
