Transformer
Definition: The Transformer is the neural-network architecture behind modern LLMs, based on the 'attention' mechanism.
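Introduced in 2017 in the paper 'Attention Is All You Need', it handles long sequences by letting every element weigh how relevant each other element is to it. It's the 'T' in GPT (Generative Pre-trained Transformer).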
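As a rough illustration of the weighting idea, here is a minimal sketch of scaled dot-product attention in plain Python. It is a simplification under stated assumptions: the toy vectors are made up, the function names are hypothetical, and the learned projections, multiple heads, and positional information used by real Transformers are omitted.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    dim = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Score this query against every key, scaled by sqrt(dim).
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in keys]
        weights = softmax(scores)
        # The output is the weighted average of the value vectors.
        out = [sum(w * v[i] for w, v in zip(weights, values))
               for i in range(len(values[0]))]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

# Three toy token vectors (made-up numbers, an assumption for illustration).
# In self-attention the same sequence supplies queries, keys, and values.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = attention(tokens, tokens, tokens)
```

Each row of weights shows how much one token attends to every token in the sequence, and each row sums to 1. In this toy input the first token gives equal weight to itself and the third token, and less to the second, because its dot product with the second token is zero.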