AI glossary
Every artificial-intelligence term explained simply: LLM, RAG, agents, tokens, fine-tuning, hallucination… Clear, up-to-date definitions.
- AGI (artificial general intelligence)AGI refers to a hypothetical AI as versatile as a human, able to perform any intellectual task.
- AI agentAn AI agent combines a model, tools and a decision loop to accomplish a multi-step task autonomously.
- Chain-of-thoughtChain-of-thought has a model reason step by step before answering, which markedly improves complex tasks (math, logic).
- Constitutional AIConstitutional AI is Anthropic's method for aligning Claude: the model follows a set of principles (a 'constitution') guiding helpful, honest and harmless responses.
- Context windowThe context window is the amount of text (in tokens) a model can consider at once — its 'working memory'.
- Diffusion modelA diffusion model generates an image (or video) by starting from random noise it progressively 'denoises' into the result.
- DistillationDistillation trains a small model ('student') to imitate a large one ('teacher'), yielding a lighter, faster model close to the original.
- EmbeddingsAn embedding is a numerical representation (a vector) of text that captures meaning: texts close in meaning have close vectors.
- Fine-tuningFine-tuning re-trains an existing model on specific data to adapt it to a domain or style.
- HallucinationA hallucination is an AI answer that is false but stated confidently — the model 'makes up' plausible information.
- InferenceInference is when an already-trained model produces an answer from an input — running the model, as opposed to training it.
- LLM (large language model)An LLM (Large Language Model) is an AI model trained on vast text corpora to understand and generate language.
- Mixture of ExpertsA Mixture of Experts (MoE) model activates only part of its parameters ('experts') per request, for more power at lower compute cost.
- MultimodalA multimodal model handles several input types — text, image, audio, sometimes video — not just text.
- Open weightsAn 'open-weight' model has its parameters published: you can download, self-host, audit and fine-tune it.
- ParametersParameters are the internal values a model learns during training; their count (often in billions) hints at its size.
- Prompt engineeringPrompt engineering is the craft of writing clear, structured instructions (prompts) to get the best answers from a model.
- Prompt injectionPrompt injection is an attack where malicious content (in a page or document) hijacks a model's instructions to make it take unintended actions.
- QuantizationQuantization reduces a model's numerical precision (e.g. from 16 to 8 bits) to make it lighter and faster, with limited quality loss.
- RAG (retrieval-augmented generation)RAG (Retrieval-Augmented Generation) feeds the model relevant documents retrieved on the fly, so it answers with up-to-date facts and cites sources.
- RLHF (reinforcement learning from human feedback)RLHF trains a model from human preferences: humans rate answers, and the model learns to produce the preferred ones.
- TemperatureTemperature controls the randomness of a model's output: low = safe, deterministic; high = more creative but less reliable.
- TokenA token is the basic unit an LLM processes: a chunk of a word (often ~4 characters in English). Text is split into tokens before processing.
- TransformerThe Transformer is the neural-network architecture behind modern LLMs, based on the 'attention' mechanism.
- Vibe coding'Vibe coding' is the practice of programming by describing your intent in natural language to an AI that writes and edits the code.