Prompt injection

Definition: Prompt injection is an attack where malicious content (in a page or document) hijacks a model's instructions to make it take unintended actions.

It's a major risk for tool-connected AI agents. The defense: treat external content as data, never as commands.

See also

← Full AI glossary · AI news