Guardrails
Definition: Guardrails are the mechanisms that constrain a model's inputs and outputs to keep it within a safe and compliant scope.
They combine filters, rules, content classification and system instructions. They aim to block misuse without needlessly degrading usefulness.