Multimodal

Definition: A multimodal model handles several input types — text, image, audio, sometimes video — not just text.

Claude, for instance, can analyze an image alongside text. Multimodality greatly broadens use cases.

See also

← Full AI glossary · AI news