Multimodal Model

Intermediate

Models that process or generate multiple modalities, enabling vision-language tasks, speech, video understanding, etc.

Full Definition

Models that process or generate multiple modalities, enabling vision-language tasks, speech, video understanding, etc.

Keywords

Domains

Related Terms

Concept Map

See how Multimodal Model connects to other concepts.

Open Knowledge Graph