Multimodal Model
IntermediateModels that process or generate multiple modalities, enabling vision-language tasks, speech, video understanding, etc.
Full Definition
Models that process or generate multiple modalities, enabling vision-language tasks, speech, video understanding, etc.
Keywords
Domains
Related Terms
Concept Map
See how Multimodal Model connects to other concepts.
Open Knowledge Graph