Results for "vision"
Deep Learning
Intermediate
A branch of machine learning that uses multi-layer neural networks to learn hierarchical representations, often excelling in vision, speech, and language.
Adversarial Example
Intermediate
An input crafted to cause model errors or unsafe behavior; the manipulation is often imperceptible in vision inputs and subtle in text.
Multimodal Model
Intermediate
A model that processes or generates multiple modalities, enabling vision-language tasks, speech processing, video understanding, and similar applications.
CLIP
Intermediate
A joint vision-language model that aligns images and text in a shared embedding space.
Sensor
Advanced
A device that measures physical quantities; examples include vision sensors (cameras), lidar, force sensors, and IMUs.
Exteroception
Advanced
Sensing of the external environment, for example through vision, audio, or lidar.
Computer Vision
Intermediate
AI focused on interpreting images/video: classification, detection, segmentation, tracking, and 3D understanding.
Vision Transformer
Intermediate
A Transformer applied to an image split into patches, with each patch treated as a token.
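To make the Vision Transformer entry concrete, here is a minimal sketch of the patch-splitting step only, in plain Python. The function name `split_into_patches`, the `patch_size` parameter, and the toy 4x4 image are illustrative assumptions, not part of any library. A real model would go on to apply a learned linear projection and position embeddings to each patch, both omitted here.

```python
def split_into_patches(image, patch_size):
    """Split a 2D image (list of rows) into flattened, non-overlapping patches.

    Hypothetical helper for illustration. Patches are returned in row-major
    order, each flattened to a 1D list, forming the sequence that a
    Transformer would consume after embedding.
    """
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image dimensions must be divisible by patch_size")
    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            # Collect the pixels of one patch, row by row.
            patch = [
                image[row][col]
                for row in range(top, top + patch_size)
                for col in range(left, left + patch_size)
            ]
            patches.append(patch)
    return patches


# Toy 4x4 single-channel "image" with pixel values 0..15.
image = [[row * 4 + col for col in range(4)] for row in range(4)]
patches = split_into_patches(image, patch_size=2)
print(patches)
# → [[0, 1, 4, 5], [2, 3, 6, 7], [8, 9, 12, 13], [10, 11, 14, 15]]
```

Each of the four inner lists plays the role of one token; with larger images and color channels the same idea applies, with each patch flattened across channels as well.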