Results for "demonstration-based"
A function measuring prediction error (and sometimes calibration), guiding gradient-based optimization.
Architecture based on self-attention and feedforward layers; foundation of modern LLMs and many multimodal models.
Retrieval based on embedding similarity rather than keyword overlap, capturing paraphrases and related concepts.
A preference-based training method optimizing policies directly from pairwise comparisons without explicit RL loops.
A measure of a model class’s expressive capacity: the size of the largest set of points the class can shatter.
Probabilistic energy-based neural network with hidden variables.
Continuous loop adjusting actions based on state feedback.
Sampling-based motion planner.
Models that define an energy landscape rather than explicit probabilities.
Learns the score (∇ₓ log p(x), the gradient of the log-density with respect to x) for generative sampling.
Exact-likelihood generative models using invertible transforms.
RL using learned or known environment models.
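
As an illustration of the embedding-similarity retrieval entry above, here is a minimal sketch that ranks documents by cosine similarity to a query vector. The three-dimensional vectors, the document names, and the helper names `cosine_similarity` and `retrieve` are all hypothetical; a real system would get its embeddings from a trained encoder.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)


def retrieve(query_vec, doc_vecs, top_k=2):
    """Return the top_k (doc_id, score) pairs, highest similarity first."""
    scored = [
        (doc_id, cosine_similarity(query_vec, vec))
        for doc_id, vec in doc_vecs.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hand-made toy embeddings (assumption): the first two documents share no
# keywords in their names but point in nearly the same direction.
DOC_VECS = {
    "car_repair": [0.9, 0.1, 0.0],
    "automobile_maintenance": [0.8, 0.2, 0.1],
    "banana_bread": [0.0, 0.1, 0.9],
}
QUERY_VEC = [0.85, 0.15, 0.05]

if __name__ == "__main__":
    for doc_id, score in retrieve(QUERY_VEC, DOC_VECS):
        print(f"{doc_id}: {score:.3f}")
```

In this toy setup the two vehicle-related documents rank above the unrelated one even though their names have no words in common, which is the paraphrase-matching behavior the entry describes.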