Results for "trigger-based behavior"
Monte Carlo method for state estimation.
Flat high-dimensional regions slowing training.
Methods like Adam adjusting learning rates dynamically.
Classifying models by impact level.
Guaranteed response times.
Software simulating physical laws.
Predicts next state given current state and action.
Space of all possible robot configurations.
Sampling-based motion planner.
Learning by minimizing prediction error.
Software regulated as a medical device.
Deep learning system for protein structure prediction.
A subfield of AI where models learn patterns from data to make predictions or decisions, improving with experience rather than explicit rule-coding.
Training with a small labeled dataset plus a larger unlabeled dataset, leveraging assumptions like smoothness/cluster structure.
Learning where data arrives sequentially and the model updates continuously, often under changing distributions.
A continuous vector encoding of an item (word, image, user) such that semantic similarity corresponds to geometric closeness.
A function measuring prediction error (and sometimes calibration), guiding gradient-based optimization.
One complete traversal of the training dataset during training.
Methods to set starting weights to preserve signal/gradient scales across layers.
Mechanism that computes context-aware mixtures of representations; scales well and captures long-range dependencies.
An RNN variant using gates to mitigate vanishing gradients and capture longer context.
Injects sequence order into Transformers, since attention alone is permutation-invariant.
A high-capacity language model trained on massive corpora, exhibiting broad generalization and emergent behaviors.
Predicts masked tokens in a sequence, enabling bidirectional context; often used for embeddings rather than generation.
Maximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.
Letting an LLM call external functions/APIs to fetch data, compute, or take actions, improving reliability.
Architecture that retrieves relevant documents (e.g., from a vector DB) and conditions generation on them to reduce hallucinations.
A datastore optimized for similarity search over embeddings, enabling semantic retrieval at scale.
Model trained to predict human preferences (or utility) for candidate outputs; used in RLHF-style pipelines.
Controlled experiment comparing variants by random assignment to estimate causal effects of changes.