Results for "order information"
AI-assisted review of legal documents.
Requirement to preserve relevant data.
Standardized documentation describing intended use, performance, limitations, data, and ethical considerations.
When information from evaluation data improperly influences training, inflating reported performance.
Forcing predictable formats for downstream systems; reduces parsing errors and supports validation/guardrails.
A measure of randomness or uncertainty in a probability distribution.
Early architecture using learned gates for skip connections.
A single attention mechanism within multi-head attention.
Allows model to attend to information from different subspaces simultaneously.
Techniques to handle longer documents without quadratic cost.
Extracting system prompts or hidden instructions.
Structured graph encoding facts as entity–relation–entity triples.
Prompt augmented with retrieved documents.
Enables external computation or lookup.
Loss of old knowledge when learning new tasks.
Governance of model changes.
External sensing of surroundings (vision, audio, lidar).
Optimal pathfinding algorithm.
Learning without catastrophic forgetting.
Designing systems where rational agents behave as desired.
Agents copy others’ actions.
A branch of ML using multi-layer neural networks to learn hierarchical representations, often excelling in vision, speech, and language.
Learning structure from unlabeled data, such as discovering groups, compressing representations, or modeling data distributions.
Learning where data arrives sequentially and the model updates continuously, often under changing distributions.
Harmonic mean of precision and recall; useful when balancing false positives/negatives matters.
Networks with recurrent connections for sequences; largely supplanted by Transformers for many tasks.
Training objective where the model predicts the next token given previous tokens (causal modeling).
An RNN variant using gates to mitigate vanishing gradients and capture longer context.
Generates sequences one token at a time, conditioning on past tokens.
Maximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.