Results for "data preservation"
How many requests or tokens can be processed per unit time; affects scalability and cost.
Hardware resources used for training/inference; constrained by memory bandwidth, FLOPs, and parallelism.
Exponential of average negative log-likelihood; lower means better predictive fit, not necessarily better utility.
Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.
System design where humans validate or guide model outputs, especially for high-stakes decisions.
Mechanisms for retaining context across turns/sessions: scratchpads, vector memories, structured stores.
Coordinating tools, models, and steps (retrieval, calls, validation) to deliver reliable end-to-end behavior.
Constraining model outputs into a schema used to call external APIs/tools safely and deterministically.
AI focused on interpreting images/video: classification, detection, segmentation, tracking, and 3D understanding.
AI subfield dealing with understanding and generating human language, including syntax, semantics, and pragmatics.
A model is PAC-learnable if it can, with high probability, learn an approximately correct hypothesis from finite samples.
Converting audio speech into text, often using encoder-decoder or transducer architectures.
Error due to sensitivity to fluctuations in the training dataset.
A measure of randomness or uncertainty in a probability distribution.
Variability introduced by minibatch sampling during SGD.
A wide basin often correlated with better generalization.
A narrow hidden layer forcing compact representations.
The range of functions a model can represent.
Allows model to attend to information from different subspaces simultaneously.
Encodes token position explicitly, often via sinusoids.
Categorizing AI applications by impact and regulatory risk.
Models trained to decide when to call tools.
Logged record of model inputs, outputs, and decisions.
Compromising AI systems via libraries, models, or datasets.
Graphs containing multiple node or edge types with different semantics.
Extension of convolution to graph domains using adjacency structure.
Controls amount of noise added at each diffusion step.
Combining signals from multiple modalities.
Simultaneous Localization and Mapping for robotics.
Predicting future values from past observations.