Results for "out-of-sample performance"
A wide basin often correlated with better generalization.
Adjusting learning rate over training to improve convergence.
Gradually increasing learning rate at training start to avoid divergence.
A narrow hidden layer forcing compact representations.
Tradeoffs between many layers vs many neurons per layer.
Allows model to attend to information from different subspaces simultaneously.
Encodes positional information via rotation in embedding space.
Techniques to handle longer documents without quadratic cost.
Routes inputs to subsets of parameters for scalable capacity.
Extending agents with long-term memory stores.
Chooses which experts process each token.
Multiple agents interacting cooperatively or competitively.
Models evaluating and improving their own outputs.
Framework for identifying, measuring, and mitigating model risks.
Central catalog of deployed and experimental models.
Neural networks that operate on graph-structured data by propagating information along edges.
GNN using attention to weight neighbor contributions dynamically.
Controls amount of noise added at each diffusion step.
Pixel-wise classification of image regions.
Transformer applied to image patches.
Maps audio signals to linguistic units.
Repeating temporal patterns.
Detects trigger phrases in audio streams.
Model execution path in production.
Centralized repository for curated features.
Shift in feature distribution over time.
Using production outcomes to improve models.
System that independently pursues goals over time.
Number of steps considered in planning.
Competitive advantage from proprietary models/data.