Results for "stochastic quantity"
Optimization under uncertainty.
Empirical laws linking model size, data, compute to performance.
Competitive advantage from proprietary models/data.
Approximating expectations via random sampling.
A gradient method using random minibatches for efficient training on large datasets.
Stochastic generation strategies that trade determinism for diversity; key knobs include temperature and nucleus sampling.
A branch of ML using multi-layer neural networks to learn hierarchical representations, often excelling in vision, speech, and language.
Iterative method that updates parameters in the direction of negative gradient to minimize loss.
Learning where data arrives sequentially and the model updates continuously, often under changing distributions.
Uses an exponential moving average of gradients to speed convergence and reduce oscillation.
Popular optimizer combining momentum and per-parameter adaptive step sizes via first/second moment estimates.
Number of samples per gradient update; impacts compute efficiency, generalization, and stability.
One complete traversal of the training dataset during training.
A parameterized function composed of interconnected units organized in layers with nonlinear activations.
Randomly zeroing activations during training to reduce co-adaptation and overfitting.
Training across many devices/silos without centralizing raw data; aggregates updates, not data.
Samples from the k highest-probability tokens to limit unlikely outputs.
Optimization with multiple local minima/saddle points; typical in neural networks.
Variability introduced by minibatch sampling during SGD.
Adjusting learning rate over training to improve convergence.
Limiting gradient magnitude to prevent exploding gradients.
Strategy mapping states to actions.
Optimizing policies directly via gradient ascent on expected reward.
Separates planning from execution in agent architectures.
Probabilistic energy-based neural network with hidden variables.
Sequential data indexed by time.
Using production outcomes to improve models.
Artificial environment for training/testing agents.
Variable whose values depend on chance.
AI-driven buying/selling of financial assets.