Results for "learning rate"
Learning Rate
IntermediateControls the size of parameter updates; too high diverges, too low trains slowly or gets stuck.
Think of the learning rate as the size of your steps when walking towards a destination. If you take giant steps, you might overshoot and miss your goal, but if you take tiny steps, you might take forever to get there. In machine learning, the learning rate controls how big of a change we make to...
Balancing learning new behaviors vs exploiting known rewards.
Continuous cycle of observation, reasoning, action, and feedback.
Loss of old knowledge when learning new tasks.
Combining simulation and real-world data.
RL without explicit dynamics model.
Learned model of environment dynamics.
Learning without catastrophic forgetting.
Deep learning system for protein structure prediction.
AI limited to specific domains.
A measure of a model class’s expressive capacity based on its ability to shatter datasets.
A measurable property or attribute used as model input (raw or engineered), such as age, pixel intensity, or token ID.
A function measuring prediction error (and sometimes calibration), guiding gradient-based optimization.
Practices for operationalizing ML: versioning, CI/CD, monitoring, retraining, and reliable production management.
Automated testing and deployment processes for models and data workflows, extending DevOps to ML artifacts.
Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.
Reconstructing a model or its capabilities via API queries or leaked artifacts.
Reduction in uncertainty achieved by observing a variable; used in decision trees and active learning.
All possible configurations an agent may encounter.
Expected return of taking action in a state.
Models evaluating and improving their own outputs.
Probabilistic energy-based neural network with hidden variables.
Simplified Boltzmann Machine with bipartite structure.
Increasing performance via more data.
Visualization of optimization landscape.
Flat high-dimensional regions slowing training.
Optimization under uncertainty.
Model trained on its own outputs degrades quality.
RL using learned or known environment models.
Predicts next state given current state and action.
Learning action mapping directly from demonstrations.