Results for "state-action value"

AdvertisementAd space — search-top

110 results

Exploration-Exploitation Tradeoff Intermediate

Balancing learning new behaviors vs exploiting known rewards.

AI Economics & Strategy
Imitation Learning Advanced

Learning policies from expert demonstrations.

Reinforcement Learning
Active Inference Frontier

Acting to minimize surprise or free energy.

World Models & Cognition
Counterfactual Advanced

What would have happened under different conditions.

Causal AI & Interpretability
Observability Intermediate

A broader capability to infer internal system state from telemetry, crucial for AI services and agents.

Evaluation & Benchmarking
Hidden Markov Model Intermediate

Probabilistic model for sequential data with latent states.

Model Architectures
Controller Intermediate

Algorithm computing control actions.

Foundations & Theory
System Dynamics Advanced

Equations governing how system states change over time.

Dynamics & Physics
Planning Intermediate

Methods for breaking goals into steps; can be classical (A*, STRIPS) or LLM-driven with tool calls.

Foundations & Theory
Autonomous Agent Advanced

System that independently pursues goals over time.

Agents & Autonomy
Mean Squared Error Intermediate

Average of squared residuals; common regression objective.

Optimization
Gradient Descent Intermediate

Iterative method that updates parameters in the direction of negative gradient to minimize loss.

Optimization
SHAP Intermediate

Feature attribution method grounded in cooperative game theory for explaining predictions in tabular settings.

Foundations & Theory
Gradient Clipping Intermediate

Limiting gradient magnitude to prevent exploding gradients.

AI Economics & Strategy
Energy-Based Model Intermediate

Models that define an energy landscape rather than explicit probabilities.

Model Architectures
Singular Value Decomposition Advanced

Decomposes a matrix into orthogonal components; used in embeddings and compression.

Mathematics
Cross-Attention Intermediate

Attention between different modalities.

Computer Vision
Expectation Advanced

Average value under a distribution.

Probability & Statistics
Cooperative Game Advanced

Agents optimize collective outcomes.

Agents & Autonomy
Model-Free RL Advanced

RL without explicit dynamics model.

Reinforcement Learning
LSTM Intermediate

An RNN variant using gates to mitigate vanishing gradients and capture longer context.

Foundations & Theory
SLAM Intermediate

Simultaneous Localization and Mapping for robotics.

Computer Vision
Scratchpad Intro

Temporary reasoning space (often hidden).

Prompting & Instructions
Proprioception Advanced

Internal sensing of joint positions, velocities, and forces.

Robotics & Embodied AI
Linear Quadratic Regulator Intermediate

Optimal control for linear systems with quadratic cost.

Foundations & Theory
Object Permanence Frontier

Understanding objects exist when unseen.

World Models & Cognition
Unauthorized Practice of Law Intermediate

AI giving legal advice without authorization.

AI in Law
Agent Intermediate

A system that perceives state, selects actions, and pursues goals—often combining LLM reasoning with tools and memory.

Agents & Autonomy
System Prompt Intermediate

A high-priority instruction layer setting overarching behavior constraints for a chat model.

Reinforcement Learning
Planner-Executor Intermediate

Separates planning from execution in agent architectures.

AI Economics & Strategy

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.