Results for "state-action value"
Balancing exploration of new actions to gather information against exploitation of actions already known to yield reward.
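A common way to strike this balance is epsilon-greedy action selection. A minimal sketch, assuming a list of estimated arm values (the name `epsilon_greedy` is illustrative, not from any particular library):

```python
import random

def epsilon_greedy(q_values, eps=0.1):
    """With probability eps, explore a uniformly random arm;
    otherwise exploit the arm with the highest estimated value."""
    if random.random() < eps:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda i: q_values[i])
```

With `eps=0.0` this always exploits; raising `eps` trades estimated reward for information about under-sampled arms.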
Learning policies from expert demonstrations.
Acting to minimize surprise or free energy.
What would have happened under different conditions.
The capability to infer a system's internal state from external telemetry; crucial for monitoring AI services and agents.
Probabilistic model for sequential data with latent states.
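The probability such a model assigns to an observation sequence can be computed with the forward algorithm, marginalizing over all hidden-state paths. A minimal sketch, assuming list-of-lists transition tables and dict emission tables (all names illustrative):

```python
def hmm_forward(init, trans, emit, obs):
    """Forward algorithm: total probability of an observation sequence
    under an HMM, summing over every hidden-state path."""
    n = len(init)
    # alpha[s] = P(observations so far, current hidden state = s)
    alpha = [init[s] * emit[s][obs[0]] for s in range(n)]
    for o in obs[1:]:
        alpha = [
            emit[s][o] * sum(alpha[t] * trans[t][s] for t in range(n))
            for s in range(n)
        ]
    return sum(alpha)
```

Because the recursion reuses the per-state sums, this runs in time linear in the sequence length rather than enumerating all state paths.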
An algorithm that computes control actions from the observed or estimated system state.
Equations governing how system states change over time.
Methods for breaking goals into steps; can be classical (A*, STRIPS) or LLM-driven with tool calls.
System that independently pursues goals over time.
Average of squared residuals; common regression objective.
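The definition translates directly to code. A minimal sketch over plain float lists (the helper name `mse` is illustrative):

```python
def mse(y_true, y_pred):
    """Mean squared error: the average of squared residuals."""
    if len(y_true) != len(y_pred):
        raise ValueError("length mismatch")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
```

Squaring penalizes large residuals disproportionately, which is why MSE is sensitive to outliers compared with absolute-error objectives.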
Iterative method that updates parameters in the direction of negative gradient to minimize loss.
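A minimal 1-D sketch of that update rule, assuming the caller supplies the gradient function (names and defaults are illustrative):

```python
def gradient_descent(grad, x0, lr=0.1, steps=100):
    """Minimize a 1-D function by repeatedly stepping against its gradient."""
    x = x0
    for _ in range(steps):
        x -= lr * grad(x)  # move in the negative-gradient direction
    return x
```

For example, minimizing f(x) = (x - 3)^2 via its gradient 2(x - 3) converges toward x = 3; too large a learning rate would instead overshoot and diverge.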
Feature attribution method grounded in cooperative game theory for explaining predictions in tabular settings.
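The game-theoretic quantity underneath is the Shapley value: a player's marginal contribution averaged over all orderings. A brute-force sketch (exponential in the number of players, so practical SHAP implementations approximate it; names are illustrative):

```python
import math
from itertools import permutations

def shapley_values(players, value):
    """Exact Shapley values: each player's marginal contribution to the
    growing coalition, averaged over every ordering of the players."""
    phi = {p: 0.0 for p in players}
    for order in permutations(players):
        coalition = []
        for p in order:
            before = value(frozenset(coalition))
            coalition.append(p)
            phi[p] += value(frozenset(coalition)) - before
    n_orderings = math.factorial(len(players))
    return {p: total / n_orderings for p, total in phi.items()}
```

For a toy value function with v(∅)=0, v({a})=1, v({b})=2, v({a,b})=4, this yields φ(a)=1.5 and φ(b)=2.5, which sum to v({a,b}) as Shapley values must (the efficiency property SHAP relies on).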
Limiting gradient magnitude to prevent exploding gradients.
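A minimal sketch of clipping by global norm, assuming gradients flattened into one list (the helper name is illustrative):

```python
import math

def clip_by_global_norm(grads, max_norm):
    """Rescale gradients so their global L2 norm is at most max_norm;
    gradients already within the bound pass through unchanged."""
    norm = math.sqrt(sum(g * g for g in grads))
    if norm <= max_norm:
        return list(grads)
    return [g * (max_norm / norm) for g in grads]
```

Rescaling the whole vector preserves the gradient's direction, unlike clipping each component independently.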
Models that define an energy landscape rather than explicit probabilities.
Factorizes a matrix into orthogonal singular vectors and singular values; used in embeddings and compression.
Attention between different modalities.
Average value under a distribution.
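For a finite distribution this is a weighted sum. A minimal sketch, representing the distribution as a value-to-probability mapping (an illustrative convention, not a library API):

```python
def expectation(dist):
    """E[X]: sum of value * probability over a finite distribution
    given as a {value: probability} mapping."""
    return sum(x * p for x, p in dist.items())
```

A fair six-sided die, for instance, has expectation (1 + 2 + ... + 6) / 6 = 3.5.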
Agents optimize collective outcomes.
RL without explicit dynamics model.
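Model-free methods such as Q-learning estimate the state-action value Q(s, a) directly from sampled transitions, never learning the dynamics themselves. A toy sketch on a hypothetical 4-state corridor (all names and constants illustrative):

```python
import random

# Hypothetical corridor MDP: states 0..3, actions 0 = left, 1 = right;
# reaching state 3 ends the episode with reward 1, all other rewards are 0.
def q_learning(episodes=500, alpha=0.5, gamma=0.9, eps=0.1):
    Q = [[0.0, 0.0] for _ in range(4)]  # Q[state][action]
    for _ in range(episodes):
        s = 0
        while s != 3:
            # epsilon-greedy behavior policy
            if random.random() < eps:
                a = random.randrange(2)
            else:
                a = 0 if Q[s][0] > Q[s][1] else 1
            ns = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if ns == 3 else 0.0
            bootstrap = 0.0 if ns == 3 else max(Q[ns])
            # TD update: learn Q(s, a) from the sampled transition alone
            Q[s][a] += alpha * (r + gamma * bootstrap - Q[s][a])
            s = ns
    return Q
```

Note the update uses only the observed (s, a, r, s') tuple; nowhere does the agent consult or build a transition model. After training, the greedy policy moves right in every state.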
An RNN variant using gates to mitigate vanishing gradients and capture longer context.
Simultaneous Localization and Mapping for robotics.
Temporary reasoning space (often hidden).
Internal sensing of joint positions, velocities, and forces.
Optimal control for linear systems with quadratic cost.
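For the scalar discrete-time case, the optimal feedback gain falls out of iterating the Riccati equation to a fixed point. A minimal sketch (for matrix systems one would typically use a solver such as SciPy's `scipy.linalg.solve_discrete_are` instead; names here are illustrative):

```python
def scalar_dlqr(a, b, q, r, iters=1000):
    """Fixed-point iteration on the scalar discrete-time Riccati equation
    for dynamics x' = a*x + b*u and cost sum(q*x^2 + r*u^2);
    returns the optimal state-feedback gain k for u = -k * x."""
    p = q
    for _ in range(iters):
        k = (b * p * a) / (r + b * p * b)   # gain implied by current p
        p = q + a * p * (a - b * k)         # Riccati recursion
    return k
```

For a = b = q = r = 1 this converges to k = (√5 − 1)/2 ≈ 0.618, giving a stable closed loop since |a − b·k| ≈ 0.382 < 1.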
Understanding that objects continue to exist when out of sight.
AI giving legal advice without authorization.
A system that perceives state, selects actions, and pursues goals, often combining LLM reasoning with tools and memory.
A high-priority instruction layer setting overarching behavior constraints for a chat model.
Separates planning from execution in agent architectures.