Results for "hidden objectives"

45 results

Boltzmann Machine Intermediate

Probabilistic energy-based neural network with hidden variables.

Model Architectures
Hidden Markov Model Intermediate

Probabilistic model for sequential data with latent states.

Model Architectures
Specification Gaming Advanced

Model exploits poorly specified objectives.

AI Safety & Alignment
Mesa-Optimizer Advanced

Learned model that is itself an optimizer, pursuing its own objective, which may differ from the training objective.

AI Safety & Alignment
Backdoor / Trojan Intermediate

Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.

Foundations & Theory
Prompt Leakage Intermediate

Extracting system prompts or hidden instructions.

AI Economics & Strategy
Restricted Boltzmann Machine Intermediate

Simplified Boltzmann Machine with bipartite structure.

Model Architectures
Scratchpad Intro

Temporary reasoning space (often hidden).

Prompting & Instructions
Reward Hacking Advanced

Maximizing the reward signal without fulfilling the intended goal.

AI Safety & Alignment
Instrumental Convergence Advanced

Tendency for agents with different final goals to pursue similar subgoals, such as acquiring resources.

AI Safety & Alignment
Value Misalignment Advanced

Model optimizes objectives misaligned with human values.

AI Safety & Alignment
Outer Alignment Advanced

Specifying an objective that correctly captures the intended goal.

AI Safety & Alignment
Corrigibility Advanced

Willingness of system to accept correction or shutdown.

AI Safety & Alignment
Competitive Game Advanced

Agents have opposing objectives.

Agents & Autonomy
Unsupervised Learning Intermediate

Learning structure from unlabeled data, such as discovering groups, compressing representations, or modeling data distributions.

Machine Learning
Neural Network Intermediate

A parameterized function composed of interconnected units organized in layers with nonlinear activations.

Neural Networks
Recurrent Neural Network Intermediate

Networks with recurrent connections for sequences; largely supplanted by Transformers for many tasks.

Neural Networks
Universal Approximation Theorem Intermediate

Neural networks can approximate any continuous function under certain conditions.

AI Economics & Strategy
Confounding Intermediate

A hidden variable influences both cause and effect, biasing naive estimates of causal impact.

Foundations & Theory
Bottleneck Layer Intermediate

A narrow hidden layer forcing compact representations.

AI Economics & Strategy
Speech Recognition Intermediate

Converting audio speech into text, often using encoder-decoder or transducer architectures.

Speech & Audio AI
Acoustic Model Intermediate

Maps audio signals to linguistic units.

Speech & Audio AI
Prosody Intermediate

Rhythm, stress, and intonation of speech (its timing and pitch characteristics).

Speech & Audio AI
State Space Model Intermediate

Models time evolution via hidden states.

Time Series
Intent Recognition Frontier

Inferring human goals from behavior.

World Models & Cognition
Self-Supervised Learning Intermediate

Learning from data by constructing “pseudo-labels” (e.g., next-token prediction, masked modeling) without manual annotation.

Machine Learning
Multi-Agent System Intermediate

Multiple agents interacting cooperatively or competitively.

AI Economics & Strategy
Emergent Coordination Intermediate

Coordination arising without explicit programming.

AI Economics & Strategy
Mode Collapse Advanced

Generator produces limited variety of outputs.

Diffusion & Generative Models
Hierarchical Planning Advanced

Decomposing goals into sub-tasks.

Agents & Autonomy
