Results for "supervised finetune"
Supervised learning: Learning a function from input-output pairs (labeled data), optimizing performance on predicting outputs for unseen inputs.
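A minimal sketch of the idea above: fit a line to labeled (x, y) pairs by gradient descent, then predict an unseen input. The data and hyperparameters are illustrative, not from any particular source.

```python
# Labeled pairs generated from y = 2x + 1.
pairs = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]

w, b = 0.0, 0.0   # parameters of the model y ≈ w*x + b
lr = 0.05         # learning rate (arbitrary choice)

for _ in range(2000):
    grad_w = grad_b = 0.0
    for x, y in pairs:
        err = (w * x + b) - y              # prediction error on one example
        grad_w += 2 * err * x / len(pairs)
        grad_b += 2 * err / len(pairs)
    w -= lr * grad_w                       # gradient step on squared error
    b -= lr * grad_b

print(round(w * 4.0 + b, 2))  # prediction for unseen input x = 4 → 9.0
```

Because the loss is convex and the data is exactly linear, the parameters converge to w ≈ 2, b ≈ 1.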
Semi-supervised learning: Training with a small labeled dataset plus a larger unlabeled dataset, leveraging assumptions like smoothness/cluster structure.
Self-supervised learning: Learning from data by constructing “pseudo-labels” (e.g., next-token prediction, masked modeling) without manual annotation.
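The next-token-prediction case can be sketched in a few lines: each training target is simply the token that follows, so no human annotation is required.

```python
# Build (context, next-token) pairs from a raw token sequence.
tokens = ["the", "cat", "sat", "on", "the", "mat"]
examples = [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]
# e.g. (["the"], "cat"), (["the", "cat"], "sat"), ...
```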
Data labeling: The human or automated process of assigning targets; quality, consistency, and guidelines matter heavily.
Machine learning: A subfield of AI where models learn patterns from data to make predictions or decisions, improving with experience rather than explicit rule-coding.
Unsupervised learning: Learning structure from unlabeled data, such as discovering groups, compressing representations, or modeling data distributions.
Reinforcement learning: A learning paradigm where an agent interacts with an environment and learns to choose actions that maximize cumulative reward.
Dataset: A structured collection of examples used to train/evaluate models; quality, bias, and coverage often dominate outcomes.
Loss function: A function measuring prediction error (and sometimes calibration), guiding gradient-based optimization.
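Two standard examples of such functions, written out plainly (regression and classification cases):

```python
import math

def mse(y_true, y_pred):
    """Mean squared error: average squared prediction error for regression."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def cross_entropy(p_true_class):
    """Negative log-likelihood of the correct class; 0 when the model
    assigns probability 1 to the right answer, larger when less sure."""
    return -math.log(p_true_class)

print(mse([1.0, 2.0], [1.0, 3.0]))   # 0.5
print(cross_entropy(1.0))            # 0.0 (perfectly confident and correct)
```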
Confusion matrix: A table summarizing classification outcomes, foundational for metrics like precision, recall, specificity.
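A sketch of the binary case, with the metrics derived from the four cells (labels here are made up):

```python
def confusion(y_true, y_pred):
    """Return (TP, FP, FN, TN) counts for binary labels."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    return tp, fp, fn, tn

y_true = [1, 1, 0, 0, 1]
y_pred = [1, 0, 0, 1, 1]
tp, fp, fn, tn = confusion(y_true, y_pred)

precision   = tp / (tp + fp)   # 2/3: of predicted positives, how many were real
recall      = tp / (tp + fn)   # 2/3: of real positives, how many were found
specificity = tn / (tn + fp)   # 1/2: of real negatives, how many were found
```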
Supervised fine-tuning (SFT): Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.
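One common (assumed) convention for turning a (prompt, response) pair into a training example: concatenate the token IDs and mask the prompt positions so the loss is computed only on the response. The token IDs and the -100 sentinel here are illustrative.

```python
IGNORE = -100  # conventional "ignore this position in the loss" sentinel

def build_sft_example(prompt_ids, response_ids):
    """Concatenate prompt and response; mask prompt positions in the labels."""
    input_ids = prompt_ids + response_ids
    labels = [IGNORE] * len(prompt_ids) + response_ids
    return input_ids, labels

inp, lab = build_sft_example([5, 8, 2], [7, 9])
# inp = [5, 8, 2, 7, 9]
# lab = [-100, -100, -100, 7, 9]  → loss only on the response tokens
```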
Alignment: Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.
Safety filtering: Automated detection/prevention of disallowed outputs (toxicity, self-harm, illegal instructions, etc.).
Inter-annotator agreement: A measure of consistency across labelers; low agreement indicates ambiguous tasks or poor guidelines.
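A common such measure is Cohen's kappa, which corrects raw agreement for agreement expected by chance. A small sketch for two annotators with binary labels (the label lists are made up):

```python
def cohens_kappa(a, b):
    """kappa = (observed agreement - chance agreement) / (1 - chance agreement)."""
    n = len(a)
    po = sum(x == y for x, y in zip(a, b)) / n       # observed agreement
    p1 = (sum(a) / n) * (sum(b) / n)                 # both say 1 by chance
    p0 = (1 - sum(a) / n) * (1 - sum(b) / n)         # both say 0 by chance
    pe = p1 + p0
    return (po - pe) / (1 - pe)

print(cohens_kappa([1, 1, 0, 0], [1, 1, 0, 0]))  # perfect agreement → 1.0
print(cohens_kappa([1, 1, 0, 0], [1, 0, 0, 0]))  # partial agreement → 0.5
```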
Active learning: Selecting the most informative samples to label (e.g., uncertainty sampling) to reduce labeling cost.
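Uncertainty sampling in miniature: ask the model for probabilities on the unlabeled pool, then send the examples nearest 0.5 (where the model is least sure) to the labelers. The probabilities below are hypothetical.

```python
def most_uncertain(probs, k):
    """Indices of the k unlabeled examples closest to p = 0.5."""
    idx = sorted(range(len(probs)), key=lambda i: abs(probs[i] - 0.5))
    return idx[:k]

probs = [0.95, 0.48, 0.10, 0.55, 0.99]   # hypothetical model confidences
print(most_uncertain(probs, 2))          # [1, 3]
```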
Image classification: Assigning category labels to images.
Bias: Systematic error introduced by simplifying assumptions in a learning algorithm.
Self-critique: Asking a model to review and improve its own output.
Overgeneralization: Applying learned patterns to cases where they do not hold.
Model collapse: Quality degradation that occurs when a model is trained on its own generated outputs.
Dynamics model: Predicts the next state given the current state and action.
Imitation learning: Learning policies from expert demonstrations.
Behavioral cloning: Learning a state-to-action mapping directly from demonstrations, treated as supervised learning.
World model: A learned model of environment dynamics.
Inverse reinforcement learning: Inferring goals (a reward function) from observed behavior.
E-discovery: AI-assisted review of legal documents.
Fraud detection: Identifying suspicious transactions.
AlphaFold: A deep learning system for protein structure prediction.
Narrow AI: AI limited to specific domains.
Fine-tuning: Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.