Logits

Raw model outputs before converting to probabilities; manipulated during decoding and calibration.

Why It Matters

Logits are the quantities a neural network actually computes before any probability is produced, and every class or token probability is derived from them. Adjusting logits at decoding time, for example by rescaling them or by restricting selection to the highest-scoring candidates, directly changes which outputs a model produces. This affects applications from language translation to image recognition.

Logits are the raw, unnormalized scores produced by a neural network's final layer before they are converted into probabilities. For a final linear layer they are typically written as z = W * x + b, where W is the weight matrix, x is the input to that layer (the last hidden representation, not the raw model input), and b is the bias vector. The softmax function maps z to a probability distribution over the output classes: p_i = exp(z_i) / sum_j exp(z_j). This step is what lets a model make probabilistic predictions in tasks such as classification and sequence generation. Logits are also adjusted or compared during decoding, for example in beam search or sampling methods, to influence which outputs are selected based on their relative likelihoods.
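
The relationship between logits and probabilities can be illustrated with a minimal sketch in plain Python. The weights, input, and bias below are hypothetical values chosen for illustration, and the `temperature` parameter is an assumption added here to show one common way logits are rescaled before sampling; it is not part of the definition above.

```python
import math


def linear_logits(W, x, b):
    """Compute z = W * x + b, with W given as a list of rows."""
    return [
        sum(w_ij * x_j for w_ij, x_j in zip(row, x)) + b_i
        for row, b_i in zip(W, b)
    ]


def softmax(logits, temperature=1.0):
    """Convert raw logits into probabilities that sum to 1."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    # Subtracting the maximum leaves the result unchanged
    # but avoids overflow in exp() for large logits.
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical final-layer parameters: 3 output classes, 2 hidden features.
W = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
x = [2.0, 1.0]
b = [0.0, 0.0, -2.0]

logits = linear_logits(W, x, b)           # [2.0, 1.0, 1.0]
probs = softmax(logits)                   # standard softmax
sharp = softmax(logits, temperature=0.5)  # more mass on the top class
flat = softmax(logits, temperature=2.0)   # closer to uniform
```

Dividing the logits by a temperature below 1 concentrates probability on the highest-scoring class, while a temperature above 1 spreads it out. The ranking of classes is the same in all three cases, because softmax preserves the order of the logits.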

Keywords

pre-softmax scores

Domains

Foundations & Theory
