Logits are the raw, unnormalized scores output by a neural network's final layer, before softmax converts them into probabilities. In a language model, the output layer produces one logit per token in the vocabulary; a higher logit indicates higher model confidence that the corresponding token should come next.

The term 'logit' comes from statistics, where it refers to the log-odds of an event. In neural networks, logits are the pre-activation values that softmax exponentiates and normalizes: p(token_i) = exp(logit_i) / sum_j exp(logit_j). Because softmax normalizes the whole vector (and is unchanged by adding a constant to every logit), the relative differences between logits matter, not their absolute values.

Logit manipulation enables fine-grained control over model behavior. Classifier-free guidance steers sampling by combining conditional and unconditional predictions (noise estimates in diffusion models, logits in autoregressive language models). Constitutional AI discourages harmful outputs by changing the model through training, which in turn changes the logits it produces, rather than by editing logits at inference time. Logit bias in API calls raises or lowers the scores of specific tokens, making them more or less likely to be sampled. Temperature scaling divides the logits before softmax: higher temperature flattens the distribution (more randomness), lower temperature sharpens it (more determinism).

Understanding logits is essential for advanced prompting techniques, model interpretability, and controlling generation behavior beyond simple sampling parameters.
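The softmax formula and temperature scaling above can be sketched in a few lines of plain Python. This is an illustrative toy, not any particular model's implementation; the three-element logit vector is an invented example:

```python
import math

def softmax(logits, temperature=1.0):
    # Divide logits by temperature before exponentiating (T=1 is plain softmax).
    scaled = [l / temperature for l in logits]
    # Subtracting the max is a standard numerical-stability trick; softmax is
    # shift-invariant, which is why only relative logit differences matter.
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5]  # hypothetical logits for a 3-token vocabulary
print(softmax(logits))                   # baseline distribution
print(softmax(logits, temperature=2.0))  # flatter: more randomness
print(softmax(logits, temperature=0.5))  # sharper: more deterministic
```

Running this shows the top token's probability shrinking at temperature 2.0 and growing at temperature 0.5, while each output still sums to 1.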
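Logit bias can be sketched the same way: add an offset to chosen tokens' logits before softmax. The token indices and bias values below are invented, loosely mimicking the logit-bias parameter some LLM APIs expose, where a large negative bias effectively bans a token:

```python
import math

def softmax(logits):
    m = max(logits)  # subtract max for numerical stability
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical 4-token vocabulary; bias maps token index -> logit offset.
logits = [1.0, 0.5, 0.2, -0.3]
bias = {2: 5.0, 0: -100.0}  # boost token 2, effectively ban token 0

biased = [l + bias.get(i, 0.0) for i, l in enumerate(logits)]
probs = softmax(biased)
```

After the bias, token 2 dominates the distribution and token 0's probability is vanishingly small, even though token 0 had the highest raw logit.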