Temperature is a sampling parameter that controls the randomness of language model output by scaling the logits (the pre-softmax scores) before the next token is sampled, directly shaping the creativity-accuracy tradeoff. Mathematically, each logit z_i is divided by the temperature T before the softmax, so p_i ∝ exp(z_i / T): lower temperatures sharpen the probability distribution (concentrating mass on likely tokens), while higher temperatures flatten it (spreading mass across more options).

At temperature 0, the model becomes deterministic, always selecting the highest-probability token (greedy decoding). At temperature 1, the model samples directly from its learned distribution. At temperatures above 1, unlikely tokens gain probability, producing more varied but potentially incoherent output.

Most applications use temperatures between 0 and 1. Low temperatures (0.1-0.3) suit tasks that require accuracy and consistency: code generation, data extraction, factual Q&A. Medium temperatures (0.5-0.8) balance creativity and coherence for general conversation and writing. High temperatures (0.9-1.2) generate diverse, creative content but risk incoherence.

Temperature interacts with other sampling parameters: top-k and top-p truncate the distribution, so sampling draws from a temperature-scaled, truncated set of candidates. The optimal temperature depends on the task, the model, and the desired output characteristics. Production systems often use different temperatures for different features: low for structured outputs, higher for creative suggestions. Understanding temperature is fundamental to prompt engineering and effective API usage.
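The scaling described above can be sketched in a few lines of Python. This is a minimal illustration, not any particular API's implementation: the helper name `apply_temperature` is hypothetical, and temperature 0 is treated as greedy (argmax) decoding, as many inference APIs do.

```python
import math

def apply_temperature(logits, temperature):
    """Turn raw logits into a next-token sampling distribution.

    Lower temperature sharpens the distribution; higher temperature
    flattens it; temperature 0 collapses to greedy decoding.
    """
    if temperature <= 0:
        # Greedy decoding: all probability on the highest logit.
        best = max(range(len(logits)), key=lambda i: logits[i])
        return [1.0 if i == best else 0.0 for i in range(len(logits))]
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max before exponentiating, for stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.1]
for t in (0.2, 1.0, 2.0):
    print(t, [round(p, 3) for p in apply_temperature(logits, t)])
```

Running this on the same logits at several temperatures shows the effect directly: at 0.2 nearly all probability lands on the top token, at 1.0 the model's learned distribution is unchanged, and at 2.0 the three options are noticeably closer together.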