Techniques & Methods

Autoregression

In language modeling, autoregression means generating text one token at a time, with each token conditioned on all previously generated tokens. The model probability P(w1, w2, ..., wn) is factored as a product of conditional probabilities, making generation inherently sequential.

All major LLMs (GPT, Claude, Gemini, Llama) are autoregressive. This left-to-right generation process is simple and scalable but means inference is sequential and cannot be parallelized across the output sequence dimension.

Authority Links

Autoregressive Model — Wikipedia

Statistical definition and AI applications of autoregressive models.

Hugging Face — Causal LM

Autoregressive language model training with transformers.

Related Terms

Model Components

Language Model

AI system that assigns probabilities to sequences of words and can generate coherent text.

Techniques & Methods

Sequence Generation

Process where models produce sequences—such as words or tokens—based on learned patterns.

Techniques & Methods

Generation

Producing new text, code, or content based on learned patterns and a given input prompt.

Model Components

Autoregressive Model

Model that generates each output element by conditioning on all previously generated elements.

Backpropagation Attention Mechanism