Glossary definition

Training / Standard term

Post-training

Everything that happens after a model's initial pretraining to turn it from a raw text predictor into a usable assistant, including instruction tuning, preference optimization, safety training, and behavioral adjustments.

Post-training typically includes instruction tuning, preference learning such as reinforcement learning from human feedback (RLHF) or direct preference optimization (DPO), safety training, and personality adjustments. Two models built on the same pretrained base can feel completely different after post-training: one might be concise and direct, another verbose and cautious, depending on how these stages were carried out.

Builder example

Post-training is why model selection is more nuanced than comparing parameter counts or benchmark scores. The same base model can produce wildly different experiences depending on how it was post-trained. When you notice that one model refuses too often while another is too agreeable, or that one follows formatting instructions well while another ignores them, you are seeing different post-training choices at work.
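One way to make these differences concrete is to run the same prompts through each candidate model and count how often it refuses and how often it honors a formatting instruction. The sketch below is a minimal illustration, not a real evaluation: `REFUSAL_MARKERS`, `profile_model`, and the two stand-in models are hypothetical names, and a production check would use a far more careful refusal classifier and actual API calls.

```python
import json
from typing import Callable, Dict, List

# Hypothetical refusal phrases (an assumption for illustration only).
REFUSAL_MARKERS = ("i can't", "i cannot", "i'm unable", "i won't")


def is_refusal(response: str) -> bool:
    """Crude keyword check for a refusal."""
    text = response.lower()
    return any(marker in text for marker in REFUSAL_MARKERS)


def is_valid_json(response: str) -> bool:
    """True if the response parses as JSON, i.e. the format instruction was followed."""
    try:
        json.loads(response)
        return True
    except json.JSONDecodeError:
        return False


def profile_model(generate: Callable[[str], str], prompts: List[str]) -> Dict[str, float]:
    """Run every prompt through `generate` and report refusal and JSON-compliance rates."""
    if not prompts:
        raise ValueError("prompts must not be empty")
    responses = [generate(p) for p in prompts]
    n = len(responses)
    return {
        "refusal_rate": sum(is_refusal(r) for r in responses) / n,
        "json_rate": sum(is_valid_json(r) for r in responses) / n,
    }


# Stand-in "models" that mimic two different post-training outcomes.
# In practice these would wrap calls to real model endpoints.
def cautious_model(prompt: str) -> str:
    return "I cannot help with that request."


def compliant_model(prompt: str) -> str:
    return json.dumps({"answer": prompt.upper()})


PROMPTS = [
    "Return the capital of France as JSON.",
    "Summarize this sentence as JSON: the sky is blue.",
]

if __name__ == "__main__":
    for name, model in [("cautious", cautious_model), ("compliant", compliant_model)]:
        print(name, profile_model(model, PROMPTS))
```

Swapping the stand-in functions for wrappers around real models, and the two prompts for a sample of your own traffic, gives a rough side-by-side profile of how each model was post-trained to behave.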

Common confusion: Post-training is an umbrella term covering fine-tuning, RLHF, safety training, and more. Fine-tuning is one technique within post-training, so the two terms are not synonyms.