Overview
Most discussions about LLMs focus on prompts, tools, and frameworks. However, few explain how the model actually works under the hood and why that matters when building real systems.
Before Training
Hyperparameters
During Training
Start of training - randomly initialised
Alignment Training
After Training
The model is now frozen