LLM Mental Model

learn concepts related to mental model of LLMs, how they work, how to use them effectively, and how to build applications on top of them.

📄️Overview

Most discussions about LLMs focus on prompts, tools, and frameworks. However, few explain how the model actually works under the hood and why that matters when building real systems.

📄️Before Training

Hyperparameters

📄️During Training

Start of training - randomly initialised

📄️Alignment Training

📄️After Training

The model is now frozen