A from-the-ground-up walkthrough of how modern LLMs work, from tokens to transformer blocks to the next-token loop