Monday, April 6, 2026

Show HN: I built a tiny LLM to demystify how language models work

Show HN: I built a tiny LLM to demystify how language models work
544 by armanified | 59 comments on Hacker News.
Built a ~9M param LLM from scratch to understand how they actually work. Vanilla transformer, 60K synthetic conversations, ~130 lines of PyTorch. Trains in 5 min on a free Colab T4. The fish thinks the meaning of life is food. Fork it and swap the personality for your own character.