Stanford CS336 | Language Modeling from Scratch

Stanford CS336: Language Modeling from Scratch is an advanced deep learning and AI systems course focused on building large language models (LLMs) from the ground up. The course provides a detailed understanding of the engineering, mathematics, and infrastructure behind modern generative AI systems.

The course begins with an overview of language modeling fundamentals, including tokenization techniques and how text data is transformed into machine-readable representations for neural networks.
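To make the idea of a machine-readable representation concrete, here is a minimal sketch of one simple tokenization scheme, assuming a byte-level vocabulary in which every UTF-8 byte is its own token. The function names `encode` and `decode` are hypothetical and chosen for illustration; this is not necessarily the tokenizer used in the course.

```python
# Byte-level tokenization sketch (an illustrative assumption, not the course's tokenizer).
# Each UTF-8 byte becomes one integer token ID in the range 0..255.

def encode(text: str) -> list[int]:
    """Convert a string into a list of integer token IDs, one per UTF-8 byte."""
    return list(text.encode("utf-8"))


def decode(ids: list[int]) -> str:
    """Convert a list of byte-valued token IDs back into the original string."""
    return bytes(ids).decode("utf-8")


ids = encode("hi!")
print(ids)          # [104, 105, 33]
print(decode(ids))  # hi!
```

A byte-level vocabulary can represent any text with only 256 token IDs, but it produces long sequences; subword methods trade a larger vocabulary for shorter sequences.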

Learners are introduced to PyTorch and computational resource accounting, helping