CS336: Language Modeling from Scratch

cs336.stanford.edu

94 points by kristianpaul 2 hours ago


skerit - 13 minutes ago

> GPU compute for self-study

The B200 options they suggest start at $4.99 an hour.

Is that really required for starting out? I've been tinkering with my own from-scratch LLM, but in the early phases I don't need anything more than a 4090 on Vast.ai.

meken - 42 minutes ago

I have fond memories of cs224d [1] taught by richardsocher. It’s a bit dated now, as it was created in the pre-transformer era, but it was a very cool introduction to applying deep learning to NLP at the time.

[1] https://cs224d.stanford.edu

storus - an hour ago

Thanks for releasing this again! What has changed this year compared to prior offerings?

tmule - 41 minutes ago

Are video lectures available online?