CS336: Language Modeling from Scratch
cs336.stanford.edu
94 points by kristianpaul 2 hours ago
> GPU compute for self-study
Those suggestions they make for a B200 start at $4.99 an hour.
Is that really required for starting out? I've been tinkering with my own from-scratch LLM, and in the early phases I haven't needed anything more than a 4090 on Vast.ai.
I have fond memories of cs224d [1] taught by richardsocher. It’s a bit dated now, as it was created in the pre-transformer era, but it was a very cool introduction to applying deep learning to NLP at the time.
Thanks for releasing this again! What are this year's changes to prior offerings?
Are video lectures available online?
YouTube playlist link from the page: https://www.youtube.com/watch?v=JuoVZkPBiKk&list=PLoROMvodv4...