Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆1,558Apr 7, 2026Updated 3 weeks ago
Alternatives and similar repositories for assignment1-basics
Users that are interested in assignment1-basics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch☆215Updated this week
- ☆2,922Apr 29, 2026Updated last week
- ☆56Updated this week
- Stanford "Language Modeling from Scratch" CS336 Assignment1 - 斯坦福大学 CS336 课程作业1 个人实现,仅供参考☆44Jun 15, 2025Updated 10 months ago
- ☆141Mar 30, 2026Updated last month
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆49Mar 30, 2026Updated last month
- Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆78Jul 7, 2025Updated 9 months ago
- ☆71Jul 13, 2024Updated last year
- ☆96Jul 20, 2025Updated 9 months ago
- Advanced NLP, Fall 2025 https://cmu-l3.github.io/anlp-fall2025/☆61Jan 18, 2026Updated 3 months ago
- 记录我在cs336学习时的笔记和作业☆819Mar 30, 2026Updated last month
- This is a cross-chip platform collection of operators and a unified neural network library.☆17Nov 3, 2023Updated 2 years ago
- This is a official repository for MExD☆20Oct 27, 2025Updated 6 months ago
- ☆424Dec 26, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 2025华为软件精英挑战赛 总决赛最佳大模型应用奖☆38Apr 22, 2025Updated last year
- ☆13Aug 13, 2025Updated 8 months ago
- ☆11Jun 21, 2025Updated 10 months ago
- 📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉☆10,865Updated this week
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework"☆11Oct 23, 2024Updated last year
- ☆36Feb 8, 2026Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆78,979Updated this week
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework☆21,046Updated this week
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection☆21Feb 3, 2024Updated 2 years ago
- My implementation of Stanford CS336 assignments.☆239Mar 15, 2026Updated last month
- SGLang is a high-performance serving framework for large language models and multimodal models.☆26,832Updated this week
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…☆9,441Updated this week
- Nano vLLM☆13,219Apr 26, 2026Updated last week
- Data synthesis code for "AGENT: A Benchmark for Core Psychological Reasoning"☆24Mar 3, 2022Updated 4 years ago
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆91,948Apr 16, 2026Updated 2 weeks ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆609Oct 7, 2025Updated 6 months ago
- Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O☆573Sep 13, 2025Updated 7 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,462Jul 1, 2024Updated last year
- My learning notes for ML SYS.☆6,166Apr 23, 2026Updated 2 weeks ago
- Course on Flash-attention in Triton☆98Feb 9, 2026Updated 2 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆57,469Nov 12, 2025Updated 5 months ago
- CMU 15-712 lecture slides☆11Jan 6, 2020Updated 6 years ago
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)☆34Oct 16, 2024Updated last year
- 使用Sentencepiece对中文语料进行分词☆13Nov 30, 2023Updated 2 years ago