Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆1,963Apr 7, 2026Updated 2 months ago
Alternatives and similar repositories for assignment1-basics
Users that are interested in assignment1-basics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch☆247May 1, 2026Updated last month
- ☆3,261May 28, 2026Updated 2 weeks ago
- ☆70Updated this week
- Stanford "Language Modeling from Scratch" CS336 Assignment1 - 斯坦福大学 CS336 课程作业1 个人实现,仅供参考☆46Jun 15, 2025Updated last year
- ☆55May 7, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆160Jun 4, 2026Updated last week
- ☆37Apr 19, 2026Updated last month
- Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆78Jul 7, 2025Updated 11 months ago
- A private repo for learning CS336☆37Sep 2, 2025Updated 9 months ago
- ☆73Jul 13, 2024Updated last year
- ☆97Jul 20, 2025Updated 10 months ago
- Advanced NLP, Fall 2025 https://cmu-l3.github.io/anlp-fall2025/☆63Jan 18, 2026Updated 4 months ago
- ☆431Dec 26, 2024Updated last year
- A PyTorch native library for training speculative decoding models☆159Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The BusTub Relational Database Management System (Educational)☆30Jan 19, 2024Updated 2 years ago
- ☆11Jun 21, 2025Updated 11 months ago
- 📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉☆11,245May 29, 2026Updated 2 weeks ago
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework"☆11Oct 23, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆82,482Updated this week
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework☆21,969Updated this week
- 由我制作的 ICS 讲稿☆38Nov 11, 2025Updated 7 months ago
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference☆45Mar 28, 2026Updated 2 months ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆37Feb 8, 2026Updated 4 months ago
- [NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection☆21Feb 3, 2024Updated 2 years ago
- SGLang is a high-performance serving framework for large language models and multimodal models.☆28,978Updated this week
- Nano vLLM☆14,031Apr 26, 2026Updated last month
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆57Mar 12, 2026Updated 3 months ago
- My implementation of Stanford CS336 assignments.☆239Mar 15, 2026Updated 3 months ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…☆9,619Jun 9, 2026Updated last week
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆96,979Jun 2, 2026Updated 2 weeks ago
- a simple variational auto encoder with some exploration☆12Nov 22, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆622Oct 7, 2025Updated 8 months ago
- My learning notes for ML SYS.☆6,520Jun 8, 2026Updated last week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆59,420Nov 12, 2025Updated 7 months ago
- Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O☆586Sep 13, 2025Updated 9 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,573Jul 1, 2024Updated last year
- Code for "Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation" (Findings of ACL 2024)☆16Jul 4, 2024Updated last year
- Course on Flash-attention in Triton☆99Feb 9, 2026Updated 4 months ago