Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆1,364Aug 29, 2025Updated 6 months ago
Alternatives and similar repositories for assignment1-basics
Users that are interested in assignment1-basics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch☆194Jul 25, 2025Updated 8 months ago
- ☆2,783Jan 9, 2026Updated 2 months ago
- ☆48Jul 21, 2025Updated 8 months ago
- ☆124Jul 21, 2025Updated 8 months ago
- Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆77Jul 7, 2025Updated 8 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆73Jul 13, 2024Updated last year
- ☆96Jul 20, 2025Updated 8 months ago
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference☆31Mar 19, 2026Updated last week
- ☆414Dec 26, 2024Updated last year
- ☆11Jun 21, 2025Updated 9 months ago
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework"☆11Oct 23, 2024Updated last year
- Substrate TypeScript SDK☆10Sep 20, 2024Updated last year
- 📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉☆10,022Updated this week
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A high-throughput and memory-efficient inference and serving engine for LLMs☆74,135Updated this week
- [NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection☆21Feb 3, 2024Updated 2 years ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆58Mar 12, 2026Updated 2 weeks ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆20,097Updated this week
- Nano vLLM☆12,353Nov 3, 2025Updated 4 months ago
- a simple variational auto encoder with some exploration☆12Nov 22, 2024Updated last year
- Data synthesis code for "AGENT: A Benchmark for Core Psychological Reasoning"☆24Mar 3, 2022Updated 4 years ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆603Oct 7, 2025Updated 5 months ago
- SGLang is a high-performance serving framework for large language models and multimodal models.☆24,829Updated this week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆89,206Updated this week
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)☆9,231Updated this week
- Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O☆561Sep 13, 2025Updated 6 months ago
- My learning notes for ML SYS.☆5,737Mar 19, 2026Updated last week
- hakken is a coding agent which needs hell lot of context☆31Dec 4, 2025Updated 3 months ago
- ☆11Dec 11, 2024Updated last year
- Course on Flash-attention in Triton☆98Feb 9, 2026Updated last month
- A Pytroch Implementation of Some Backdoor Attack Algorithms, Including BadNets, SIG, FIBA, FTrojan ...☆22Dec 7, 2024Updated last year
- Open-source framework for the research and development of foundation models.☆816Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)☆34Oct 16, 2024Updated last year
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,385Jul 1, 2024Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆55,432Nov 12, 2025Updated 4 months ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆34Jun 8, 2023Updated 2 years ago
- Demo of knowledge graph creation and Graph RAG with Dspy and Kuzu☆22Jun 30, 2025Updated 8 months ago
- ☆31Nov 30, 2025Updated 3 months ago
- This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."☆14,995Feb 22, 2026Updated last month