stanford-cs336 / assignment1-basics
Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆67 Updated 3 weeks ago
Alternatives and similar repositories for assignment1-basics
Users that are interested in assignment1-basics are comparing it to the libraries listed below
- ☆85 Updated 7 months ago
- ☆108 Updated last week
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/ ☆51 Updated last month
- minimal GRPO implementation from scratch ☆88 Updated 2 months ago
- An extension of the nanoGPT repository for training small MOE models. ☆140 Updated 2 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆180 Updated this week
- ☆267 Updated 4 months ago
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example” ☆143 Updated last week
- NeurIPS 2024 tutorial on LLM Inference ☆43 Updated 5 months ago
- Notes and commented code for RLHF (PPO) ☆90 Updated last year
- Simple repository for training small reasoning models ☆27 Updated 3 months ago
- ☆163 Updated 4 months ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates. ☆105 Updated this week
- ☆40 Updated 10 months ago
- Distributed training (multi-node) of a Transformer model ☆66 Updated last year
- A brief and partial summary of RLHF algorithms. ☆128 Updated 2 months ago
- ☆186 Updated 3 months ago
- This repository contains a simple llama3 implementation in pure JAX. ☆63 Updated 2 months ago
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO) ☆103 Updated last week
- ☆32 Updated 2 months ago
- Official Implementation of "Reasoning Language Models: A Blueprint" ☆59 Updated 3 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀 ☆46 Updated 11 months ago
- Code for Paper: Learning Adaptive Parallel Reasoning with Language Models ☆77 Updated 3 weeks ago
- Tina: Tiny Reasoning Models via LoRA ☆192 Updated 3 weeks ago
- Notes on Direct Preference Optimization ☆19 Updated last year
- making the official triton tutorials actually comprehensible ☆28 Updated last month
- ☆76 Updated 10 months ago
- ☆60 Updated last year
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆105 Updated last year
- SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning ☆261 Updated this week