stanford-cs336 / spring2024-lectures
☆260 Updated 4 months ago
Alternatives and similar repositories for spring2024-lectures:
Users who are interested in spring2024-lectures are comparing it to the repositories listed below.
- ☆85 Updated 7 months ago
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/ ☆50 Updated last month
- A brief and partial summary of RLHF algorithms. ☆128 Updated 2 months ago
- ☆181 Updated 2 months ago
- A bibliography and survey of the papers surrounding o1 ☆1,190 Updated 5 months ago
- ☆69 Updated this week
- ☆159 Updated 4 months ago
- Notes and commented code for RLHF (PPO) ☆90 Updated last year
- What would you do with 1000 H100s... ☆1,043 Updated last year
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆307 Updated 5 months ago
- ☆287 Updated last month
- Explorations into some recent techniques surrounding speculative decoding ☆261 Updated 4 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆459 Updated last year
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc. ☆336 Updated 2 weeks ago
- Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch ☆57 Updated 2 weeks ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆208 Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆255 Updated last year
- An extension of the nanoGPT repository for training small MOE models. ☆138 Updated last month
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024 ☆291 Updated this week
- LLaMA 2 implemented from scratch in PyTorch ☆322 Updated last year
- Building blocks for foundation models. ☆487 Updated last year
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆322 Updated 4 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs" ☆436 Updated 3 weeks ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆788 Updated this week
- A repository for research on medium sized language models. ☆495 Updated last week
- ☆192 Updated 2 months ago
- A project to improve skills of large language models ☆354 Updated this week
- The official evaluation suite and dynamic data release for MixEval. ☆238 Updated 5 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆837 Updated this week
- Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training ☆269 Updated last week