stanford-cs336 / spring2024-lectures
☆153Updated last week
Related projects ⓘ
Alternatives and complementary repositories for spring2024-lectures
- A bibliography and survey of the papers surrounding o1☆577Updated this week
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆251Updated last year
- LLM-Merging: Building LLMs Efficiently through Merging☆174Updated last month
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆156Updated 3 months ago
- Language models scale reliably with over-training and on downstream tasks☆94Updated 7 months ago
- The official evaluation suite and dynamic data release for MixEval.☆222Updated last week
- ☆63Updated last month
- A Survey on Data Selection for Language Models☆178Updated 3 weeks ago
- RuLES: a benchmark for evaluating rule-following in language models☆210Updated last month
- RewardBench: the first evaluation tool for reward models.☆424Updated 2 weeks ago
- Understand and test language model architectures on synthetic tasks.☆161Updated 6 months ago
- ☆89Updated 4 months ago
- ☆112Updated 3 months ago
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".☆120Updated 2 weeks ago
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆332Updated 2 months ago
- Can Language Models Solve Olympiad Programming?☆100Updated 3 months ago
- ☆149Updated 6 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆95Updated 2 months ago
- Building blocks for foundation models.☆386Updated 10 months ago
- This repository collects all relevant resources about interpretability in LLMs☆282Updated last week
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind☆168Updated last month
- A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)☆137Updated last month
- Official repository for ORPO☆420Updated 5 months ago
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆134Updated 4 months ago
- Scaling Data-Constrained Language Models☆321Updated last month
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆435Updated 7 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆153Updated 3 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆738Updated last week
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆302Updated 6 months ago
- ☆125Updated 9 months ago