lee-ny / teaching_arithmetic
☆78Updated last year
Alternatives and similar repositories for teaching_arithmetic:
Users that are interested in teaching_arithmetic are comparing it to the libraries listed below
- ☆171Updated last year
- ☆89Updated last year
- ☆34Updated 10 months ago
- ☆51Updated 9 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆100Updated 10 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆115Updated 5 months ago
- Language models scale reliably with over-training and on downstream tasks☆96Updated 10 months ago
- ☆86Updated last week
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆70Updated 3 months ago
- ☆58Updated 9 months ago
- ☆80Updated 11 months ago
- ☆95Updated 7 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆121Updated 3 months ago
- A brief and partial summary of RLHF algorithms.☆93Updated 2 months ago
- [NeurIPS'24 Spotlight] Observational Scaling Laws☆50Updated 4 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆52Updated 10 months ago
- ☆79Updated 7 months ago
- ☆44Updated 6 months ago
- ☆46Updated last year
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆16Updated 3 months ago
- ☆133Updated 2 months ago
- Function Vectors in Large Language Models (ICLR 2024)☆138Updated 4 months ago
- ☆33Updated 3 months ago
- Understand and test language model architectures on synthetic tasks.☆181Updated last month
- A library for efficient patching and automatic circuit discovery.☆53Updated this week
- Replicating O1 inference-time scaling laws☆82Updated 2 months ago
- ☆109Updated 6 months ago
- ☆76Updated 6 months ago
- ☆114Updated 7 months ago
- nanoGPT-like codebase for LLM training☆89Updated this week