GFNOrg / gfn-lm-tuning
☆127 Updated 10 months ago
Related projects
Alternatives and complementary repositories for gfn-lm-tuning
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆97 Updated 2 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment" ☆80 Updated last week
- ☆73 Updated 4 months ago
- ☆76 Updated 9 months ago
- ☆81 Updated last year
- Function Vectors in Large Language Models (ICLR 2024) ☆119 Updated last month
- ☆50 Updated 6 months ago
- ☆70 Updated last year
- ☆98 Updated 3 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL" ☆105 Updated 7 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆84 Updated 7 months ago
- Language models scale reliably with over-training and on downstream tasks ☆94 Updated 7 months ago
- A MAD laboratory to improve AI architecture designs 🧪 ☆95 Updated 6 months ago
- ☆90 Updated 4 months ago
- ☆107 Updated this week
- ☆105 Updated last month
- A repository for transformer critique learning and generation ☆86 Updated 11 months ago
- ☆69 Updated 8 months ago
- ☆44 Updated last year
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging ☆98 Updated last year
- Can Language Models Solve Olympiad Programming? ☆100 Updated 3 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment ☆46 Updated 5 months ago
- ☆71 Updated 3 months ago
- NanoGPT-like codebase for LLM training ☆75 Updated this week
- ☆114 Updated 4 months ago
- A library for efficient patching and automatic circuit discovery. ☆31 Updated last month
- GenRM-CoT: Data release for verification rationales ☆23 Updated last month
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆48 Updated 7 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆91 Updated 3 months ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023 ☆127 Updated 6 months ago