microsoft / ToRA
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
☆980Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for ToRA
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,364Updated last year
- Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".☆765Updated 7 months ago
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,045Updated 6 months ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,336Updated 7 months ago
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆1,667Updated last year
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.☆1,529Updated last week
- YaRN: Efficient Context Window Extension of Large Language Models☆1,353Updated 7 months ago
- Code for Quiet-STaR☆651Updated 3 months ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆2,224Updated last week
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆582Updated last year
- A library for advanced large language model reasoning☆1,442Updated last week
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆675Updated 7 months ago
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models☆835Updated 7 months ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.☆625Updated last month
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)☆332Updated 2 months ago
- LOMO: LOw-Memory Optimization☆979Updated 4 months ago
- ☆1,271Updated this week
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆536Updated last year
- ☆708Updated 5 months ago
- prompt2model - Generate Deployable Models from Natural Language Instructions☆1,964Updated 6 months ago
- Codebase for Merging Language Models (ICML 2024)☆774Updated 6 months ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆886Updated 3 weeks ago
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆782Updated 4 months ago
- 🩹Editing large language models within 10 seconds⚡☆1,284Updated last year
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆1,008Updated 10 months ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning☆613Updated 5 months ago
- kani (カニ) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)☆558Updated this week
- Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"☆2,178Updated last month
- [ACL 2024] Progressive LLaMA with Block Expansion.☆478Updated 6 months ago
- Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment☆1,021Updated 5 months ago