Agent-E3 / ExACT
☆14Updated 2 weeks ago
Alternatives and similar repositories for ExACT:
Users that are interested in ExACT are comparing it to the libraries listed below
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆33Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated this week
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆78Updated 3 weeks ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆45Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 6 months ago
- ☆40Updated last month
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆36Updated 3 months ago
- ☆65Updated 4 months ago
- ☆103Updated 2 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆83Updated last week
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆75Updated 2 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆52Updated 3 months ago
- NeurIPS 2024 tutorial on LLM Inference☆39Updated 3 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆46Updated 4 months ago
- ☆23Updated 6 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆95Updated 2 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆42Updated last year
- ☆20Updated 10 months ago
- ☆26Updated 8 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆52Updated last year
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆44Updated last month
- ☆60Updated 11 months ago
- o1 Chain of Thought Examples☆33Updated 5 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆17Updated 2 months ago
- Repository for Skill Set Optimization☆12Updated 8 months ago
- CodeUltraFeedback: aligning large language models to coding preferences☆71Updated 9 months ago
- ☆96Updated 9 months ago