aymeric-roucher / agent_reasoning_benchmark
🔧 Compare how Agent systems perform on several benchmarks. 📊🚀
☆89Updated 5 months ago
Alternatives and similar repositories for agent_reasoning_benchmark:
Users that are interested in agent_reasoning_benchmark are comparing it to the libraries listed below
- ☆117Updated 7 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀☆103Updated last month
- ☆160Updated 7 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆104Updated 6 months ago
- Codebase accompanying the Summary of a Haystack paper.☆75Updated 6 months ago
- Simple examples using Argilla tools to build AI☆53Updated 4 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆107Updated 9 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆121Updated 9 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆66Updated 8 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆207Updated 4 months ago
- ☆114Updated 6 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆106Updated last month
- ☆74Updated last year
- ☆33Updated 8 months ago
- AWM: Agent Workflow Memory☆252Updated last month