CogComp / reasoning-evalLinks
☆23Updated last year
Alternatives and similar repositories for reasoning-eval
Users that are interested in reasoning-eval are comparing it to the libraries listed below
Sorting:
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆90Updated last month
- CAMEL framework-based multi-agent system for task-driven and dynamic environments☆105Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated last year
- Q-Star Agent Code: A reinforcement learning-based framework for intelligent agents using Microsoft AutoGen. It leverages Q-Star, a Q-lear…☆90Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆128Updated 11 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆72Updated 2 months ago
- A collection of interesting links, articles, research papers and projects related to knowledge graphs, GenAI and LLMs (large language mod…☆26Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆69Updated last year
- LLM reads a paper and produce a working prototype☆60Updated 9 months ago
- Tutorial for DSPy☆26Updated last year
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆93Updated last year
- Shugok-AI is a Streamlit-based web application that makes AI research papers from arXiv more accessible by simplifying their academic lan…☆24Updated last year
- 🚀 The LLM Automatic Computer Framework: L2MAC☆145Updated last year
- ☆33Updated last year
- GraphRAG database - hybrid graph / vector db☆134Updated last year
- The Library for LLM-based multi-agent applications☆103Updated 6 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆99Updated 3 months ago
- ☆61Updated 7 months ago
- ☆39Updated last year
- ☆40Updated last year
- ☆43Updated 2 months ago
- ☆90Updated 11 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆56Updated 5 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆82Updated last year
- ☆30Updated last year
- Codebase from our first release.☆41Updated 3 weeks ago
- ☆80Updated 4 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆60Updated 11 months ago
- ☆45Updated last year
- Open Agent Computer Interface☆91Updated last year