CogComp / reasoning-eval
☆23Updated last year
Alternatives and similar repositories for reasoning-eval
Users who are interested in reasoning-eval are comparing it to the libraries listed below.
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆89Updated last week
- CAMEL framework-based multi-agent system for task-driven and dynamic environments☆105Updated last year
- 🚀 The LLM Automatic Computer Framework: L2MAC☆144Updated 11 months ago
- Q-Star Agent Code: A reinforcement learning-based framework for intelligent agents using Microsoft AutoGen. It leverages Q-Star, a Q-lear…☆86Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆72Updated last month
- LLM reads a paper and produces a working prototype☆60Updated 8 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆235Updated 9 months ago
- GraphRAG database - hybrid graph / vector db☆134Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 10 months ago
- Tutorial for DSPy☆25Updated last year
- ☆42Updated last year
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆58Updated 9 months ago
- Specification for creating reliable LLM-based conversational agents☆64Updated last month
- Shugok-AI is a Streamlit-based web application that makes AI research papers from arXiv more accessible by simplifying their academic lan…☆24Updated 11 months ago
- ☆43Updated last month
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆93Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆69Updated last year
- Analysis code for NeurIPS 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 4 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆124Updated 10 months ago
- Powerful Auto Research powered by LangChain and Anthropic.☆29Updated last year
- Open Agent Computer Interface☆89Updated last year
- ☆33Updated last year
- ☆45Updated last year
- This tool allows you to search ArXiv for scientific papers, extract their content, embed and chunk the text, and ask questions about them…☆34Updated last year
- Graphlit Platform☆25Updated last year
- ☆89Updated 10 months ago
- Simple Graph Memory for AI applications☆89Updated 7 months ago
- Dynamic Metadata based RAG Framework☆78Updated last week
- ☆62Updated last year