allenai / chimeLinks
Repository containing dataset, models and code associated with the CHIME project
☆15Updated 9 months ago
Alternatives and similar repositories for chime
Users that are interested in chime are comparing it to the libraries listed below
Sorting:
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆54Updated 3 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆105Updated 7 months ago
- Automated Qualitative Analysis of LLMs (ICLR 2025)☆38Updated 2 months ago
- Code/data for MARG (multi-agent review generation)☆43Updated 6 months ago
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"☆20Updated last month
- ☆16Updated 5 months ago
- Groq-powered MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆12Updated 11 months ago
- This repository contains expert evaluation interface and data evaluation script for the OpenScholar project.☆24Updated 6 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆34Updated last year
- ☆21Updated 3 months ago
- ☆41Updated 5 months ago
- CLI that uses DSPy to interact with MCP servers.☆17Updated 2 months ago
- ☆65Updated 2 months ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 4 months ago
- ☆26Updated 11 months ago
- Dataset and annotations for ASSETS 2022 publication☆12Updated 2 years ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆54Updated 2 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆87Updated last month
- Autonomous Generalist Scientist / AI Scientist / Agent Scientist / Robot Scientist☆17Updated 2 weeks ago
- ☆24Updated 8 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆89Updated 6 months ago
- Code for Benchmarking Language Model Agents for Data-Driven Science☆26Updated 7 months ago
- Easiest way to build custom agents, in a no-code notion style editor, using simple macros.☆27Updated 7 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆41Updated 7 months ago
- Code repo for MathAgent☆16Updated last year
- ☆34Updated last week
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆52Updated 2 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆37Updated 3 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆72Updated last month
- ☆40Updated last week