allenai / chime
Repository containing dataset, models and code associated with the CHIME project
☆16Updated last year
Alternatives and similar repositories for chime
Users that are interested in chime are comparing it to the libraries listed below
- ☆43Updated last year
- Welcome to ResearchAgent ! A personal research assistant powered by GPT-3.5/GPT-4. You can ask follow up questions. Get source details o…☆36Updated 2 years ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- Dataset and annotations for ASSETS 2022 publication☆12Updated 3 years ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- Groq-powered MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆12Updated last year
- [ICML 2024 Oral] A framework for society simulation that supports complex simulation, for example: multi-scene.☆84Updated last year
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆53Updated 6 months ago
- ☆39Updated last year
- Evaluating LLMs with fewer examples☆169Updated last year
- MIRIAD is a million-scale Medical Instruction and Retrieval Dataset☆139Updated last month
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆122Updated 4 months ago
- Open Implementations of LLM Analyses☆107Updated last year
- Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Updated 7 months ago
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t…☆101Updated 2 months ago
- Specification for creating reliable LLM-based conversational agents☆63Updated 3 months ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset☆31Updated 9 months ago
- CLI that uses DSPy to interact with MCP servers.☆23Updated 10 months ago
- Analysis code for NeurIPS 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 5 months ago
- Frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆220Updated 5 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆114Updated 9 months ago
- Discovering Data-driven Hypotheses in the Wild☆127Updated 7 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆93Updated 5 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 11 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆141Updated last year
- ☆61Updated 6 months ago
- 🤝 The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"☆106Updated 9 months ago
- ☆24Updated 10 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Updated last year