allenai / chime
Repository containing dataset, models and code associated with the CHIME project
☆14 Updated 8 months ago
Alternatives and similar repositories for chime:
Users who are interested in chime are comparing it to the repositories listed below
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award … ☆40 Updated 6 months ago
- This repository contains ScholarQABench data and evaluation pipeline. ☆71 Updated 3 weeks ago
- This repository contains expert evaluation interface and data evaluation script for the OpenScholar project. ☆24 Updated 5 months ago
- ☆11 Updated 9 months ago
- Aioli: A unified optimization framework for language model data mixing ☆25 Updated 3 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆34 Updated last year
- ☆63 Updated last month
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆51 Updated last month
- ☆20 Updated 2 months ago
- ☆41 Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆55 Updated 8 months ago
- Code/data for MARG (multi-agent review generation) ☆43 Updated 5 months ago
- Code for Benchmarking Language Model Agents for Data-Driven Science ☆26 Updated 6 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆16 Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆84 Updated 5 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆19 Updated 4 months ago
- ☆27 Updated 3 weeks ago
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 3 months ago
- Groq-powered MAD: The first work to explore Multi-Agent Debate with Large Language Models :D ☆12 Updated 10 months ago
- Reasoning by Communicating with Agents ☆28 Updated last week
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data ☆35 Updated 2 months ago
- Plug in and Play implementation of "Certified Reasoning with Language Models" that elevates model reasoning by 40% ☆17 Updated last year
- ☆19 Updated last month
- ☆15 Updated last month
- Based on the tree of thoughts paper ☆48 Updated last year
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models" ☆27 Updated 2 months ago
- ☆13 Updated 4 months ago
- ☆43 Updated 6 months ago
- examples and guides to using Nomic Atlas ☆34 Updated 2 weeks ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆90 Updated 3 months ago