allenai / recoma
Reasoning by Communicating with Agents
☆19Updated last month
Related projects: ⓘ
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆33Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆42Updated 10 months ago
- Repository for Skill Set Optimization☆12Updated last month
- A library for computing diverse text characteristics and using them to analyze data sets and models with ease.☆39Updated 2 years ago
- ☆52Updated 7 months ago
- ☆17Updated 6 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆27Updated this week
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- ☆43Updated 11 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆40Updated 6 months ago
- Code and Data for the NAACL 24 paper: MacGyver: Are Large Language Models Creative Problem Solvers?☆21Updated 5 months ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆42Updated 8 months ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆33Updated last week
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆11Updated 10 months ago
- ☆13Updated 3 months ago
- Efficient Memory-Augmented Transformers☆34Updated last year
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆25Updated last year
- Code for the paper "LASER: LLM Agent with State-Space Exploration for Web Navigation"☆31Updated 11 months ago
- Apps built using Inspired Cognition's Critique.☆58Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆34Updated 3 weeks ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆27Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Retrieval Augmented Generation Generalized Evaluation Dataset☆51Updated this week
- Repository for paper Tools Are Instrumental for Language Agents in Complex Environments☆32Updated 8 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆44Updated 8 months ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆51Updated last year
- ☆18Updated 3 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆39Updated 7 months ago
- ☆38Updated 5 months ago
- A unified benchmark for math reasoning☆87Updated last year