allenai / clin
☆81 Updated last year
Alternatives and similar repositories for clin:
Users who are interested in clin are comparing it to the repositories listed below:
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆79 Updated 3 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆58 Updated last week
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models" ☆74 Updated last year
- augmented LLM with self reflection ☆110 Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆111 Updated 2 months ago
- ☆23 Updated 4 months ago
- ☆87 Updated last week
- ☆116 Updated 3 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ☆114 Updated 7 months ago
- ☆98 Updated last week
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (arXiv 2401.01335) ☆29 Updated 10 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆125 Updated 5 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆54 Updated 11 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆53 Updated 5 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap… ☆136 Updated last month
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆98 Updated 4 months ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated last year
- ☆140 Updated 8 months ago
- ☆120 Updated 7 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆66 Updated 7 months ago
- ☆47 Updated 2 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆47 Updated last month
- ☆148 Updated 2 weeks ago
- ☆52 Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ☆90 Updated last year
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A… ☆43 Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models ☆116 Updated last year
- A benchmark for evaluating learning agents based on just language feedback ☆64 Updated 3 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆107 Updated 8 months ago
- ☆94 Updated 7 months ago