allenai / clin
☆78Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for clin
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆75Updated last month
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆72Updated 10 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆39Updated last month
- ☆46Updated last week
- ☆112Updated last month
- ☆40Updated 2 weeks ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 9 months ago
- ☆22Updated 2 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆91Updated 3 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆97Updated last month
- ☆103Updated 3 months ago
- ☆37Updated 3 weeks ago
- augmented LLM with self reflection☆102Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆78Updated 8 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆110Updated 3 weeks ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆83Updated last week
- ☆137Updated 6 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆40Updated last month
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆46Updated last month
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆87Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆62Updated 5 months ago
- The first dense retrieval model that can be prompted like an LM☆63Updated 2 months ago
- ☆102Updated last month
- Scalable Meta-Evaluation of LLMs as Evaluators☆41Updated 9 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆128Updated 3 weeks ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆49Updated 8 months ago
- ☆35Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆113Updated last year
- Data preparation code for CrystalCoder 7B LLM☆42Updated 6 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆80Updated 2 months ago