allenai / clinLinks
☆84Updated last year
Alternatives and similar repositories for clin
Users that are interested in clin are comparing it to the libraries listed below
Sorting:
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆87Updated last year
- ☆24Updated 10 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆59Updated 8 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆140Updated 8 months ago
- ☆119Updated 5 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆97Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆120Updated 9 months ago
- ☆125Updated 10 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆125Updated last year
- ☆34Updated 2 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆104Updated 2 weeks ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆113Updated last year
- ☆20Updated last month
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 7 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆36Updated last year
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆156Updated 6 months ago
- ☆122Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆150Updated 6 months ago
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking☆268Updated last year
- ☆143Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆99Updated 2 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- ☆54Updated last month
- accompanying material for sleep-time compute paper☆102Updated 3 months ago
- ☆46Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆69Updated last year
- ☆41Updated last year
- Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆98Updated last month
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆109Updated 10 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆311Updated 9 months ago