allenai / codenav
CodeNav is an LLM agent that navigates and leverages previously unseen code repositories to solve user queries.
☆64 Updated last year
Alternatives and similar repositories for codenav
Users that are interested in codenav are comparing it to the libraries listed below
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆66 Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆43 Updated 2 years ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆76 Updated last year
- ☆41 Updated last year
- ☆86 Updated last year
- Systematic evaluation framework that automatically rates overthinking behavior in large language models. ☆94 Updated 6 months ago
- ☆136 Updated 2 years ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆38 Updated last year
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ☆131 Updated last year
- accompanying material for sleep-time compute paper ☆118 Updated 7 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you… ☆81 Updated 11 months ago
- ☆49 Updated last year
- ☆35 Updated 6 months ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆110 Updated 11 months ago
- SWE Arena ☆35 Updated 5 months ago
- ☆28 Updated 3 weeks ago
- ☆105 Updated 11 months ago
- ☆77 Updated 2 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents ☆55 Updated 4 months ago
- ☆62 Updated 5 months ago
- ☆41 Updated 6 months ago
- Functional Benchmarks and the Reasoning Gap ☆90 Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆61 Updated 7 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆66 Updated 2 years ago
- ☆126 Updated 6 months ago
- ☆55 Updated last year
- The first dense retrieval model that can be prompted like an LM ☆89 Updated 7 months ago
- Evaluating LLMs with fewer examples ☆169 Updated last year
- ☆88 Updated last month
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 10 months ago