allenai / codenav
CodeNav is an LLM agent that navigates and leverages previously unseen code repositories to solve user queries.
☆32 Updated 4 months ago
Alternatives and similar repositories for codenav:
Users who are interested in codenav are comparing it to the repositories listed below.
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆47 Updated last month
- ☆38 Updated 5 months ago
- ☆47 Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆53 Updated 4 months ago
- Small, simple agent task environments for training and evaluation ☆18 Updated 2 months ago
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of … ☆21 Updated 3 months ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆51 Updated this week
- ☆30 Updated 6 months ago
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models" ☆73 Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆66 Updated 6 months ago
- Functional Benchmarks and the Reasoning Gap ☆82 Updated 3 months ago
- ☆52 Updated this week
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆40 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆100 Updated last month
- ☆31 Updated 7 months ago
- ☆74 Updated last year
- ☆81 Updated last year
- ☆35 Updated last year
- ☆46 Updated 2 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆49 Updated 10 months ago
- ☆20 Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM ☆43 Updated 8 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆100 Updated last week
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated 10 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆26 Updated last month
- Train, tune, and infer Bamba model ☆76 Updated this week
- A repository re-creating the PromptBreeder Evolutionary Algorithm from the DeepMind Paper in Python using LMQL as the backend. ☆27 Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆68 Updated 3 months ago
- Automatic Prompt Optimization ☆25 Updated 8 months ago