gabegrand / liloLinks
LILO: Library Induction with Language Observations
☆90Updated last year
Alternatives and similar repositories for lilo
Users that are interested in lilo are comparing it to the libraries listed below
Sorting:
- ☆105Updated last year
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆132Updated last year
- Evaluating LLMs with CommonGen-Lite☆93Updated last year
- Prototype advanced LLM algorithms for reasoning and planning.☆99Updated last year
- ☆139Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆43Updated 2 years ago
- Mixing Language Models with Self-Verification and Meta-Verification☆112Updated last year
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆62Updated 10 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆152Updated last year
- ☆80Updated 10 months ago
- LLM verified with Monte Carlo Tree Search☆284Updated 10 months ago
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- ☆87Updated 2 years ago
- Commit0: Library Generation from Scratch☆177Updated 8 months ago
- ☆41Updated last year
- Can Language Models Solve Olympiad Programming?☆123Updated last year
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆82Updated last year
- ☆123Updated 11 months ago
- Multimodal computer agent data collection program☆161Updated 2 months ago
- ☆144Updated last year
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆90Updated 2 years ago
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆49Updated 2 years ago
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated 2 years ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆101Updated 2 years ago
- ☆216Updated 2 years ago
- Track the progress of LLM context utilisation☆55Updated 9 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Updated 9 months ago
- A codebase for "Language Models can Solve Computer Tasks"☆239Updated last year
- ☆119Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆69Updated last year