Arvid-pku / Godel_Agent
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement
☆44 · Updated last week
Related projects
Alternatives and complementary repositories for Godel_Agent
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆38 · Updated 3 weeks ago
- Evaluating LLMs with CommonGen-Lite ☆84 · Updated 7 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆166 · Updated this week
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆73 · Updated 2 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆62 · Updated last month
- Official homepage for "Self-Harmonized Chain of Thought" ☆83 · Updated last month
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆62 · Updated this week
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ☆109 · Updated 4 months ago
- entropix style sampling + GUI ☆25 · Updated last week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆119 · Updated 3 weeks ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆64 · Updated 4 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆105 · Updated 2 weeks ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆84 · Updated 3 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 · Updated 10 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization' ☆160 · Updated last month
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap… ☆106 · Updated 2 weeks ago
- Data preparation code for CrystalCoder 7B LLM ☆42 · Updated 6 months ago
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models" ☆71 · Updated 10 months ago
- Functional Benchmarks and the Reasoning Gap ☆78 · Updated last month
- LLM reads a paper and produces a working prototype ☆33 · Updated this week
- The first dense retrieval model that can be prompted like an LM ☆62 · Updated last month