Arvid-pku / Godel_Agent
GΓΆdel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement
β70Updated last week
Alternatives and similar repositories for Godel_Agent:
Users that are interested in Godel_Agent are comparing it to the libraries listed below
- Repository for the paper Stream of Search: Learning to Search in Languageβ125Updated 5 months ago
- π Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Papβ¦β136Updated last month
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasonersβ68Updated 3 weeks ago
- β148Updated 2 weeks ago
- β87Updated last week
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gymβ251Updated 2 weeks ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.β156Updated 3 months ago
- β98Updated last week
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.β158Updated 2 weeks ago
- β42Updated 11 months ago
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)β50Updated 5 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"β98Updated 4 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"β47Updated last month
- Evaluating LLMs with CommonGen-Liteβ88Updated 10 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'β177Updated last month
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examplesβ58Updated 2 weeks ago
- AWM: Agent Workflow Memoryβ233Updated 2 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"β79Updated 3 months ago
- β69Updated 2 weeks ago
- β120Updated 7 months ago
- Benchmarking LLMs with Challenging Tasks from Real Usersβ210Updated 2 months ago
- β56Updated last week
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agentsβ114Updated 7 months ago
- β38Updated 6 months ago
- A simple unified framework for evaluating LLMsβ172Updated this week
- This is the official repository for Inheritune.β109Updated 3 months ago
- UGround: Universal GUI Visual Grounding for GUI Agentsβ147Updated this week
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"β50Updated 3 months ago
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planningβ203Updated 2 weeks ago