keskival / recursive-self-improvement-suiteLinks
A suite of open-ended, non-imitative tasks involving generalizable skills for large language model chatbots and agents to enable bootstrapped recursive self-improvement and an unambiguous AGI.
☆39Updated 7 months ago
Alternatives and similar repositories for recursive-self-improvement-suite
Users that are interested in recursive-self-improvement-suite are comparing it to the libraries listed below
Sorting:
- Automated Capability Discovery via Foundation Model Self-Exploration☆63Updated 6 months ago
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆44Updated last year
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆127Updated last year
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆76Updated 8 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents.☆180Updated 5 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆179Updated 5 months ago
- accompanying material for sleep-time compute paper☆107Updated 4 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆150Updated 6 months ago
- ☆98Updated 4 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆98Updated this week
- ☆56Updated 2 months ago
- LMQL implementation of tree of thoughts☆34Updated last year
- ☆41Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆18Updated last year
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆229Updated last month
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆57Updated 2 months ago
- Evaluation of neuro-symbolic engines☆39Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆75Updated 8 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆61Updated 4 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆69Updated last year
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated this week
- A benchmark that challenges language models to code solutions for scientific problems☆132Updated last week
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆88Updated 11 months ago
- ☆84Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆104Updated last month
- 🤝 The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"☆95Updated 4 months ago
- ☆98Updated 11 months ago
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆47Updated 3 weeks ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆185Updated this week
- ⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆70Updated last week