keskival / recursive-self-improvement-suiteLinks
A suite of open-ended, non-imitative tasks involving generalizable skills for large language model chatbots and agents to enable bootstrapped recursive self-improvement and an unambiguous AGI.
☆37Updated 5 months ago
Alternatives and similar repositories for recursive-self-improvement-suite
Users that are interested in recursive-self-improvement-suite are comparing it to the libraries listed below
Sorting:
- Automated Capability Discovery via Foundation Model Self-Exploration☆57Updated 5 months ago
- Fun project to run your own LLM chat bot using llama.cpp☆11Updated 2 years ago
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆44Updated last year
- Evaluation of neuro-symbolic engines☆38Updated 11 months ago
- LMQL implementation of tree of thoughts☆34Updated last year
- ☆51Updated 3 weeks ago
- ☆83Updated last year
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆124Updated last year
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆173Updated 4 months ago
- ☆45Updated 9 months ago
- accompanying material for sleep-time compute paper☆97Updated 2 months ago
- ☆24Updated 2 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆78Updated 3 months ago
- ☆84Updated last year
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆74Updated 7 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents.☆163Updated 4 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆71Updated 7 months ago
- The AgentForge project focuses on building general tooling to construct multicapability AI systems by composing skills and models togethe…☆17Updated last year
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆24Updated 3 months ago
- Based on the tree of thoughts paper☆48Updated last year
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆66Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆58Updated 7 months ago
- ⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆48Updated 2 weeks ago
- A benchmark that challenges language models to code solutions for scientific problems☆127Updated last week
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆17Updated last year
- ☆40Updated 11 months ago
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆42Updated 2 weeks ago
- Functional Benchmarks and the Reasoning Gap☆88Updated 9 months ago
- ☆94Updated last month
- A repository re-creating the PromptBreeder Evolutionary Algorithm from the DeepMind Paper in Python using LMQL as the backend.☆27Updated last year