keskival / recursive-self-improvement-suite

A suite of open-ended, non-imitative tasks involving generalizable skills for large language model chatbots and agents to enable bootstrapped recursive self-improvement and an unambiguous AGI.

☆37

Alternatives and similar repositories for recursive-self-improvement-suite

Users who are interested in recursive-self-improvement-suite are comparing it to the repositories listed below.

conglu1997 / ACD
Automated Capability Discovery via Foundation Model Self-Exploration
☆57Updated 5 months ago
robjsliwa / llama-agent
Fun project to run your own LLM chat bot using llama.cpp
☆11Updated 2 years ago
microsoft / stop
Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation
☆44Updated last year
ExtensityAI / benchmark
Evaluation of neuro-symbolic engines
☆38Updated 11 months ago
LachlanGray / lmql-tree-of-thoughts
LMQL implementation of tree of thoughts
☆34Updated last year
yueqis / API-Based-Agent
☆51Updated 3 weeks ago
allenai / clin
☆83Updated last year
agiresearch / Formal-LLM
Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents
☆124Updated last year
ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆173Updated 4 months ago
collinskatie / checkmate
☆45Updated 9 months ago
letta-ai / sleep-time-compute
accompanying material for sleep-time compute paper
☆97Updated 2 months ago
google-deepmind / questbench
☆24Updated 2 months ago
METR / eval-analysis-public
Public repository containing METR's DVC pipeline for eval data analysis
☆78Updated 3 months ago
togethercomputer / Llama-2-7B-32K-Instruct
☆84Updated last year
GoodAI / goodai-ltm-benchmark
A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…
☆74Updated 7 months ago
allenai / discoveryworld
A virtual environment for developing and evaluating automated scientific discovery agents.
☆163Updated 4 months ago
HishamAlyahya / semantic_backprop
Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖
☆71Updated 7 months ago
ManifoldRG / AgentForge
The AgentForge project focuses on building general tooling to construct multicapability AI systems by composing skills and models togethe…
☆17Updated last year
SalesforceAIResearch / CodeTree
Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models
☆24Updated 3 months ago
qrdlgit / graph-of-thoughts
Based on the tree of thoughts paper
☆48Updated last year
yecchen / MIRAI
Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"
☆66Updated last year
oriyor / assistantbench
Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"
☆58Updated 7 months ago
interp-reasoning / thought-anchors
⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.
☆48Updated 2 weeks ago
scicode-bench / SciCode
A benchmark that challenges language models to code solutions for scientific problems
☆127Updated last week
plastic-labs / dspy-opentom
Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset
☆17Updated last year
asappresearch / webagents-step
☆40Updated 11 months ago
yale-nlp / SciArena
Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"
☆42Updated 2 weeks ago
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆88Updated 9 months ago
aorwall / moatless-tree-search
☆94Updated last month
ambroser53 / Prompt-Day-Care
A repository re-creating the PromptBreeder Evolutionary Algorithm from the DeepMind Paper in Python using LMQL as the backend.
☆27Updated last year