conglu1997 / ACDLinks

Automated Capability Discovery via Foundation Model Self-Exploration

☆65

Alternatives and similar repositories for ACD

Users that are interested in ACD are comparing it to the libraries listed below

Sorting:

conglu1997 / intelligent-go-explore
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
☆65Updated 9 months ago
jennyzzt / omni
OMNI: Open-endedness via Models of human Notions of Interestingness
☆57Updated 10 months ago
maxencefaldor / omni-epic
OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).
☆71Updated 11 months ago
xjdr-alt / muzero_sketch
☆40Updated last year
mklissa / maestromotif
Skill Design From AI Feedback
☆32Updated 9 months ago
microsoft / stop
Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation
☆49Updated last year
open-thought / reasoning-gym-eval
Collection of LLM completions for reasoning-gym task datasets
☆30Updated 5 months ago
google-deepmind / questbench
☆34Updated 6 months ago
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆151Updated 10 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆107Updated 9 months ago
adamkarvonen / chess_gpt_eval
A repo to evaluate various LLM's chess playing abilities.
☆85Updated last year
ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆189Updated 9 months ago
google-deepmind / mishax
☆144Updated 3 months ago
METR / eval-analysis-public
Public repository containing METR's DVC pipeline for eval data analysis
☆140Updated 8 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated last year
goncalorafaria / qalign
QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.
☆25Updated 3 weeks ago
facebookresearch / motif
Intrinsic Motivation from Artificial Intelligence Feedback
☆133Updated 2 years ago
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆99Updated 4 months ago
Danau5tin / calculator_agent_rl
Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.
☆60Updated 7 months ago
arcprize / ARC-AGI-3-Agents
☆99Updated 2 months ago
keskival / recursive-self-improvement-suite
A suite of open-ended, non-imitative tasks involving generalizable skills for large language model chatbots and agents to enable bootstra…
☆41Updated 10 months ago
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆164Updated 7 months ago
gkamradt / SnakeBench
☆97Updated this week
allenai / discoveryworld
A virtual environment for developing and evaluating automated scientific discovery agents.
☆192Updated 8 months ago
doomslide / autoloom
Approximating the joint distribution of language models via MCTS
☆22Updated last year
huggingface / jat
General multi-task deep RL Agent
☆185Updated last year
VsonicV / es-fine-tuning-paper
This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"
☆266Updated 2 weeks ago
ZeroSumEval / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆34Updated 7 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆72Updated 7 months ago
PrimeIntellect-ai / prime-environments
Training-Ready RL Environments + Evals
☆185Updated this week