conglu1997 / ACD
Automated Capability Discovery via Foundation Model Self-Exploration
☆45Updated 2 months ago
Alternatives and similar repositories for ACD:
Users that are interested in ACD are comparing it to the libraries listed below
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆55Updated 2 months ago
- ☆38Updated 9 months ago
- Collection of LLM completions for reasoning-gym task datasets☆19Updated this week
- Skill Design From AI Feedback☆28Updated 2 months ago
- look how they massacred my boy☆63Updated 6 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated 11 months ago
- ☆20Updated 6 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆145Updated 3 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 2 months ago
- ☆11Updated 9 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆51Updated last month
- accompanying material for sleep-time compute paper☆73Updated this week
- EvaByte: Efficient Byte-level Language Models at Scale☆91Updated 2 weeks ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆68Updated 2 months ago
- A repository for training nanogpt-based Chess playing language models.☆24Updated last year
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆43Updated 8 months ago
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆41Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated 2 weeks ago
- ☆82Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 3 months ago
- Memoria is a human-inspired memory architecture for neural networks.☆70Updated 6 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 6 months ago
- ☆27Updated 8 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆45Updated 3 months ago
- Lego for GRPO☆27Updated last month
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆65Updated last month
- LLM reads a paper and produce a working prototype☆52Updated 3 weeks ago
- ☆130Updated last month
- ☆80Updated 3 months ago