SakanaAI / drqLinks
Digital Red Queen: Adversarial Program Evolution in Core War with LLMs
☆173Updated 2 weeks ago
Alternatives and similar repositories for drq
Users that are interested in drq are comparing it to the libraries listed below
Sorting:
- look how they massacred my boy☆63Updated last year
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆143Updated this week
- ☆159Updated last month
- Automated Capability Discovery via Foundation Model Self-Exploration☆66Updated 11 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 8 months ago
- lossily compress representation vectors using product quantization☆59Updated 3 months ago
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication☆59Updated last year
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 5 months ago
- Code for Bolmo: Byteifying the Next Generation of Language Models☆115Updated last month
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- ☆40Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis☆186Updated last week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆260Updated last week
- explore token trajectory trees on instruct and base models☆150Updated 8 months ago
- Approximating the joint distribution of language models via MCTS☆22Updated last year
- ☆38Updated 5 months ago
- ☆82Updated 4 months ago
- ☆64Updated 10 months ago
- Ludic – an LLM-RL library for the era of experience☆55Updated 3 weeks ago
- Lego for GRPO☆30Updated 8 months ago
- 🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architectu…☆27Updated 6 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆101Updated 6 months ago
- Scaling Coding-Agent RL to 32x H100s. **Achieving 160% improvement** on Stanford's TerminalBench☆91Updated 2 months ago
- Pivotal Token Search☆144Updated last month
- 🧬 The Huxley-Gödel Machine☆319Updated 2 months ago
- Codebase from our first release.☆41Updated 3 weeks ago
- ☆62Updated 6 months ago
- Repository to create traveling waves integrate special information through time☆56Updated 5 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆61Updated last year