SakanaAI / drqLinks
Digital Red Queen: Adversarial Program Evolution in Core War with LLMs
☆176Updated 3 weeks ago
Alternatives and similar repositories for drq
Users that are interested in drq are comparing it to the libraries listed below
Sorting:
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 5 months ago
- ☆160Updated last month
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆151Updated this week
- ☆141Updated 4 months ago
- Codebase from our first release.☆43Updated last month
- Scaling Coding-Agent RL to 32x H100s. **Achieving 160% improvement** on Stanford's TerminalBench☆92Updated 3 months ago
- Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"☆126Updated 3 weeks ago
- look how they massacred my boy☆63Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆261Updated this week
- Automated Capability Discovery via Foundation Model Self-Exploration☆66Updated 11 months ago
- ☆39Updated 6 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 9 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- Code for Bolmo: Byteifying the Next Generation of Language Models☆117Updated last month
- Training Proactive and Personalized LLM Agents☆98Updated 3 weeks ago
- Marketplace ML experiment - training without backprop☆27Updated 5 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- SoTA Approach for ARC-AGI 2☆159Updated 4 months ago
- lossily compress representation vectors using product quantization☆59Updated 3 months ago
- ☆66Updated 11 months ago
- ☆40Updated last year
- explore token trajectory trees on instruct and base models☆150Updated 8 months ago
- ☆62Updated 7 months ago
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 7 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆199Updated last week
- 🧬 The Huxley-Gödel Machine☆324Updated this week
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication☆59Updated last year
- ☆67Updated 8 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆128Updated 4 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆56Updated 6 months ago