SakanaAI / natural_nichesLinks
The code repository of the paper: Competition and Attraction Improve Model Fusion
☆167Updated 3 months ago
Alternatives and similar repositories for natural_niches
Users that are interested in natural_niches are comparing it to the libraries listed below
Sorting:
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆225Updated this week
- ☆105Updated 5 months ago
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".☆85Updated 8 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆110Updated 7 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆274Updated 4 months ago
- ☆158Updated 7 months ago
- ☆79Updated 2 months ago
- Code for ExploreTom☆88Updated 5 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆352Updated 5 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆450Updated 3 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆100Updated 3 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆278Updated last month
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!☆132Updated 2 months ago
- RLP: Reinforcement as a Pretraining Objective☆205Updated 2 months ago
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆206Updated 3 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆36Updated 2 weeks ago
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.☆289Updated this week
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆40Updated last month
- Simple & Scalable Pretraining for Neural Architecture Research☆304Updated last month
- 🧬 The Huxley-Gödel Machine☆305Updated last week
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆45Updated 4 months ago
- A Reproduction of GDM's Nested Learning Paper☆298Updated this week
- accompanying material for sleep-time compute paper☆118Updated 7 months ago
- open source alpha evolve☆67Updated 6 months ago
- ☆88Updated last month
- Latent Collaboration in Multi-Agent Systems (LatentMAS)☆491Updated this week
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆237Updated 3 weeks ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆479Updated 3 months ago