SakanaAI / natural_nichesLinks
The code repository of the paper: Competition and Attraction Improve Model Fusion
☆168Updated 4 months ago
Alternatives and similar repositories for natural_niches
Users that are interested in natural_niches are comparing it to the libraries listed below
Sorting:
- ☆105Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆246Updated this week
- Source code for the collaborative reasoner research project at Meta FAIR.☆111Updated 8 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆353Updated 6 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆451Updated 4 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".☆85Updated 9 months ago
- Code for ExploreTom☆89Updated 6 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"☆114Updated last week
- 🧬 The Huxley-Gödel Machine☆314Updated last month
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago
- ☆159Updated 8 months ago
- ☆36Updated 4 months ago
- ☆79Updated 2 months ago
- ☆93Updated last month
- RLP: Reinforcement as a Pretraining Objective☆218Updated 2 months ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆46Updated 5 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆290Updated 2 months ago
- ☆145Updated last week
- OpenTinker is an RL-as-a-Service infrastructure for foundation models☆229Updated this week
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆244Updated last month
- Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"☆327Updated last month
- The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"☆304Updated this week
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆301Updated last week
- Code for Bolmo: Byteifying the Next Generation of Language Models☆109Updated this week
- Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, and full end-to-end reference exampl…☆246Updated this week
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆40Updated 2 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆37Updated last month
- Simple & Scalable Pretraining for Neural Architecture Research☆305Updated 3 weeks ago