SakanaAI / natural_niches
The code repository of the paper: Competition and Attraction Improve Model Fusion
☆161Updated 2 months ago
Alternatives and similar repositories for natural_niches
Users who are interested in natural_niches are comparing it to the repositories listed below
- ☆103Updated 3 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆446Updated 2 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆102Updated 6 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆346Updated 4 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆97Updated this week
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".☆83Updated 7 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆204Updated last week
- Inference, Fine Tuning and many more recipes with Gemma family of models☆273Updated 3 months ago
- ☆158Updated 6 months ago
- Code for ExploreTom☆86Updated 4 months ago
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆225Updated last week
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated 10 months ago
- ☆79Updated 3 weeks ago
- The State Of The Art, intelligence☆154Updated 2 months ago
- An Automatic Prompt Optimization Framework for Large Language Models☆130Updated 2 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆27Updated last week
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated last month
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆190Updated 2 months ago
- Train your own SOTA deductive reasoning model☆108Updated 7 months ago
- ☆83Updated 2 months ago
- ☆110Updated last month
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆463Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆77Updated last month
- Analysis code for NeurIPS 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆53Updated 2 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆297Updated 2 months ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆682Updated last month
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆37Updated 2 weeks ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆479Updated last week
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆45Updated 3 months ago
- ☆300Updated 2 months ago