SakanaAI / natural_nichesLinks
The code repository of the paper: Competition and Attraction Improve Model Fusion
☆70Updated this week
Alternatives and similar repositories for natural_niches
Users that are interested in natural_niches are comparing it to the libraries listed below
Sorting:
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1☆21Updated last week
- ☆94Updated last month
- Repository to create traveling waves integrate special information through time☆54Updated 2 weeks ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆88Updated this week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 6 months ago
- Lego for GRPO☆28Updated 3 months ago
- ☆17Updated 6 months ago
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆47Updated 3 weeks ago
- LLM reads a paper and produce a working prototype☆57Updated 4 months ago
- Simple GRPO scripts and configurations.☆59Updated 6 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆73Updated 8 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆97Updated 3 weeks ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆51Updated 10 months ago
- Source code and utilities for the Genesys distributed language model architecture discovery system.☆47Updated last month
- ☆34Updated 3 weeks ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆31Updated last week
- ☆19Updated 5 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆34Updated 3 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆73Updated 5 months ago
- ☆55Updated 2 months ago
- The Library for LLM-based multi-agent applications☆92Updated last month
- ☆21Updated 9 months ago
- ☆155Updated 4 months ago
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated last week
- ☆65Updated 2 weeks ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 10 months ago
- ☆15Updated last month
- An AI character interaction system with emotional modeling and advanced memory management☆16Updated 10 months ago
- ☆50Updated last week
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 3 months ago