SakanaAI / self-adaptive-llms
A self-adaptation framework that adapts LLMs for unseen tasks in real time!
☆977 · Updated last month
Alternatives and similar repositories for self-adaptive-llms:
Users who are interested in self-adaptive-llms compare it to the libraries listed below.
- Training Large Language Model to Reason in a Continuous Latent Space ☆926 · Updated last month
- [ICLR 2025] Automated Design of Agentic Systems ☆1,200 · Updated last month
- ☆1,006 · Updated 2 months ago
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling" ☆849 · Updated last week
- An Open Large Reasoning Model for Real-World Solutions ☆1,465 · Updated 3 months ago
- Large Concept Models: Language modeling in a sentence representation space ☆1,949 · Updated last month
- Recipes to scale inference-time compute of open models ☆1,019 · Updated last week
- System 2 Reasoning Link Collection ☆806 · Updated 3 weeks ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆294 · Updated 4 months ago
- Code for BLT research paper ☆1,417 · Updated this week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering ☆626 · Updated last month
- Optimizing inference proxy for LLMs ☆2,070 · Updated this week
- VPTQ, A Flexible and Extreme low-bit quantization algorithm ☆593 · Updated this week
- prime is a framework for efficient, globally distributed training of AI models over the internet. ☆663 · Updated this week
- ☆1,338 · Updated 3 months ago
- OLMoE: Open Mixture-of-Experts Language Models ☆634 · Updated 2 months ago
- Sky-T1: Train your own O1 preview model within $450 ☆3,037 · Updated this week
- Synthetic data curation for post-training and structured data extraction ☆901 · Updated this week
- Code for Quiet-STaR ☆716 · Updated 6 months ago
- AIDE: AI-Driven Exploration in the Space of Code. State-of-the-art machine learning engineering agent that automates AI R&D. ☆762 · Updated this week
- Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch ☆1,151 · Updated this week
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,255 · Updated 2 weeks ago