SakanaAI / self-adaptive-llms
A Self-adaptation Framework that adapts LLMs for unseen tasks in real-time!
★1,058, updated 3 months ago
Alternatives and similar repositories for self-adaptive-llms
Users who are interested in self-adaptive-llms are comparing it to the libraries listed below.
- Code for BLT research paper (★1,587, updated this week)
- Dream 7B, a large diffusion language model (★630, updated 2 weeks ago)
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch (★1,323, updated last month)
- Official PyTorch implementation for "Large Language Diffusion Models" (★1,592, updated last week)
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling (★871, updated 2 weeks ago)
- Verifiers for LLM Reinforcement Learning (★953, updated this week)
- Pretraining code for a large-scale depth-recurrent language model (★760, updated last month)
- Recipes to scale inference-time compute of open models (★1,071, updated last week)
- Training Large Language Model to Reason in a Continuous Latent Space (★1,109, updated 3 months ago)
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders. (★749, updated last month)
- Continuous Thought Machines, because thought takes time and reasoning is a process. (★492, updated this week)
- Large Concept Models: Language modeling in a sentence representation space (★2,153, updated 3 months ago)
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. (★307, updated 6 months ago)
- An Open Large Reasoning Model for Real-World Solutions (★1,488, updated 2 months ago)
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering (★703, updated last week)
- [ICLR 2025] Automated Design of Agentic Systems (★1,286, updated 3 months ago)
- Releases from OpenAI Preparedness (★736, updated this week)
- Muon is Scalable for LLM Training (★1,044, updated last month)
- prime is a framework for efficient, globally distributed training of AI models over the internet. (★743, updated last week)
- System 2 Reasoning Link Collection (★833, updated 2 months ago)
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… (★327, updated 5 months ago)
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" (★517, updated 2 months ago)
- Build your own visual reasoning model (★362, updated this week)
- An Open Source Toolkit For LLM Distillation (★596, updated 2 weeks ago)
- LIMO: Less is More for Reasoning (★940, updated last month)
- nanoGPT style version of Llama 3.1 (★1,367, updated 9 months ago)
- Official Repository of Absolute Zero Reasoner (★829, updated this week)
- Atom of Thoughts for Markov LLM Test-Time Scaling (★563, updated this week)
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends (★1,516, updated last week)
- A reading list on LLM based Synthetic Data Generation 🔥 (★1,265, updated 2 months ago)