SakanaAI / self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,187 · Updated last year
Alternatives and similar repositories for self-adaptive-llms
Users who are interested in self-adaptive-llms compare it to the libraries listed below.
- Pretraining and inference code for a large-scale depth-recurrent language model ☆863 · Updated last month
- Code for BLT research paper ☆2,028 · Updated 3 months ago
- Continuous Thought Machines, because thought takes time and reasoning is a process. ☆1,750 · Updated last month
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,496 · Updated 6 months ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling ☆944 · Updated 2 months ago
- Recipes to scale inference-time compute of open models ☆1,124 · Updated 8 months ago
- dLLM: Simple Diffusion Language Modeling ☆1,716 · Updated last week
- Dream 7B, a large diffusion language model ☆1,164 · Updated 2 months ago
- Large Concept Models: Language modeling in a sentence representation space ☆2,333 · Updated last year
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆347 · Updated last year
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents ☆1,819 · Updated 6 months ago
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch ☆1,924 · Updated 2 weeks ago
- System 2 Reasoning Link Collection ☆870 · Updated 10 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input ☆940 · Updated 8 months ago
- OpenAI Frontier Evals ☆994 · Updated 2 months ago
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling ☆641 · Updated this week
- ☆1,033 · Updated last year
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering ☆1,301 · Updated 3 weeks ago
- Official PyTorch implementation for "Large Language Diffusion Models" ☆3,554 · Updated 3 months ago
- ☆1,388 · Updated 5 months ago
- OLMoE: Open Mixture-of-Experts Language Models ☆967 · Updated 4 months ago
- AlphaGo Moment for Model Architecture Discovery. ☆1,133 · Updated 2 months ago
- An Open Large Reasoning Model for Real-World Solutions ☆1,533 · Updated last week
- [ICLR 2025] Automated Design of Agentic Systems ☆1,513 · Updated last year
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards ☆1,332 · Updated 3 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆371 · Updated last year
- [COLM 2025] LIMO: Less is More for Reasoning ☆1,062 · Updated 6 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution ☆835 · Updated 2 weeks ago
- Self-Adapting Language Models ☆1,696 · Updated 6 months ago
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders. ☆863 · Updated 4 months ago