A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,189Jan 30, 2025Updated last year
Alternatives and similar repositories for self-adaptive-llms
Users that are interested in self-adaptive-llms are comparing it to the libraries listed below
Sorting:
- Code for BLT research paper☆2,029Nov 3, 2025Updated 4 months ago
- Training Large Language Model to Reason in a Continuous Latent Space☆1,522Aug 12, 2025Updated 6 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆350Oct 22, 2024Updated last year
- Pretraining and inference code for a large-scale depth-recurrent language model☆864Dec 29, 2025Updated 2 months ago
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬☆12,216Dec 19, 2025Updated 2 months ago
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,403Nov 29, 2024Updated last year
- Automating the Search for Artificial Life with Foundation Models!☆451Oct 23, 2025Updated 4 months ago
- Large Concept Models: Language modeling in a sentence representation space☆2,338Jan 29, 2025Updated last year
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,788Dec 29, 2025Updated 2 months ago
- Tools for merging pretrained large language models.☆6,826Updated this week
- Minimal reproduction of DeepSeek R1-Zero☆12,853Feb 27, 2026Updated last week
- Minimalistic large language model 3D-parallelism training☆2,579Feb 19, 2026Updated 2 weeks ago
- Sky-T1: Train your own O1 preview model within $450☆3,369Jul 12, 2025Updated 7 months ago
- ☆231Feb 24, 2025Updated last year
- Fully open reproduction of DeepSeek-R1☆25,910Nov 24, 2025Updated 3 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆372Dec 12, 2024Updated last year
- s1: Simple test-time scaling