SakanaAI / continuous-thought-machinesLinks
Continuous Thought Machines, because thought takes time and reasoning is a process.
β1,687Updated last week
Alternatives and similar repositories for continuous-thought-machines
Users that are interested in continuous-thought-machines are comparing it to the libraries listed below
Sorting:
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,179Updated 11 months ago
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorchβ1,829Updated 2 weeks ago
- Pretraining and inference code for a large-scale depth-recurrent language modelβ857Updated last week
- Code for BLT research paperβ2,018Updated 2 months ago
- AlphaGo Moment for Model Architecture Discovery.β1,127Updated last month
- β596Updated 7 months ago
- Muon is an optimizer for hidden layers in neural networksβ2,154Updated last month
- H-Net: Hierarchical Network with Dynamic Chunkingβ798Updated last month
- Darwin GΓΆdel Machine: Open-Ended Evolution of Self-Improving Agentsβ1,782Updated 4 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolutionβ761Updated last week
- Implementing DeepSeek R1's GRPO algorithm from scratchβ1,724Updated 8 months ago
- Self-Adapting Language Modelsβ1,629Updated 5 months ago
- PyTorch code and models for VJEPA2 self-supervised learning from video.β2,618Updated 4 months ago
- Training Large Language Model to Reason in a Continuous Latent Spaceβ1,430Updated 4 months ago
- Dream 7B, a large diffusion language modelβ1,134Updated last month
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learningβ569Updated last month
- A minimal implementation of DeepMind's Genie world modelβ1,081Updated last month
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewardsβ1,296Updated 3 weeks ago
- A Reproduction of GDM's Nested Learning Paperβ524Updated last month
- Official Repository of Absolute Zero Reasonerβ1,782Updated 4 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the inputβ933Updated 6 months ago
- Automating the Search for Artificial Life with Foundation Models!β448Updated 2 months ago
- Large Concept Models: Language modeling in a sentence representation spaceβ2,317Updated 11 months ago
- dLLM: Simple Diffusion Language Modelingβ1,541Updated this week
- NanoGPT (124M) in 3 minutesβ4,085Updated this week
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.β345Updated last year
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse β¦β780Updated this week
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scalingβ627Updated last month
- Official PyTorch implementation for "Large Language Diffusion Models"β3,459Updated last month
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's Aβ¦β958Updated 7 months ago