SakanaAI / continuous-thought-machinesLinks
Continuous Thought Machines, because thought takes time and reasoning is a process.
β1,578Updated last month
Alternatives and similar repositories for continuous-thought-machines
Users that are interested in continuous-thought-machines are comparing it to the libraries listed below
Sorting:
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorchβ1,566Updated 2 weeks ago
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,174Updated 10 months ago
- Pretraining and inference code for a large-scale depth-recurrent language modelβ852Updated last month
- Code for BLT research paperβ2,013Updated last month
- H-Net: Hierarchical Network with Dynamic Chunkingβ793Updated 3 weeks ago
- Self-Adapting Language Modelsβ1,593Updated 4 months ago
- Dream 7B, a large diffusion language modelβ1,099Updated 3 weeks ago
- β569Updated 6 months ago
- Darwin GΓΆdel Machine: Open-Ended Evolution of Self-Improving Agentsβ1,760Updated 3 months ago
- Muon is an optimizer for hidden layers in neural networksβ2,075Updated 2 weeks ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolutionβ705Updated last week
- β712Updated last week
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learningβ563Updated last month
- AlphaGo Moment for Model Architecture Discovery.β1,122Updated last week
- Official Repository of Absolute Zero Reasonerβ1,769Updated 3 months ago
- Training Large Language Model to Reason in a Continuous Latent Spaceβ1,382Updated 4 months ago
- dLLM: Simple Diffusion Language Modelingβ1,261Updated this week
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the inputβ927Updated 6 months ago
- β2,477Updated last month
- A minimal implementation of DeepMind's Genie world modelβ1,052Updated 2 weeks ago
- β5,964Updated last week
- Official PyTorch implementation for "Large Language Diffusion Models"β3,365Updated last month
- Automating the Search for Artificial Life with Foundation Models!β445Updated last month
- Large Concept Models: Language modeling in a sentence representation spaceβ2,309Updated 10 months ago
- Spiking Brain-inspired Large Models, integrating hybrid efficient attention, MoE modules and spike encoding into its architectureβ1,212Updated last week
- PyTorch code and models for VJEPA2 self-supervised learning from video.β2,529Updated 3 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.β330Updated last year
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)β523Updated 2 months ago
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's Aβ¦β947Updated 6 months ago
- A Reproduction of GDM's Nested Learning Paperβ407Updated last week