SakanaAI / continuous-thought-machinesLinks
Continuous Thought Machines, because thought takes time and reasoning is a process.
☆1,396Updated last month
Alternatives and similar repositories for continuous-thought-machines
Users that are interested in continuous-thought-machines are comparing it to the libraries listed below
Sorting:
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,166Updated 9 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model☆847Updated last month
- Code for BLT research paper☆2,008Updated 2 weeks ago
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch☆1,508Updated last month
- Self-Adapting Language Models☆1,502Updated 3 months ago
- Dream 7B, a large diffusion language model☆1,081Updated last month
- Training Large Language Model to Reason in a Continuous Latent Space☆1,339Updated 3 months ago
- AlphaGo Moment for Model Architecture Discovery.☆1,114Updated 3 months ago
- H-Net: Hierarchical Network with Dynamic Chunking☆778Updated last month
- Muon is an optimizer for hidden layers in neural networks☆2,002Updated 4 months ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆3,263Updated last week
- PyTorch code and models for VJEPA2 self-supervised learning from video.☆2,438Updated 2 months ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,222Updated last week
- ☆525Updated 5 months ago
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning☆553Updated last week
- Official Repository of Absolute Zero Reasoner☆1,747Updated 2 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆913Updated 5 months ago
- dLLM: Simple Diffusion Language Modeling☆950Updated this week
- Automating the Search for Artificial Life with Foundation Models!☆442Updated 3 weeks ago
- Muon is Scalable for LLM Training☆1,359Updated 3 months ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆927Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,670Updated 7 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆644Updated last week
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A…☆940Updated 5 months ago
- [ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters☆578Updated 9 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆559Updated last month
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)☆508Updated last month
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆1,741Updated 3 months ago
- ☆2,432Updated 2 weeks ago
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆564Updated last year