SakanaAI / continuous-thought-machines
Continuous Thought Machines, because thought takes time and reasoning is a process.
☆492 · Updated this week
Alternatives and similar repositories for continuous-thought-machines
Users interested in continuous-thought-machines are comparing it to the libraries listed below.
- Pretraining code for a large-scale depth-recurrent language model ☆760 · Updated last month
- Build your own visual reasoning model ☆362 · Updated this week
- Dream 7B, a large diffusion language model ☆630 · Updated 2 weeks ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,058 · Updated 3 months ago
- Unofficial PyTorch implementation of Titans, a SOTA memory mechanism for transformers ☆1,323 · Updated last month
- Muon optimizer: >30% better sample efficiency with <3% wall-clock overhead (a minimal sketch of the update follows this list) ☆623 · Updated last month
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… (a toy sketch follows this list) ☆327 · Updated 5 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs" ☆450 · Updated this week
- This repo contains the code for the paper "Intuitive physics understanding emerges from self-supervised pretraining on natural videos" ☆154 · Updated 2 months ago
- Training Large Language Model to Reason in a Continuous Latent Space (a toy sketch of the latent-thought loop follows this list) ☆1,109 · Updated 3 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally applicable memory systems for transformers. ☆307 · Updated 6 months ago
- Automating the Search for Artificial Life with Foundation Models! ☆410 · Updated 4 months ago
- ☆177 · Updated 5 months ago
- Code for the BLT (Byte Latent Transformer) research paper ☆1,587 · Updated this week
- [ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters ☆555 · Updated 3 months ago
- Procedural reasoning datasets ☆580 · Updated this week
- Understanding R1-Zero-Like Training: A Critical Perspective ☆925 · Updated last month
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆307 · Updated 5 months ago
- Muon is Scalable for LLM Training ☆1,044 · Updated last month
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models ☆595 · Updated last month
- Getting crystal-like representations with harmonic loss ☆183 · Updated last month
- Implementing DeepSeek R1's GRPO algorithm from scratch (a sketch of the group-relative advantage follows this list) ☆1,328 · Updated 3 weeks ago
- noise_step: Training in 1.58b With No Gradient Memory ☆219 · Updated 4 months ago
- Atom of Thoughts for Markov LLM Test-Time Scaling ☆563 · Updated this week
- prime is a framework for efficient, globally distributed training of AI models over the internet. ☆743 · Updated last week
- Official PyTorch implementation for "Large Language Diffusion Models" ☆1,592 · Updated last week
- Recipes to scale inference-time compute of open models ☆1,071 · Updated last week
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling ☆871 · Updated 2 weeks ago
- System 2 Reasoning Link Collection ☆833 · Updated 2 months ago
- prime-rl is a codebase for decentralized RL training at scale ☆211 · Updated this week
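Several entries above name concrete techniques; the short sketches below illustrate their core ideas. They are toy versions written for this page under stated assumptions, not code from the linked repositories.

Muon's reported efficiency comes from orthogonalizing the momentum-smoothed update of each 2D weight matrix before applying it. A minimal sketch, assuming plain (non-Nesterov) momentum and using an SVD for clarity where the real optimizer uses a cheaper Newton-Schulz approximation; `muon_step` and its defaults are invented for illustration:

```python
import torch

def muon_step(param: torch.Tensor, grad: torch.Tensor, buf: torch.Tensor,
              lr: float = 0.02, momentum: float = 0.95) -> None:
    """One Muon-style update for a 2D weight matrix (toy version)."""
    buf.mul_(momentum).add_(grad)  # momentum accumulation
    # Replace the update with its nearest (semi-)orthogonal matrix. The real
    # optimizer approximates this with a few Newton-Schulz iterations on GPU.
    u, _, vT = torch.linalg.svd(buf, full_matrices=False)
    param.add_(u @ vT, alpha=-lr)

# Toy usage: one step on a random 4x3 weight matrix.
w, g = torch.randn(4, 3), torch.randn(4, 3)
buf = torch.zeros_like(w)
muon_step(w, g, buf)
```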
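The memory-layers entry describes a trainable key-value table that grows a model's parameter count without growing its per-token compute much. A toy sketch that scores every slot and reads out the top-k; real implementations use product keys so even the scoring stays cheap, and `num_slots`/`k` here are illustrative:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseKeyValueMemory(nn.Module):
    """Toy memory layer: a large trainable key-value table with top-k readout."""
    def __init__(self, dim: int, num_slots: int = 4096, k: int = 8):
        super().__init__()
        self.query_proj = nn.Linear(dim, dim)
        self.keys = nn.Parameter(torch.randn(num_slots, dim) * 0.02)
        self.values = nn.Parameter(torch.randn(num_slots, dim) * 0.02)
        self.k = k

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        q = self.query_proj(x)                   # (batch, seq, dim)
        scores = q @ self.keys.t()               # (batch, seq, num_slots)
        top_scores, top_idx = scores.topk(self.k, dim=-1)
        weights = F.softmax(top_scores, dim=-1)  # (batch, seq, k)
        gathered = self.values[top_idx]          # (batch, seq, k, dim)
        out = (weights.unsqueeze(-1) * gathered).sum(dim=-2)
        return x + out                           # residual connection

layer = SparseKeyValueMemory(dim=64)
print(layer(torch.randn(2, 16, 64)).shape)  # torch.Size([2, 16, 64])
```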
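The continuous-latent-space entry (the Coconut line of work) replaces some chain-of-thought tokens with continuous "thoughts": the model's last hidden state is fed back as the next input embedding instead of being decoded into a token. A toy sketch with a GRU cell standing in for the transformer; the class and its parameters are invented for illustration:

```python
import torch
import torch.nn as nn

class TinyLatentReasoner(nn.Module):
    """Toy model that 'thinks' in hidden-state space before decoding."""
    def __init__(self, vocab_size: int = 100, dim: int = 32):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.cell = nn.GRUCell(dim, dim)  # stand-in for a transformer block
        self.lm_head = nn.Linear(dim, vocab_size)

    def forward(self, tokens: torch.Tensor, num_thoughts: int = 3):
        h = torch.zeros(tokens.size(0), self.cell.hidden_size)
        for t in range(tokens.size(1)):  # consume the prompt tokens
            h = self.cell(self.embed(tokens[:, t]), h)
        for _ in range(num_thoughts):    # latent thoughts: the hidden state
            h = self.cell(h, h)          # is reused directly as the input
        return self.lm_head(h)           # decode only after thinking

model = TinyLatentReasoner()
print(model(torch.randint(0, 100, (2, 5))).shape)  # torch.Size([2, 100])
```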
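GRPO, as described in the DeepSeek papers, drops PPO's learned value baseline: several completions are sampled per prompt, and each reward is normalized against its own group to form the advantage. A minimal sketch of that advantage computation (the clipped policy-gradient objective applied on top of it is omitted):

```python
import torch

def grpo_advantages(rewards: torch.Tensor, eps: float = 1e-4) -> torch.Tensor:
    """Group-relative advantages from a (num_prompts, group_size) reward matrix.

    Each row holds scalar rewards for the completions sampled from one prompt;
    normalizing within the row gives a baseline with no value network."""
    mean = rewards.mean(dim=-1, keepdim=True)
    std = rewards.std(dim=-1, keepdim=True)
    return (rewards - mean) / (std + eps)

# Example: 2 prompts, 4 completions each; correct answers get reward 1.
rewards = torch.tensor([[1.0, 0.0, 0.0, 1.0],
                        [0.0, 0.0, 0.0, 1.0]])
print(grpo_advantages(rewards))
```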