SakanaAI / continuous-thought-machinesLinks
Continuous Thought Machines, because thought takes time and reasoning is a process.
β1,026Updated 3 weeks ago
Alternatives and similar repositories for continuous-thought-machines
Users that are interested in continuous-thought-machines are comparing it to the libraries listed below
Sorting:
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,106Updated 4 months ago
- Pretraining code for a large-scale depth-recurrent language modelβ783Updated 2 weeks ago
- Official PyTorch implementation for "Large Language Diffusion Models"β2,378Updated last week
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorchβ1,384Updated 3 weeks ago
- Training Large Language Model to Reason in a Continuous Latent Spaceβ1,162Updated 5 months ago
- Dream 7B, a large diffusion language modelβ774Updated last week
- Code for BLT research paperβ1,686Updated last month
- Open-source implementation of AlphaEvolveβ2,676Updated this week
- Build your own visual reasoning modelβ385Updated last week
- Muon: An optimizer for hidden layers in neural networksβ897Updated 2 weeks ago
- Automating the Search for Artificial Life with Foundation Models!β420Updated 5 months ago
- Official repository for our work on micro-budget training of large-scale diffusion models.β1,481Updated 5 months ago
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Modelsβ1,136Updated last week
- Large Concept Models: Language modeling in a sentence representation spaceβ2,233Updated 4 months ago
- Muon is Scalable for LLM Trainingβ1,081Updated 2 months ago
- procedural reasoning datasetsβ872Updated this week
- PyTorch code and models for VJEPA2 self-supervised learning from video.β1,522Updated this week
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!β1,294Updated 3 weeks ago
- Minimalistic 4D-parallelism distributed training framework for education purposeβ1,554Updated 3 weeks ago
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Modelsβ703Updated 2 months ago
- This repo contains the code for the paper "Intuitive physics understanding emerges fromself-supervised pretraining on natural videos"β164Updated 4 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"β476Updated last month
- β1,153Updated last month
- β313Updated last month
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.β311Updated 8 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the inputβ737Updated 2 weeks ago
- Official Repository of Absolute Zero Reasonerβ1,542Updated 3 weeks ago
- Recipes to scale inference-time compute of open modelsβ1,097Updated last month
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsβ¦β339Updated 6 months ago
- Verifiers for LLM Reinforcement Learningβ1,328Updated this week