☆6,374Dec 2, 2025Updated 3 months ago
Alternatives and similar repositories for TinyRecursiveModels
Users that are interested in TinyRecursiveModels are comparing it to the libraries listed below
Sorting:
- Hierarchical Reasoning Model Official Release☆12,339Sep 9, 2025Updated 6 months ago
- Minimal reproduction of DeepSeek R1-Zero☆12,896Feb 27, 2026Updated last week
- My submission to the ARC-AGI-3 Developer Preview Agent Compitition.☆42Jan 27, 2026Updated last month
- Minimal agent runtime built with DSPy modules and a thin Python loop. Includes CLI, FastAPI server, and eval harness with OpenAI/Ollama s…☆70Dec 22, 2025Updated 2 months ago
- Official inference framework for 1-bit LLMs☆28,697Feb 3, 2026Updated last month
- ☆172Aug 15, 2025Updated 6 months ago
- NanoGPT (124M) in 2 minutes☆4,734Feb 27, 2026Updated last week
- Training Large Language Model to Reason in a Continuous Latent Space☆1,529Aug 12, 2025Updated 6 months ago
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- Open-source implementation of AlphaEvolve☆5,525Feb 4, 2026Updated last month
- dLLM: Simple Diffusion Language Modeling☆2,123Feb 27, 2026Updated last week
- Entropy Based Sampling and Parallel CoT Decoding☆3,432Nov 13, 2024Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025]☆28Feb 20, 2026Updated 2 weeks ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆53,029Updated this week
- Structured Outputs☆13,488Mar 2, 2026Updated last week
- Train transformer language models with reinforcement learning.☆17,523Updated this week
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…☆8,976Updated this week
- DSPy: The framework for programming—not prompting—language models☆32,519Updated this week
- Async RL Training at Scale☆1,107Updated this week
- Pretraining and inference code for a large-scale depth-recurrent language model☆865Dec 29, 2025Updated 2 months ago
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆1,880Aug 13, 2025Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆71,883Updated this week
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,659Feb 23, 2026Updated 2 weeks ago
- Go ahead and axolotl questions☆11,395Updated this week
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Feb 7, 2023Updated 3 years ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆125Apr 21, 2025Updated 10 months ago
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,788Dec 29, 2025Updated 2 months ago
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬☆12,273Dec 19, 2025Updated 2 months ago
- General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.☆2,896Mar 2, 2026Updated last week
- Code for BLT research paper☆2,029Nov 3, 2025Updated 4 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆108Nov 25, 2025Updated 3 months ago
- Democratizing Reinforcement Learning for LLMs☆5,196Updated this week
- Fully open reproduction of DeepSeek-R1☆25,927Nov 24, 2025Updated 3 months ago
- ☆144Sep 29, 2025Updated 5 months ago
- Tools for merging pretrained large language models.☆6,826Feb 28, 2026Updated last week
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆861Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆54,071Nov 12, 2025Updated 3 months ago
- Run frontier AI locally.☆41,955Mar 2, 2026Updated last week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆9,750Feb 12, 2026Updated 3 weeks ago