facebookresearch / coconut
Training Large Language Model to Reason in a Continuous Latent Space
☆1,062Updated 2 months ago
Alternatives and similar repositories for coconut:
Users that are interested in coconut are comparing it to the libraries listed below
- Recipes to scale inference-time compute of open models☆1,055Updated last month
- A bibliography and survey of the papers surrounding o1☆1,187Updated 5 months ago
- Verifiers for LLM Reinforcement Learning☆813Updated 3 weeks ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆863Updated last week
- Pretraining code for a large-scale depth-recurrent language model☆743Updated last week
- ☆1,015Updated 4 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆1,382Updated this week
- ☆518Updated last week
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,040Updated 2 months ago
- LIMO: Less is More for Reasoning☆913Updated 2 weeks ago
- Large Reasoning Models☆802Updated 4 months ago
- procedural reasoning datasets☆565Updated this week
- An Open Large Reasoning Model for Real-World Solutions☆1,484Updated last month
- ☆920Updated 2 months ago
- System 2 Reasoning Link Collection☆826Updated last month
- Code for BLT research paper☆1,513Updated this week
- ☆1,355Updated 5 months ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,141Updated last week
- Synthetic data curation for post-training and structured data extraction☆1,230Updated this week
- ☆630Updated 3 weeks ago
- A reading list on LLM based Synthetic Data Generation 🔥☆1,246Updated 2 months ago
- OLMoE: Open Mixture-of-Experts Language Models☆716Updated last month
- Official PyTorch implementation for "Large Language Diffusion Models"☆1,492Updated 2 weeks ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆1,928Updated last week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,438Updated this week
- Official Repo for Open-Reasoner-Zero☆1,872Updated 2 weeks ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆991Updated last month
- A library for advanced large language model reasoning☆2,099Updated last week
- Muon is Scalable for LLM Training☆1,022Updated 3 weeks ago
- Dream 7B, a large diffusion language model☆551Updated last week