OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆176Jan 16, 2025Updated last year
Alternatives and similar repositories for OpenCoconut
Users that are interested in OpenCoconut are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆26Jan 14, 2025Updated last year
- Training Large Language Model to Reason in a Continuous Latent Space☆1,536Aug 12, 2025Updated 7 months ago
- ☆15Apr 26, 2025Updated 10 months ago
- ☆206Apr 19, 2025Updated 11 months ago
- Recipes to scale inference-time compute of open models☆1,130May 22, 2025Updated 10 months ago
- ☆16Mar 22, 2025Updated last year
- ☆136Dec 23, 2024Updated last year
- ☆13Nov 4, 2025Updated 4 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆190Mar 7, 2025Updated last year
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆31Aug 18, 2024Updated last year
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…☆52May 7, 2025Updated 10 months ago
- Minimal Claude Code alternative powered by MLX☆46Jan 11, 2026Updated 2 months ago
- ☆137Mar 20, 2025Updated last year
- Our library for RL environments + evals☆3,918Updated this week
- Scalable RL solution for advanced reasoning of language models☆1,821Mar 18, 2025Updated last year
- Training tiny models to prove hard theorems☆64Mar 5, 2026Updated 2 weeks ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆465Sep 27, 2024Updated last year
- An introduction to LLM Sampling☆80Dec 15, 2024Updated last year
- Synthetic data curation for post-training and structured data extraction☆1,646Updated this week
- ☆12Jul 8, 2024Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆20Feb 26, 2025Updated last year
- ☆35Oct 23, 2025Updated 5 months ago
- PAHF Personalized Agent from Human Feedback☆44Mar 6, 2026Updated 2 weeks ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,368Mar 15, 2026Updated last week
- Repo of paper "Free Process Rewards without Process Labels"☆170Mar 14, 2025Updated last year
- Deep research agents using MiniMax M2.1 interleaved thinking☆203Dec 23, 2025Updated 3 months ago
- ☆120Jun 11, 2025Updated 9 months ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆13Sep 13, 2024Updated last year
- [𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦 𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…☆51May 4, 2024Updated last year
- Entropy Based Sampling and Parallel CoT Decoding☆3,434Nov 13, 2024Updated last year
- ☆40Jul 26, 2024Updated last year
- Exploring Applications of GRPO☆252Aug 25, 2025Updated 6 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …☆21Nov 17, 2025Updated 4 months ago
- ☆26Nov 13, 2025Updated 4 months ago
- Simple RL training for reasoning☆3,841Dec 23, 2025Updated 3 months ago