OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆176Jan 16, 2025Updated last year
Alternatives and similar repositories for OpenCoconut
Users that are interested in OpenCoconut are comparing it to the libraries listed below
Sorting:
- ☆26Jan 14, 2025Updated last year
- Training Large Language Model to Reason in a Continuous Latent Space☆1,522Aug 12, 2025Updated 6 months ago
- ☆15Apr 26, 2025Updated 10 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Mar 7, 2025Updated 11 months ago
- Recipes to scale inference-time compute of open models☆1,129May 22, 2025Updated 9 months ago
- PAHF Personalized Agent from Human Feedback☆31Feb 17, 2026Updated 2 weeks ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆30Aug 18, 2024Updated last year
- ☆40Jul 26, 2024Updated last year
- Deep research agents using MiniMax M2.1 interleaved thinking☆199Dec 23, 2025Updated 2 months ago
- Our library for RL environments + evals☆3,869Updated this week
- ☆31Sep 23, 2024Updated last year
- ☆338Jul 28, 2025Updated 7 months ago
- Training tiny models to prove hard theorems☆29Feb 15, 2026Updated 2 weeks ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆31Jun 5, 2025Updated 8 months ago
- ☆205Apr 19, 2025Updated 10 months ago
- Synthetic data curation for post-training and structured data extraction☆1,638Jan 24, 2026Updated last month
- Scalable RL solution for advanced reasoning of language models☆1,809Mar 18, 2025Updated 11 months ago
- Exploring Applications of GRPO☆251Aug 25, 2025Updated 6 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆459Sep 27, 2024Updated last year
- ☆19Mar 16, 2025Updated 11 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆49Nov 6, 2024Updated last year
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated last year
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,352Jan 16, 2026Updated last month
- [𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…☆51May 4, 2024Updated last year
- ☆137Mar 20, 2025Updated 11 months ago
- Repo of paper "Free Process Rewards without Process Labels"☆169Mar 14, 2025Updated 11 months ago
- ☆133Dec 23, 2024Updated last year
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆41Feb 15, 2024Updated 2 years ago
- Entropy Based Sampling and Parallel CoT Decoding☆3,434Nov 13, 2024Updated last year
- An introduction to LLM Sampling☆79Dec 15, 2024Updated last year
- AI agent workflow for generating profiles of clients and running research tasks for them. There is an agent for each part of the process:…☆83Oct 20, 2024Updated last year
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆20Feb 26, 2025Updated last year
- Plotting (entropy, varentropy) for small LMs☆99May 20, 2025Updated 9 months ago
- ☆24Apr 3, 2025Updated 11 months ago
- Verifiers for LLM Reinforcement Learning☆80Apr 15, 2025Updated 10 months ago
- ☆29Jul 14, 2025Updated 7 months ago
- A micro LLM multi-agent system for data analysis☆17Apr 27, 2025Updated 10 months ago
- ExplainitAll — это библиотека для интерпретируемого ИИ, предназначенная для интерпретации генеративных моделей ( GPT-like), и векторизато…☆19Oct 11, 2024Updated last year
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year