GMLR-Penn/Multiplex-Thinking

GMLR-Penn / Multiplex-Thinking

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

☆131

Alternatives and similar repositories for Multiplex-Thinking

Users who are interested in Multiplex-Thinking are comparing it to the repositories listed below.

UCSB-AI / Soft-Thinking
Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"
☆345 · Jun 12, 2026 · Updated last month
xiaomi-research / colar
[NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
☆97 · Jun 29, 2026 · Updated 3 weeks ago
DJC-GO-SOLO / Latent-SFT
Official implementation of Latent-SFT: teaching LLMs to reason with vocabulary-space latent chains.
☆55 · May 18, 2026 · Updated 2 months ago
digailab / awesome-llm-implicit-reasoning
☆117 · Jan 11, 2026 · Updated 6 months ago
pUmpKin-Co / ComplementaryRL
Co-evolving policy actors and experience extractors for efficient experience-driven agent RL
☆51 · May 12, 2026 · Updated 2 months ago
Trae1ounG / Pretrain_Space_RLVR
[arxiv: 2604.14142] From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
☆17 · Apr 16, 2026 · Updated 3 months ago
bigai-nlco / LatentSeek
Official Repository of LatentSeek
☆85 · Jun 6, 2025 · Updated last year
wenhaochai / claude-plugins
Personal Claude Code plugin marketplace
☆16 · Updated this week
lasgroup / SDPO
Reinforcement Learning via Self-Distillation (SDPO)
☆1,021 · Jul 1, 2026 · Updated 3 weeks ago
zz1358m / ATP-Latent-master
☆17 · Feb 4, 2026 · Updated 5 months ago
enkeejunior1 / min-pi-flow
☆56 · Nov 6, 2025 · Updated 8 months ago
applese233 / ICRL
In-Context Reinforcement Learning for Tool Use in Large Language Models
☆48 · Mar 26, 2026 · Updated 3 months ago
zz1358m / SofT-GRPO-master
Code for the SofT-GRPO algorithm on the LLM soft-thinking reasoning pattern.
☆52 · Jan 2, 2026 · Updated 6 months ago
FYYDCC / IVT-LR
Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”
☆18 · Jan 27, 2026 · Updated 5 months ago
Simplified-Reasoning / LUFFY
Official Repository of "Learning to Reason under Off-Policy Guidance"
☆460 · Mar 20, 2026 · Updated 4 months ago
GregxmHu / OccuBench
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models
☆21 · Apr 14, 2026 · Updated 3 months ago
zhenyi4 / codi
Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"
☆102 · Dec 15, 2025 · Updated 7 months ago
apple / ml-tada
☆17 · Jul 3, 2025 · Updated last year
encoreus / GS-Jacobi_for_TarFlow
Accelerate TarFlow Sampling with GS-Jacobi Iteration
☆17 · Dec 13, 2025 · Updated 7 months ago
PRIME-RL / Entropy-Mechanism-of-RL
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
☆444 · Jul 11, 2025 · Updated last year
MasterVito / SvS
Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training
☆54 · Dec 13, 2025 · Updated 7 months ago
Yueeeeeeee / HRPO
[NeurIPS 2025] Hybrid Latent Reasoning via Reinforcement Learning
☆196 · Sep 15, 2025 · Updated 10 months ago
TuringEyeTest / TuringEyeTest
Pixels, Patterns, but no Poetry: To See the World like Humans
☆18 · Aug 11, 2025 · Updated 11 months ago
test-time-training / discover
☆611 · May 24, 2026 · Updated 2 months ago
hhh675597 / revisiting_opd
[COLM 2026] Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes
☆126 · May 19, 2026 · Updated 2 months ago
OpenMOSS / ABC-Bench
ABC-Bench is a benchmark for Agentic Backend Coding. It evaluates whether code agents can explore real repositories, edit code, configure…
☆33 · Jan 20, 2026 · Updated 6 months ago
thematrixmaster / edit-flows-demo
Educational Implementation of "Edit Flows: Flow Matching with Edit Operations" by Havasi et al.
☆46 · Oct 17, 2025 · Updated 9 months ago
D2I-ai / dasd-thinking
☆105 · Jan 27, 2026 · Updated 5 months ago
LUMIA-Group / PonderingLM
Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"
☆26 · Jul 21, 2025 · Updated last year
facebookresearch / coconut
Training Large Language Model to Reason in a Continuous Latent Space
☆1,667 · Jul 2, 2026 · Updated 3 weeks ago
Trae1ounG / BuPO
[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
☆60 · Feb 6, 2026 · Updated 5 months ago
Xuekai-Zhu / FlowRL
☆180 · Nov 24, 2025 · Updated 8 months ago
multimodal-art-projection / LatentCoT-Horizon
📖 This is a repository for organizing papers, code, and other resources related to Latent Reasoning.
☆405 · Nov 5, 2025 · Updated 8 months ago
hustvl / MoDA
A hardware-aware, efficient implementation of "Mixture-of-Depths Attention".
☆274 · May 6, 2026 · Updated 2 months ago
aaronserianni / attention-iou
[CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps
☆13 · Mar 26, 2025 · Updated last year
XIAO4579 / PRISM
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL
☆97 · May 6, 2026 · Updated 2 months ago
xlyu0106 / Awesome-Latent-Space
A paper list of Awesome Latent Space.
☆946 · Jul 13, 2026 · Updated last week
GAIR-NLP / lm-open-science-evaluation
Reproducible and flexible LLM evaluations for scientific reasoning.
☆29 · Jul 23, 2025 · Updated last year
w-yibo / VTC-R1
VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning.
☆26 · Updated this week