GMLR-Penn / Multiplex-ThinkingLinks
Multiplex Thinking
☆48Updated last week
Alternatives and similar repositories for Multiplex-Thinking
Users that are interested in Multiplex-Thinking are comparing it to the libraries listed below
Sorting:
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 7 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆57Updated 3 months ago
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆64Updated 3 weeks ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆64Updated last week
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆119Updated 2 weeks ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆56Updated 2 months ago
- Defeating the Training-Inference Mismatch via FP16☆179Updated 2 months ago
- Geometric-Mean Policy Optimization☆97Updated 2 months ago
- ☆128Updated 2 weeks ago
- Official Repository of Native Parallel Reasoner☆98Updated last week
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆51Updated last month
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Updated 3 weeks ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆56Updated 3 weeks ago
- ☆110Updated 4 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆35Updated 3 months ago
- ☆64Updated 3 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Updated 8 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆89Updated 7 months ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆36Updated last month
- Esoteric Language Models☆109Updated 2 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last month
- Process Reward Models That Think☆77Updated last month
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆190Updated 6 months ago
- Reinforcing General Reasoning without Verifiers☆93Updated 7 months ago
- The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Updated 4 months ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆218Updated 2 months ago
- Resa: Transparent Reasoning Models via SAEs☆47Updated 4 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated last year
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆89Updated 3 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆68Updated 9 months ago