Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge
☆121Apr 27, 2026Updated 3 weeks ago
Alternatives and similar repositories for Multiplex-Thinking
Users that are interested in Multiplex-Thinking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"☆336Jan 26, 2026Updated 3 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆64Mar 17, 2026Updated 2 months ago
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆25Oct 8, 2024Updated last year
- [EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generation☆15Aug 20, 2025Updated 9 months ago
- Official implementation for Text Generation Beyond Discrete Token Sampling☆25Aug 11, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆22Dec 11, 2024Updated last year
- KARL: Knowledge-Aware Reasoning and Reinforcement Learning for Knowledge-Intensive Visual Grounding☆68Apr 5, 2026Updated last month
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated last year
- The official code respository for "Crystalformer: Infinitely Connected Attention for Periodic Structure Encoding" (ICLR 2024)☆28Mar 8, 2025Updated last year
- Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning☆24Jun 25, 2025Updated 10 months ago
- ☆29Mar 13, 2026Updated 2 months ago
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling☆13May 5, 2025Updated last year
- ☆58Jan 31, 2026Updated 3 months ago
- The repo contains the code and dataset for the World Models Track of GigaBrain Challenge 2026 CVPR Workshop.☆59Apr 8, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆84Sep 13, 2025Updated 8 months ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated 2 years ago
- Code for paper ”Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability“☆15Jun 13, 2023Updated 2 years ago
- [ICCV2025] WikiAutoGen offical page☆26Feb 6, 2026Updated 3 months ago
- Revisiting End-to-End Speech-to-Text Translation From Scratch☆13Feb 21, 2023Updated 3 years ago
- ☆39Jul 16, 2025Updated 10 months ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆22Oct 16, 2025Updated 7 months ago
- [NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- Newton's simulation environment using Nvidia's Isaac Sim☆37May 26, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of Decision Stacks: Flexible RL via Modular Generative Models [NeurIPS 2023]☆12Jun 27, 2023Updated 2 years ago
- Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"☆175Mar 18, 2026Updated 2 months ago
- Benchmarking Social Intelligence of Language Agents through Interactive Scenarios☆13Jan 4, 2025Updated last year
- ☆19May 17, 2025Updated last year
- ☆14Apr 30, 2025Updated last year
- [ACL 2026] Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning☆92Jan 22, 2026Updated 4 months ago
- TBD☆56Mar 13, 2026Updated 2 months ago
- ☆15Mar 8, 2024Updated 2 years ago
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forge-retain Pareto Optimality☆21Oct 22, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆16Mar 7, 2025Updated last year
- ☆11Jun 7, 2023Updated 2 years ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆39May 9, 2024Updated 2 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021☆61May 10, 2021Updated 5 years ago
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆29May 9, 2026Updated 2 weeks ago
- ☆14Sep 22, 2025Updated 8 months ago
- Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations☆22Dec 24, 2025Updated 4 months ago