JinjieNi / OpenMoE2Links
The official repo for "OpenMoE 2: Sparse Diffusion Language Models".
☆51Updated last month
Alternatives and similar repositories for OpenMoE2
Users that are interested in OpenMoE2 are comparing it to the libraries listed below
Sorting:
- Easy and Efficient dLLM Fine-Tuning☆203Updated last week
- ☆137Updated last week
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆67Updated 4 months ago
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusio…☆98Updated this week
- This is the offical repository of InfiniteVL☆76Updated last month
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆206Updated 3 months ago
- [ICLR 2026] Geometric-Mean Policy Optimization☆98Updated this week
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Updated last year
- LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.☆236Updated last month
- ☆204Updated last month
- Official Repository of Native Parallel Reasoner☆100Updated last week
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge☆94Updated 2 weeks ago
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆398Updated last month
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics☆70Updated 2 weeks ago
- The official repo of VideoAgentTrek☆41Updated 3 months ago
- ☆110Updated 4 months ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆219Updated 2 months ago
- Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding☆191Updated last month
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆238Updated 2 weeks ago
- Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"☆79Updated last month
- Official PyTorch implementation of TokenSet.☆127Updated 10 months ago
- ☆35Updated 9 months ago
- ☆63Updated 6 months ago
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆236Updated 3 weeks ago
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆56Updated 3 months ago
- This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning"☆57Updated last month
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆129Updated 8 months ago
- VideoNSA: Native Sparse Attention Scales Video Understanding☆78Updated 2 months ago
- QeRL enables RL for 32B LLMs on a single H100 GPU.☆477Updated 2 months ago
- Spectral Sphere Optimizer☆90Updated 2 weeks ago