JinjieNi / OpenMoE2Links
The official repo for "OpenMoE 2: Sparse Diffusion Language Models".
☆49Updated last month
Alternatives and similar repositories for OpenMoE2
Users that are interested in OpenMoE2 are comparing it to the libraries listed below
Sorting:
- Easy and Efficient dLLM Fine-Tuning☆168Updated last week
- LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.☆164Updated this week
- The official github repo for "Diffusion Language Models are Super Data Learners".☆212Updated last month
- Geometric-Mean Policy Optimization☆95Updated last month
- ☆79Updated last month
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆363Updated this week
- The official repo of VideoAgentTrek☆35Updated last month
- ☆107Updated 3 months ago
- Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.☆95Updated last month
- ☆95Updated 2 weeks ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 11 months ago
- SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention☆176Updated this week
- QeRL enables RL for 32B LLMs on a single H100 GPU.☆467Updated 3 weeks ago
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆65Updated 3 months ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆214Updated 2 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆193Updated 2 months ago
- ☆126Updated this week
- ☆64Updated 5 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆125Updated 7 months ago
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning☆177Updated last week
- Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding☆186Updated this week
- Official PyTorch implementation of TokenSet.☆127Updated 9 months ago
- Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"☆72Updated 2 weeks ago
- d3LLM: Ultra-Fast Diffusion LLM 🚀☆38Updated this week
- [Arxiv 2025] SparseD: Sparse Attention for Diffusion Language Models☆51Updated 2 months ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆187Updated last month
- This is the offical repository of InfiniteVL☆54Updated this week
- [NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Tok…☆69Updated this week
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆31Updated 2 months ago
- ☆62Updated 5 months ago