JinjieNi / MegaDLMsView external linksLinks
GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 training.
☆322Nov 11, 2025Updated 3 months ago
Alternatives and similar repositories for MegaDLMs
Users that are interested in MegaDLMs are comparing it to the libraries listed below
Sorting:
- The official implementation of dLLM-Var☆31Nov 6, 2025Updated 3 months ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆221Nov 6, 2025Updated 3 months ago
- Offical implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆20Oct 29, 2025Updated 3 months ago
- The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.☆519Nov 11, 2025Updated 3 months ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆45Nov 6, 2025Updated 3 months ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆241Feb 3, 2026Updated last week
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆129May 22, 2025Updated 8 months ago
- On demand communication☆32Feb 4, 2026Updated last week
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆84Jan 24, 2024Updated 2 years ago
- Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support☆288Updated this week
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆26Feb 4, 2026Updated last week
- [ICLR 2026] TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆423Jan 28, 2026Updated 2 weeks ago
- [ICLR'26] Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs☆97Jan 26, 2026Updated 2 weeks ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆59Jan 5, 2026Updated last month
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated 3 weeks ago
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆36Oct 29, 2025Updated 3 months ago
- Solution for N+1 fish, N+2 fish DrivenData competition (2nd place)☆13Sep 12, 2019Updated 6 years ago
- ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents, NeurIPS 2025☆33Nov 15, 2025Updated 2 months ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆3,554Nov 12, 2025Updated 3 months ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆833Jan 28, 2026Updated 2 weeks ago
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]☆73Dec 17, 2025Updated last month
- List of free and checked http, https, socks4 and socks5 proxies☆17Jan 13, 2026Updated last month
- [Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey☆361Nov 1, 2025Updated 3 months ago
- ☆12Jan 2, 2024Updated 2 years ago
- ☆16Oct 12, 2025Updated 4 months ago
- [NeurIPS 2025 D&B (Spotlight🌟)] TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenario☆29Oct 5, 2025Updated 4 months ago
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆28Dec 10, 2025Updated 2 months ago
- CoV: Chain-of-View Prompting for Spatial Reasoning☆50Jan 23, 2026Updated 3 weeks ago
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆101Feb 3, 2026Updated last week
- The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".☆755Feb 3, 2026Updated last week
- VehicleWorld is the first comprehensive multi-device environment for intelligent vehicle interaction that accurately models the complex, …☆21Sep 16, 2025Updated 4 months ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 4 months ago
- A lightweight, reproducible toolkit for LLM-based query reformulation.☆29Jan 3, 2026Updated last month
- Python library to add support for embedding natural code in Python with shared program state.☆23Jan 20, 2026Updated 3 weeks ago
- ☆13Feb 25, 2025Updated 11 months ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated last year
- ☆145Jan 20, 2026Updated 3 weeks ago
- ☆439Jan 29, 2026Updated 2 weeks ago
- Data recipes and robust infrastructure for training AI agents☆94Updated this week