[ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"
☆63Dec 26, 2025Updated 5 months ago
Alternatives and similar repositories for ReFusion
Users who are interested in ReFusion are comparing it to the libraries listed below.
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding☆33Jan 27, 2026Updated 4 months ago
- ☆49May 16, 2026Updated 3 weeks ago
- ☆14Apr 25, 2025Updated last year
- ☆31May 30, 2025Updated last year
- ☆21Oct 4, 2025Updated 8 months ago
- This is the official implementation for paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond".☆22Nov 17, 2025Updated 6 months ago
- ☆20Mar 18, 2026Updated 2 months ago
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆53Jul 11, 2025Updated 11 months ago
- ☆24May 23, 2025Updated last year
- ☆30Jun 23, 2025Updated 11 months ago
- ☆39May 20, 2025Updated last year
- [ICLR 2026 🔥] Official pytorch implementation for "Attention Is All You Need for KV Cache in Diffusion LLMs"☆42Jan 23, 2026Updated 4 months ago
- ☆11Jul 24, 2023Updated 2 years ago
- Official implementation for "Pruning Large Language Models with Semi-Structural Adaptive Sparse Training" (AAAI 2025)☆19Jul 1, 2025Updated 11 months ago
- Repository for MetaVC -- A Meta Local Search Framework For Minimum Vertex Cover (MinVC)☆10Jan 15, 2022Updated 4 years ago
- Minimal PyTorch implementation of TP, SP, FSDP and sharded-EMA☆32Nov 27, 2025Updated 6 months ago
- AlignX-Family is an open-source research suite for advancing personalization in large language models-spanning data, code, models, and be…☆20Jan 12, 2026Updated 5 months ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- ☆15Feb 8, 2023Updated 3 years ago
- ☆19Jun 29, 2025Updated 11 months ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- ☆13Sep 20, 2020Updated 5 years ago
- ☆18Oct 16, 2024Updated last year
- ☆21Nov 13, 2018Updated 7 years ago
- Multi-Layer Key-Value sharing experiments on Pythia models☆34Jun 14, 2024Updated last year
- ☆61Dec 10, 2025Updated 6 months ago
- ☆13Mar 9, 2024Updated 2 years ago
- bootstrap my zsh shell☆17Mar 28, 2026Updated 2 months ago
- Near-linear time algorithm for computing near-maximum independent set☆19Mar 19, 2022Updated 4 years ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- [ICML2026] ARLArena☆79May 2, 2026Updated last month
- (CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction☆148Mar 6, 2025Updated last year
- Customized Inference Engine for Multiverse Models☆25Jun 27, 2025Updated 11 months ago
- ☆37Feb 12, 2025Updated last year
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- Code and Data for ACL 2025 Paper "Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework".☆25Oct 3, 2025Updated 8 months ago
- ☆17Nov 7, 2024Updated last year
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Oct 3, 2024Updated last year
- MeshArt: Generating Articulated Meshes with Structure-Guided Transformers (CVPR2025)☆55Jun 9, 2025Updated last year