A collection of papers on discrete diffusion models
☆168Jun 30, 2025Updated 8 months ago
Alternatives and similar repositories for discrete-diffusion-papers
Users that are interested in discrete-diffusion-papers are comparing it to the libraries listed below
Sorting:
- (ICCV 2025) Enhance CLIP and MLLM's fine-grained visual representations with generative models.☆77Jun 25, 2025Updated 8 months ago
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]☆74Dec 17, 2025Updated 2 months ago
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation☆235Aug 18, 2025Updated 6 months ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)☆1,591Feb 14, 2026Updated 3 weeks ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆3,609Nov 12, 2025Updated 3 months ago
- Optimizing diffusion for production-ready speeds☆37Jan 10, 2026Updated last month
- [CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs☆105Feb 26, 2026Updated last week
- ☆16May 2, 2023Updated 2 years ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆865Jan 28, 2026Updated last month
- Codebase for the paper-Elucidating the design space of language models for image generation☆46Nov 17, 2024Updated last year
- EleutherAI ML Performance reading group repository (slides, meeting recordings, annotated papers)☆27Dec 19, 2025Updated 2 months ago
- Official PyTorch implementation of TokenSet.☆128Mar 21, 2025Updated 11 months ago
- Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?☆17Jan 6, 2021Updated 5 years ago
- [Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning☆163Sep 15, 2025Updated 5 months ago
- ICML2025☆63Aug 28, 2025Updated 6 months ago
- ☆40Jun 6, 2025Updated 9 months ago
- [ICCV2025] "Di[M]O: Distilling Masked Diffusion Models into One-step Generator", Yuanzhi Zhu, Xi Wang, Stéphane Lathuilière, Vicky Kal…☆32Aug 14, 2025Updated 6 months ago
- Dream 7B, a large diffusion language model☆1,193Nov 21, 2025Updated 3 months ago
- FireQ: Fast INT4-FP8 Kernel and RoPE-aware Quantization for LLM Inference Acceleration☆20Jun 27, 2025Updated 8 months ago
- [AAAI 2025] Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding☆34Mar 21, 2025Updated 11 months ago
- Evaluation codes and data for GenEval2☆58Jan 8, 2026Updated last month
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆59Jan 5, 2026Updated 2 months ago
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format☆22Jun 8, 2018Updated 7 years ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆1,346Feb 3, 2026Updated last month
- ☆27May 3, 2024Updated last year
- 基于ResNet的奶龙识别模型☆29Dec 27, 2024Updated last year
- The official code for ICML 2024 "FedREDefense: Defending against Model Poisoning Attacks for Federated Learning using Model Update Recons…☆29Jun 6, 2024Updated last year
- (ICCV 2025) "Principal Components" Enable A New Language of Images☆79Jul 28, 2025Updated 7 months ago
- R1-onevision, a visual language model capable of deep CoT reasoning.☆576Apr 13, 2025Updated 10 months ago
- [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression☆50Nov 4, 2025Updated 4 months ago
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆30Jan 28, 2026Updated last month
- ☆50Jun 4, 2025Updated 9 months ago
- Some papers about *diverse* image (a few videos) captioning☆26Apr 4, 2023Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)☆29Jan 1, 2024Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- ☆327Dec 16, 2025Updated 2 months ago
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…☆1,360Feb 26, 2026Updated last week