hanyang1999 / discrete-diffusion-papers
A collection of papers on discrete diffusion models
☆168 · Jun 30, 2025 · Updated 7 months ago
Alternatives and similar repositories for discrete-diffusion-papers
Users who are interested in discrete-diffusion-papers are comparing it to the repositories listed below
- A Collection of Papers on Diffusion Language Models ☆157 · Sep 15, 2025 · Updated 4 months ago
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025] ☆73 · Dec 17, 2025 · Updated last month
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation ☆236 · Aug 18, 2025 · Updated 5 months ago
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Models ☆1,574 · Nov 16, 2025 · Updated 2 months ago
- Official PyTorch implementation for "Large Language Diffusion Models" ☆3,554 · Nov 12, 2025 · Updated 3 months ago
- [ICML 2025] The Diffusion Duality ☆187 · Dec 27, 2025 · Updated last month
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding" ☆833 · Jan 28, 2026 · Updated 2 weeks ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2 ☆78 · Dec 18, 2024 · Updated last year
- Codebase for the paper "Elucidating the design space of language models for image generation" ☆46 · Nov 17, 2024 · Updated last year
- Official PyTorch implementation of TokenSet. ☆127 · Mar 21, 2025 · Updated 10 months ago
- ☆42 · Aug 5, 2025 · Updated 6 months ago
- Code for ICML 2020 paper: Do RNN and LSTM have Long Memory? ☆17 · Jan 6, 2021 · Updated 5 years ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation ☆61 · Updated this week
- ICML2025 ☆63 · Aug 28, 2025 · Updated 5 months ago
- ☆40 · Jun 6, 2025 · Updated 8 months ago
- Dream 7B, a large diffusion language model ☆1,164 · Nov 21, 2025 · Updated 2 months ago
- Evaluation codes and data for GenEval2 ☆55 · Jan 8, 2026 · Updated last month
- ☆27 · Apr 11, 2023 · Updated 2 years ago
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format ☆22 · Jun 8, 2018 · Updated 7 years ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in… ☆1,329 · Feb 3, 2026 · Updated last week
- The official code for ICML 2024 "FedREDefense: Defending against Model Poisoning Attacks for Federated Learning using Model Update Recons… ☆29 · Jun 6, 2024 · Updated last year
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers" ☆166 · Nov 5, 2024 · Updated last year
- (ICCV 2025) "Principal Components" Enable A New Language of Images ☆78 · Jul 28, 2025 · Updated 6 months ago
- R1-onevision, a visual language model capable of deep CoT reasoning. ☆575 · Apr 13, 2025 · Updated 10 months ago
- Some papers about *diverse* image (a few videos) captioning ☆26 · Apr 4, 2023 · Updated 2 years ago
- [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression ☆50 · Nov 4, 2025 · Updated 3 months ago
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to… ☆30 · Jan 28, 2026 · Updated 2 weeks ago
- ☆319 · Dec 16, 2025 · Updated last month
- Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos ☆28 · Dec 8, 2023 · Updated 2 years ago
- Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop) ☆29 · Jan 1, 2024 · Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 · May 25, 2024 · Updated last year
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas… ☆1,349 · Dec 7, 2025 · Updated 2 months ago
- tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF) ☆29 · Dec 12, 2023 · Updated 2 years ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023) ☆32 · May 15, 2023 · Updated 2 years ago
- An auxiliary project analysis of the characteristics of KV in DiT Attention. ☆32 · Nov 29, 2024 · Updated last year
- Official Implementation of LaViDa: A Large Diffusion Language Model for Multimodal Understanding ☆195 · Dec 17, 2025 · Updated last month
- Code repository for "Multi-Task Encoder-Dual-Decoder Modeling Framework on Mixed Frequency Data", International Journal of Forecasting, 2… ☆12 · Feb 18, 2024 · Updated last year
- Code for MetaMorph: Multimodal Understanding and Generation via Instruction Tuning ☆234 · Jan 22, 2026 · Updated 3 weeks ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning" ☆402 · Jan 26, 2026 · Updated 2 weeks ago