bansky-cl / Diffusion-LM-Papers
Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.
☆39Updated 2 weeks ago
Alternatives and similar repositories for Diffusion-LM-Papers:
Users that are interested in Diffusion-LM-Papers are comparing it to the libraries listed below
- Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".☆107Updated this week
- ☆128Updated last year
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)☆54Updated 11 months ago
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".☆27Updated last year
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control☆68Updated 2 years ago
- Reparameterized Discrete Diffusion Models for Text Generation☆96Updated 2 years ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆139Updated last month
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆72Updated last year
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆51Updated 4 months ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data"☆35Updated last month
- ☆93Updated last year
- ☆82Updated last year
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆93Updated last year
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".☆49Updated 10 months ago
- ☆33Updated last year
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024☆22Updated 9 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆124Updated 2 weeks ago
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆151Updated 3 months ago
- ☆25Updated last year
- "Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiw…☆29Updated 11 months ago
- AnchorAttention: Improved attention for LLMs long-context training☆206Updated 2 months ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]☆54Updated last week
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆53Updated 9 months ago
- Stick-breaking attention☆49Updated 3 weeks ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆71Updated last year
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT☆85Updated 3 weeks ago
- ☆73Updated last week
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆82Updated 6 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆130Updated last month
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"☆56Updated last year