StevenYuan666 / Awesome-Diffusion-Models-for-NLPLinks
π° Must-read papers on Diffusion Models for Text Generation π₯
β18Updated last year
Alternatives and similar repositories for Awesome-Diffusion-Models-for-NLP
Users that are interested in Awesome-Diffusion-Models-for-NLP are comparing it to the libraries listed below
Sorting:
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]β98Updated 2 years ago
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".β31Updated 2 years ago
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"β59Updated 2 years ago
- Diffusion Model Improvement Methodβ34Updated 2 years ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"β27Updated last year
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)β55Updated last year
- Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Leβ¦β75Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$β50Updated last year
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".β60Updated last year
- β25Updated 5 months ago
- Reparameterized Discrete Diffusion Models for Text Generationβ104Updated 2 years ago
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Controlβ76Updated 3 years ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoningβ52Updated last year
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)β62Updated last year
- [NeurIPS2023] Official code of "Understanding Contrastive Learning via Distributionally Robust Optimization"β41Updated 2 years ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Modelsβ24Updated last year
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)β35Updated 5 months ago
- β148Updated last year
- β111Updated 2 years ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".β63Updated 4 months ago
- It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) iβ¦β67Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"β33Updated last year
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"β84Updated last year
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswalβ¦β56Updated 2 years ago
- Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.β75Updated 9 months ago
- Official code repo for NeurIPS 2025 Spotlight paper, "Debate or Vote: Which Yields Better Decisions in Multi-Agent LLMs?"β40Updated 2 months ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).β17Updated last year
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignmentβ69Updated 2 years ago
- β29Updated 2 months ago
- [NeurIPS 2024] GITA: Graph to Image-Text Integration for Vision-Language Graph Reasoningβ53Updated last month