StevenYuan666 / Awesome-Diffusion-Models-for-NLPLinks
π° Must-read papers on Diffusion Models for Text Generation π₯
β18Updated last year
Alternatives and similar repositories for Awesome-Diffusion-Models-for-NLP
Users that are interested in Awesome-Diffusion-Models-for-NLP are comparing it to the libraries listed below
Sorting:
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]β96Updated last year
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)β55Updated last year
- β34Updated last year
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"β58Updated last year
- β25Updated 3 weeks ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"β27Updated last year
- Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Leβ¦β75Updated last year
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Controlβ74Updated 2 years ago
- β139Updated last year
- Reparameterized Discrete Diffusion Models for Text Generationβ100Updated 2 years ago
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".β31Updated last year
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"β83Updated last year
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".β59Updated last year
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Modelsβ21Updated 10 months ago
- A replication of Diffusion-LM Improves Controllable Text Generationβ18Updated 2 years ago
- β27Updated last year
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"β173Updated 5 months ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).β16Updated 7 months ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswalβ¦β53Updated 2 years ago
- Source code of LatentOpsβ78Updated last year
- Reference implementation for Token-level Direct Preference Optimization(TDPO)β146Updated 5 months ago
- Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.β60Updated 4 months ago
- [NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based casual reasoning works, including papers, codes and datasets.β72Updated 5 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".β59Updated last week
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"β38Updated 9 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"β32Updated last year
- Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".β160Updated this week
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional Stβ¦β23Updated last year
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)β61Updated last year
- Mixture of Attention Headsβ48Updated 2 years ago