StevenYuan666 / Awesome-Diffusion-Models-for-NLP
📰 Must-read papers on Diffusion Models for Text Generation 🔥
☆18 Updated last year
Alternatives and similar repositories for Awesome-Diffusion-Models-for-NLP
Users that are interested in Awesome-Diffusion-Models-for-NLP are comparing it to the libraries listed below
- ☆34 Updated 2 years ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024] ☆98 Updated 2 years ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning" ☆27 Updated last year
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey". ☆31 Updated last year
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024) ☆55 Updated last year
- ☆14 Updated last year
- Official code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le… ☆75 Updated last year
- ☆25 Updated 2 months ago
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise" ☆59 Updated 2 years ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning ☆49 Updated last year
- It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) i… ☆63 Updated last year
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey". ☆59 Updated last year
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024) ☆57 Updated 10 months ago
- LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs ☆34 Updated last month
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control ☆75 Updated 2 years ago
- [AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning ☆15 Updated last year
- Reference implementation for Token-level Direct Preference Optimization (TDPO) ☆147 Updated 7 months ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut… ☆22 Updated 9 months ago
- Reparameterized Discrete Diffusion Models for Text Generation ☆101 Updated 2 years ago
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024) ☆67 Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ ☆48 Updated 10 months ago
- This is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation" ☆39 Updated 11 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆61 Updated last year
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models ☆23 Updated 11 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning" ☆83 Updated last year
- A trainable user simulator ☆34 Updated 2 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆77 Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆32 Updated last year
- ☆140 Updated last year
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023). ☆16 Updated 8 months ago