bansky-cl / diffusion-nlp-paper-arxiv
Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion_NLP_Papers".
☆55Updated this week
Related projects: ⓘ
- ☆32Updated last year
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)☆49Updated 4 months ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆86Updated last year
- Source code of LatentOps☆77Updated 10 months ago
- ☆98Updated 6 months ago
- Reparameterized Discrete Diffusion Models for Text Generation☆90Updated last year
- ☆68Updated 10 months ago
- Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆65Updated 6 months ago
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control☆63Updated last year
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆96Updated last year
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".☆47Updated 3 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆61Updated last year
- ☆127Updated 2 years ago
- ☆25Updated last year
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆78Updated last week
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".☆19Updated 8 months ago
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models☆56Updated 7 months ago
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆80Updated 3 years ago
- contrastive decoding☆174Updated last year
- 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆52Updated 3 weeks ago
- Language modeling via stochastic processes. Oral @ ICLR 2022.☆134Updated last year
- Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks…☆31Updated 8 months ago
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)☆54Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆64Updated 6 months ago
- ☆60Updated 2 years ago
- ☆21Updated 7 months ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆78Updated last year
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆54Updated 9 months ago
- ☆40Updated 5 months ago
- Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.☆23Updated last month