RUCAIBox / Awesome-Text-Diffusion-Models
[IJCAI'23] The official GitHub page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".
☆54 · Updated last year
Alternatives and similar repositories for Awesome-Text-Diffusion-Models
Users who are interested in Awesome-Text-Diffusion-Models compare it to the repositories listed below.
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024] ☆96 · Updated last year
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control ☆70 · Updated 2 years ago
- Source code of LatentOps ☆78 · Updated last year
- ☆25 · Updated 2 years ago
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024) ☆54 · Updated last year
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022) ☆54 · Updated 2 years ago
- ☆31 · Updated last year
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise" ☆57 · Updated last year
- [ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process ☆28 · Updated 10 months ago
- [ICLR 2024] EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling (https://arxiv.org/abs/2310.04691) ☆123 · Updated last year
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation ☆153 · Updated 2 years ago
- ☆34 · Updated last year
- A paper list about diffusion models for natural language processing. ☆182 · Updated last year
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆55 · Updated 11 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆55 · Updated last year
- Reparameterized Discrete Diffusion Models for Text Generation ☆99 · Updated 2 years ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs ☆72 · Updated 2 years ago
- Official Implementation for the ICML2022 paper "Directed Acyclic Transformer for Non-Autoregressive Machine Translation" ☆125 · Updated last year
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning ☆164 · Updated last year
- The original Backpack Language Model implementation, a fork of FlashAttention ☆69 · Updated 2 years ago
- ☆34 · Updated 2 years ago
- This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch" ☆39 · Updated 2 years ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning ☆39 · Updated last year
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models ☆23 · Updated 10 months ago
- Language modeling via stochastic processes. Oral @ ICLR 2022. ☆138 · Updated 2 years ago
- Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings) ☆42 · Updated last year
- Official code for the NAACL 2022 paper "Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text… ☆35 · Updated 2 years ago
- Do Large Language Models Know What They Don’t Know? ☆97 · Updated 7 months ago
- ☆136 · Updated last year
- contrastive decoding ☆201 · Updated 2 years ago