RUCAIBox / Awesome-Text-Diffusion-Models
[IJCAI'23] The official GitHub page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".
☆47Updated 3 months ago
Related projects:
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control☆63Updated last year
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆86Updated last year
- Source code of LatentOps☆77Updated 10 months ago
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)☆54Updated last year
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)☆49Updated 4 months ago
- ☆25Updated last year
- ☆32Updated last year
- ☆32Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆62Updated 7 months ago
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"☆50Updated last year
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation☆150Updated last year
- ☆155Updated last month
- The original Backpack Language Model implementation, a fork of FlashAttention☆63Updated last year
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts" (EMNLP 2023)☆31Updated 5 months ago
- Official code for the NAACL 2022 paper "Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text…☆32Updated 2 years ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆61Updated last year
- Language modeling via stochastic processes. Oral @ ICLR 2022.☆134Updated last year
- A paper list about diffusion models for natural language processing.☆170Updated last year
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆36Updated 2 weeks ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆133Updated 3 months ago
- [ICLR 2024] EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling (https://arxiv.org/abs/2310.04691)☆111Updated 6 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆59Updated 7 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆33Updated 6 months ago
- [ACL 2024] A Prospector of Long-Dependency Data for Large Language Models☆48Updated last month
- Contrastive decoding☆174Updated last year
- ☆26Updated last year
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆33Updated 2 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆72Updated 6 months ago
- [Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome.☆50Updated last year
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings☆52Updated 3 months ago