Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"
☆84Jan 24, 2024Updated 2 years ago
Alternatives and similar repositories for DiffusionLLM
Users that are interested in DiffusionLLM are comparing it to the libraries listed below
- Diffusion Model Improvement Method☆34Sep 4, 2023Updated 2 years ago
- ☆25Jul 15, 2025Updated 7 months ago
- ☆113May 29, 2023Updated 2 years ago
- Reparameterized Discrete Diffusion Models for Text Generation☆104Feb 14, 2023Updated 3 years ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆205Mar 4, 2025Updated 11 months ago
- [ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models☆828Mar 1, 2024Updated 2 years ago
- A simple Diffusion LM approach.☆27May 22, 2023Updated 2 years ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆82May 30, 2025Updated 9 months ago
- ☆149Feb 27, 2024Updated 2 years ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated last year
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".☆31Dec 21, 2023Updated 2 years ago
- ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models☆339Feb 17, 2024Updated 2 years ago
- [Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …☆63Oct 9, 2024Updated last year
- ☆21Sep 5, 2023Updated 2 years ago
- Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text☆24Aug 15, 2022Updated 3 years ago
- Automatically collects diffusion NLP papers from arXiv. More paper information can be found in another repository, "Diffusion-LM-Papers".☆272Updated this week
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model☆22Aug 5, 2024Updated last year
- Pytorch version of Continuous Language Generative Flow (ACL 2021)☆11Sep 14, 2021Updated 4 years ago
- ☆24Sep 25, 2024Updated last year
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data☆46Oct 15, 2023Updated 2 years ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.☆14Jun 7, 2023Updated 2 years ago
- ☆10Nov 23, 2023Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Jun 11, 2023Updated 2 years ago
- ☆13May 21, 2023Updated 2 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- Source code for the NAACL 2022 main conference paper "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"☆10Sep 26, 2022Updated 3 years ago
- Unofficial implementation of the paper "Exploring the Space of Key-Value-Query Models with Intention"☆12May 24, 2023Updated 2 years ago
- [EMNLP 2023] Context Compression for Auto-regressive Transformers with Sentinel Tokens☆25Nov 6, 2023Updated 2 years ago
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆641Mar 4, 2024Updated last year
- A list of diffusion papers in the NLP domain that I have read, mainly on text generation; the table will continue to be updated.☆77Mar 24, 2025Updated 11 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- ☆33Oct 4, 2024Updated last year
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"☆33Oct 16, 2023Updated 2 years ago
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control☆76Nov 13, 2022Updated 3 years ago
- Recursive Bayesian Networks☆11May 11, 2025Updated 9 months ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated last year