yegcjs/DiffusionLLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yegcjs/DiffusionLLM)

yegcjs / DiffusionLLM

Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"

☆84

Alternatives and similar repositories for DiffusionLLM

Users that are interested in DiffusionLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LCM-Lab / Bridge_Gap_Diffusion
View on GitHub
Diffusion Model Improvement Method
☆35Sep 4, 2023Updated 2 years ago
yegcjs / DINOISER
View on GitHub
☆26Jul 15, 2025Updated last year
HKUNLP / reparam-discrete-diffusion
View on GitHub
Reparameterized Discrete Diffusion Models for Text Generation
☆108Feb 14, 2023Updated 3 years ago
igul222 / plaid
View on GitHub
☆115May 29, 2023Updated 3 years ago
justinlovelace / latent-diffusion-for-language
View on GitHub
☆157Feb 27, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Shark-NLP / DiffuSeq
View on GitHub
[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
☆837Mar 1, 2024Updated 2 years ago
zhjgao / difformer
View on GitHub
The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)
☆56Apr 23, 2024Updated 2 years ago
HKUNLP / diffusion-of-thoughts
View on GitHub
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
☆213Mar 4, 2025Updated last year
thorinf / simple-diffusion-lm
View on GitHub
A simple DIffusion LM approach.
☆27May 22, 2023Updated 3 years ago
AoiDragon / Awesome-Text-Diffusion-Models
View on GitHub
[IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".
☆33Dec 21, 2023Updated 2 years ago
Hzfinfdu / Diffusion-BERT
View on GitHub
ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
☆342Feb 17, 2024Updated 2 years ago
VPeterV / RankSpace-Models
View on GitHub
source code for NAACL2022 main conference "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"
☆10Sep 26, 2022Updated 3 years ago
ZetangForward / CMD-Context-aware-Model-self-Detoxification
View on GitHub
CMD: a framework for Context-aware Model self-Detoxification (EMNLP2024 Long Paper)
☆17Feb 10, 2025Updated last year
hsiehjackson / Mr.Right
View on GitHub
Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text
☆24Aug 15, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
lzw-lzw / UnifiedMLLM
View on GitHub
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
☆22Aug 5, 2024Updated last year
Doraemonzzz / nanoTransNormer
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
LCM-Lab / LOGO
View on GitHub
Code for paper: Long cOntext aliGnment via efficient preference Optimization
☆26Oct 10, 2025Updated 9 months ago
UCSC-VLAA / Sight-Beyond-Text
View on GitHub
[TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
☆20Sep 15, 2023Updated 2 years ago
LCM-Lab / L-CITEEVAL
View on GitHub
Evaluating the faithfulness of long-context language models
☆30Oct 21, 2024Updated last year
robert-lieck / RBN
View on GitHub
Recursive Bayesian Networks
☆11May 11, 2025Updated last year
bansky-cl / Diffusion-LM-Papers
View on GitHub
Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.
☆79Mar 24, 2025Updated last year
cambridgeltl / multi3woz
View on GitHub
The official repository for Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapte…
☆17Jan 15, 2024Updated 2 years ago
wutong4012 / AR-Diffusion
View on GitHub
[NIPS 2023] AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation
☆12May 19, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
aranciokov / FSMMDA_VideoRetrieval
View on GitHub
☆10Nov 23, 2023Updated 2 years ago
dangxingyu / rnn-icrag
View on GitHub
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27Apr 17, 2024Updated 2 years ago
THUNLP-MT / PromptGating4MCTG
View on GitHub
This is the repo for our work “An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation” (ACL 2023).
☆14Jul 23, 2023Updated 2 years ago
allenai / EmbeddingRecycling
View on GitHub
Embedding Recycling for Language models
☆38Jul 11, 2023Updated 3 years ago
kakaobrain / hqtransformer
View on GitHub
Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)
☆29Feb 14, 2024Updated 2 years ago
algo-reasoning / algo-reasoning.github.io
View on GitHub
Neural Algorithmic Reasoning Tutorial
☆11Dec 21, 2022Updated 3 years ago
yyDing1 / GNER
View on GitHub
[ACL 2024 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"
☆60Mar 20, 2024Updated 2 years ago
ByungKwanLee / Phantom
View on GitHub
[Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …
☆63Oct 9, 2024Updated last year
0uO / Dual-learning
View on GitHub
Implementation of Dual Learning NMT & Joint Training on tensorflow
☆12Dec 29, 2018Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
StanfordMIMI / villa
View on GitHub
[ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data
☆45Oct 15, 2023Updated 2 years ago
whyNLP / LCKV
View on GitHub
Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…
☆157Apr 7, 2025Updated last year
UniX-AI-Lab / WorldReasonBench
View on GitHub
WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors
☆22May 19, 2026Updated 2 months ago
GATECH-EIC / Linearized-LLM
View on GitHub
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
☆35Jun 12, 2024Updated 2 years ago
princeton-nlp / LLM-Shearing
View on GitHub
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
☆640Mar 4, 2024Updated 2 years ago
e-bug / fine-grained-evals
View on GitHub
[ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"
☆13Jun 11, 2023Updated 3 years ago
trestad / mitigating-reversal-curse
View on GitHub
Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'
☆14Aug 2, 2024Updated last year