bansky-cl / diffusion-nlp-paper-arxivLinks

Automatically fetches diffusion NLP papers from arXiv. More information on the papers can be found in another repository, "Diffusion-LM-Papers".

☆160

Alternatives and similar repositories for diffusion-nlp-paper-arxiv

Users who are interested in diffusion-nlp-paper-arxiv are comparing it to the repositories listed below.

bansky-cl / Diffusion-LM-Papers
A list of diffusion papers in the NLP domain that I have read, mainly on text generation; the table will continue to be updated.
☆60 · Updated 4 months ago
HKUNLP / diffusion-of-thoughts
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
☆173 · Updated 5 months ago
ML-GSAI / SMDM
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆267 · Updated 7 months ago
HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆259 · Updated 2 months ago
igul222 / plaid
☆104 · Updated 2 years ago
justinlovelace / latent-diffusion-for-language
☆139 · Updated last year
kuleshov-group / mdlm
[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model
☆466 · Updated 2 months ago
dllm-reasoning / d1
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆255 · Updated last month
haonan3 / AnchorContext
AnchorAttention: Improved attention for LLMs long-context training
☆212 · Updated 6 months ago
zhjgao / difformer
The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)
☆55 · Updated last year
yihedeng9 / rlhf-summary-notes
A brief and partial summary of RLHF algorithms.
☆131 · Updated 5 months ago
hanyang1999 / discrete-diffusion-papers
A collection of papers on discrete diffusion models
☆156 · Updated last month
HKUNLP / diffusion-vs-ar
[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"
☆69 · Updated 5 months ago
HKUNLP / reparam-discrete-diffusion
Reparameterized Discrete Diffusion Models for Text Generation
☆100 · Updated 2 years ago
ML-GSAI / RADD
Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…
☆57 · Updated 2 months ago
xhan77 / ssd-lm
Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
☆74 · Updated 2 years ago
ericwtodd / function_vectors
Function Vectors in Large Language Models (ICLR 2024)
☆175 · Updated 3 months ago
Furyton / awesome-language-model-analysis
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…
☆85 · Updated 8 months ago
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆174 · Updated last year
yegcjs / DiffusionLLM
Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"
☆83 · Updated last year
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆107 · Updated last month
louaaron / Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
☆612 · Updated last year
TUDB-Labs / MoE-PEFT
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
☆108 · Updated 4 months ago
dtsip / in-context-learning
☆235 · Updated last year
GFNOrg / gfn-lm-tuning
☆184 · Updated last year
shawntan / stickbreaking-attention
Stick-breaking attention
☆59 · Updated last month
kuleshov-group / awesome-discrete-diffusion-models
A curated list for awesome discrete diffusion models resources.
☆416 · Updated 2 months ago
Lingkai-Kong / RE-Control
Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective
☆32 · Updated 6 months ago
Vance0124 / Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization (TDPO)
☆146 · Updated 5 months ago
Joshua-Ren / Learning_dynamics_LLM
☆155 · Updated 2 months ago