HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆89Updated 2 months ago
Alternatives and similar repositories for DiffuLLaMA:
Users that are interested in DiffuLLaMA are comparing it to the libraries listed below
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆66Updated last month
- ☆86Updated last year
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆109Updated 11 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆65Updated last year
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆87Updated this week
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆45Updated 3 months ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data"☆25Updated 5 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆48Updated 2 months ago
- Simplified Masked Diffusion Language Model☆273Updated 2 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆77Updated 4 months ago
- Reparameterized Discrete Diffusion Models for Text Generation☆94Updated 2 years ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆35Updated 4 months ago
- ☆80Updated 11 months ago
- ☆44Updated 6 months ago
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"☆32Updated last week
- The official implementation of Self-Exploring Language Models (SELM)☆61Updated 8 months ago
- ☆82Updated 4 months ago
- Stick-breaking attention☆43Updated last month
- Directional Preference Alignment☆56Updated 4 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆42Updated 6 months ago
- ☆71Updated 4 months ago
- ☆51Updated 4 months ago
- ☆51Updated 8 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆29Updated last month
- ☆71Updated 6 months ago
- This repository is maintained to release dataset and models for multimodal puzzle reasoning.☆63Updated 2 weeks ago
- A brief and partial summary of RLHF algorithms.☆93Updated 2 months ago
- ☆28Updated 3 months ago
- Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode…☆96Updated 5 months ago
- Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆104Updated 3 weeks ago