HKUNLP / diffusion-vs-ar
[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"
☆47Updated 2 months ago
Alternatives and similar repositories for diffusion-vs-ar:
Users that are interested in diffusion-vs-ar are comparing it to the libraries listed below
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆55Updated 10 months ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆51Updated 5 months ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆77Updated last week
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆76Updated last year
- ☆89Updated 6 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆120Updated 7 months ago
- Stick-breaking attention☆52Updated last month
- ☆83Updated last year
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆151Updated 5 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆47Updated last month
- GenRM-CoT: Data release for verification rationales☆56Updated 6 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆26Updated last year
- ☆31Updated last year
- Extending context length of visual language models☆11Updated 4 months ago
- Language models scale reliably with over-training and on downstream tasks☆96Updated last year
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆28Updated last year
- ☆95Updated last year
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces…☆41Updated 3 months ago
- ☆45Updated last year
- Directional Preference Alignment☆57Updated 7 months ago
- ☆18Updated 11 months ago
- Self-Supervised Alignment with Mutual Information☆17Updated 11 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆132Updated 7 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆154Updated last month
- ☆90Updated 9 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆91Updated 3 weeks ago
- LL3M: Large Language and Multi-Modal Model in Jax☆72Updated last year
- Sparse Backpropagation for Mixture-of-Expert Training☆29Updated 9 months ago
- ☆51Updated 11 months ago
- A repository for research on medium sized language models.☆76Updated 11 months ago