shkim0116 / KLASS
[NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"
☆22Updated 2 weeks ago
Alternatives and similar repositories for KLASS
Users that are interested in KLASS are comparing it to the libraries listed below
- Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks…☆42Updated last year
- dParallel: Learnable Parallel Decoding for dLLMs☆53Updated 3 months ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆74Updated 6 months ago
- ☆21Updated 6 months ago
- Model Stock: All we need is just a few fine-tuned models☆128Updated 5 months ago
- Paper Reproduction: Google SCoRE (Training Language Models to Self-Correct via Reinforcement Learning)☆142Updated last year
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆46Updated last year
- Reproduction of DeepSeek-R1☆241Updated 9 months ago
- A hackable, simple, and research-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆35Updated 4 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆361Updated 7 months ago
- Official implementation of "OffsetBias: Leveraging Debiased Data for Tuning Evaluators"☆25Updated last year
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆152Updated 6 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆193Updated 10 months ago
- Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.☆76Updated 9 months ago
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …☆19Updated last year
- Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"☆298Updated last month
- Automatically collects diffusion NLP papers from arXiv. More paper information can be found in another repository, "Diffusion-LM-Papers".☆252Updated last week
- Ongoing research project for code&math LLMs☆24Updated 6 months ago
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence☆22Updated 2 months ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆193Updated last year
- Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]☆89Updated last year
- Reproduce ICLR2025 Energy-Based Diffusion Language Models for Text Generation☆52Updated 5 months ago
- [ICLR 2025] Official PyTorch implementation of "DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation"☆26Updated 6 months ago
- ☆55Updated 7 months ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…☆79Updated last year
- A Collection of Papers on Diffusion Language Models☆151Updated 4 months ago
- ☆68Updated 10 months ago
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆359Updated last year
- Preference Learning for LLaVA☆58Updated last year
- ☆348Updated 5 months ago