JinjieNi / QuokkaLinks

The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models scaling law..

☆42

Alternatives and similar repositories for Quokka

Users that are interested in Quokka are comparing it to the libraries listed below

Sorting:

bigai-nlco / LatentSeek
Official Repository of LatentSeek
☆69Updated 6 months ago
yczhou001 / Awesome-Diffusion-LLM
paper list, tutorial, and nano code snippet for Diffusion Large Language Models.
☆136Updated 5 months ago
RUCBM / DeepCritic
Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"
☆41Updated 5 months ago
sail-sg / AnytimeReasoner
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
☆48Updated 4 months ago
DreamLM / Dream-Coder
☆73Updated 2 weeks ago
OpenSparseLLMs / MoM
☆110Updated 2 months ago
DreamLM / DreamOn
Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas
☆90Updated 2 months ago
ML-GSAI / Diffusion-LLM-Papers
A Collection of Papers on Diffusion Language Models
☆147Updated 2 months ago
horseee / dKV-Cache
[NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models
☆121Updated 6 months ago
ruixin31 / Spurious_Rewards
☆344Updated 4 months ago
haonan3 / V1
V1: Toward Multimodal Reasoning by Designing Auxiliary Task
☆36Updated 7 months ago
UCSB-NLP-Chang / ThinkPrune
☆45Updated 2 months ago
hkust-nlp / mstar
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆69Updated 4 months ago
hkust-nlp / Laser
Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
☆60Updated 6 months ago
ShadeCloak / ADORA
☆46Updated 7 months ago
xuyige / SoftCoT
ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…
☆66Updated 6 months ago
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆168Updated 6 months ago
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆142Updated 4 months ago
multimodal-art-projection / LatentCoT-Horizon
📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.
☆290Updated last month
LeapLabTHU / limit-of-RLVR
repo for paper https://arxiv.org/abs/2504.13837
☆271Updated 5 months ago
PKU-ML / LongPPL
Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"
☆105Updated last month
ryoungj / BoLT
Code for "Reasoning to Learn from Latent Thoughts"
☆122Updated 8 months ago
xiaomi-research / colar
[NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
☆62Updated 4 months ago
ML-GSAI / LLaDA-1.5
☆54Updated 6 months ago
Gen-Verse / dLLM-RL
TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
☆339Updated 2 weeks ago
HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆341Updated 6 months ago
maple-research-lab / LLaDOU
Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models"
☆68Updated 4 months ago
yunfeixie233 / ViGaL
☆62Updated last month
ThinkMorph / ThinkMorph
The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
☆116Updated last week
pixeli99 / Prophet
Official implementation of "Diffusion Language Models Know the Answer Before Decoding"
☆39Updated 2 months ago