The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models scaling law..
☆45Nov 6, 2025Updated 4 months ago
Alternatives and similar repositories for Quokka
Users that are interested in Quokka are comparing it to the libraries listed below
Sorting:
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated 9 months ago
- ☆20Apr 16, 2025Updated 10 months ago
- ☆15Mar 12, 2024Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 2 months ago
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆37Jan 23, 2024Updated 2 years ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Apr 14, 2025Updated 10 months ago
- [NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"☆30Jul 6, 2025Updated 8 months ago
- [ICML2025] Official code for "Reinforced Lifelong Editing for Language Models"☆21Feb 23, 2025Updated last year
- A Self-Consistent Robust Error (ICML 2022)☆69Jun 25, 2023Updated 2 years ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆52Jul 15, 2025Updated 7 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆130May 22, 2025Updated 9 months ago
- ☆42Feb 12, 2026Updated 3 weeks ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆106Sep 18, 2025Updated 5 months ago
- ☆20Mar 14, 2022Updated 3 years ago
- [ArXiv 2025] Denial-of-Service Poisoning Attacks on Large Language Models☆23Oct 22, 2024Updated last year
- ☆149Feb 25, 2026Updated last week
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆47Sep 8, 2025Updated 6 months ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- The official implement of paper "Does Federated Learning Really Need Backpropagation?"☆23Feb 9, 2023Updated 3 years ago
- ☆39Aug 28, 2025Updated 6 months ago
- [TMLR 2025] On Memorization in Diffusion Models☆31Oct 5, 2023Updated 2 years ago
- VisPlay: Self-Evolving Vision-Language Models☆47Feb 25, 2026Updated last week
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆62Feb 6, 2026Updated last month
- [ASE2024] Mutual Learning-Based Framework for Enhancing Robustness of Code Models via Adversarial Training☆11Sep 13, 2024Updated last year
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆223Nov 6, 2025Updated 4 months ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆66Dec 23, 2025Updated 2 months ago
- [EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia☆138Dec 21, 2024Updated last year
- Graph Diffusion Policy Optimization☆42Mar 17, 2024Updated last year
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- Defeating the Training-Inference Mismatch via FP16☆183Nov 14, 2025Updated 3 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆82Apr 14, 2024Updated last year
- SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)☆336Dec 15, 2025Updated 2 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆89Sep 26, 2024Updated last year
- Trending projects & awesome papers about data-centric llm studies.☆40May 20, 2025Updated 9 months ago
- A pytorch implementation of FFTNet.☆37Aug 31, 2018Updated 7 years ago
- A Gym for Agentic LLMs☆455Jan 21, 2026Updated last month
- [ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models☆45Dec 4, 2024Updated last year
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago