The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models scaling law..
☆45Nov 6, 2025Updated 4 months ago
Alternatives and similar repositories for Quokka
Users that are interested in Quokka are comparing it to the libraries listed below
Sorting:
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated 9 months ago
- ☆20Apr 16, 2025Updated 10 months ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆35Jan 16, 2026Updated last month
- ☆15Mar 12, 2024Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 2 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Apr 14, 2025Updated 10 months ago
- [NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"☆30Jul 6, 2025Updated 8 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆52Jul 15, 2025Updated 7 months ago
- ☆47Apr 9, 2025Updated 11 months ago
- ☆42Feb 12, 2026Updated 3 weeks ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆47Apr 15, 2025Updated 10 months ago
- [ArXiv 2025] Denial-of-Service Poisoning Attacks on Large Language Models☆23Oct 22, 2024Updated last year
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆106Sep 18, 2025Updated 5 months ago
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆47Sep 8, 2025Updated 6 months ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆60Jan 5, 2026Updated 2 months ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- The official implement of paper "Does Federated Learning Really Need Backpropagation?"☆23Feb 9, 2023Updated 3 years ago
- [TMLR 2025] On Memorization in Diffusion Models☆31Oct 5, 2023Updated 2 years ago
- ☆39Aug 28, 2025Updated 6 months ago
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆62Feb 6, 2026Updated last month
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆66Dec 23, 2025Updated 2 months ago
- [EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia☆138Dec 21, 2024Updated last year
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- Graph Diffusion Policy Optimization☆42Mar 17, 2024Updated last year
- A lightweight script for processing HTML page to markdown format with support for code blocks☆82Apr 14, 2024Updated last year
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆89Sep 26, 2024Updated last year
- ☆50Aug 21, 2025Updated 6 months ago
- Trending projects & awesome papers about data-centric llm studies.☆40May 20, 2025Updated 9 months ago
- A pytorch implementation of FFTNet.☆37Aug 31, 2018Updated 7 years ago
- A Gym for Agentic LLMs☆455Jan 21, 2026Updated last month
- [ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models☆46Dec 4, 2024Updated last year
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Debiasing Through Data Attribution☆12May 23, 2024Updated last year
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- ☆13Updated this week
- The JOS from MIT open course☆11Dec 21, 2011Updated 14 years ago