The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models scaling law..
☆46Nov 6, 2025Updated 6 months ago
Alternatives and similar repositories for Quokka
Users that are interested in Quokka are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated last year
- ☆21Apr 16, 2025Updated last year
- ☆100Nov 17, 2025Updated 6 months ago
- [NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"☆32Jul 6, 2025Updated 10 months ago
- [ICML2025] Official code for "Reinforced Lifelong Editing for Language Models"☆22Feb 23, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆131May 22, 2025Updated last year
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Apr 14, 2025Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 5 months ago
- ☆15Mar 12, 2024Updated 2 years ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆55Jul 15, 2025Updated 10 months ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆64Jan 5, 2026Updated 4 months ago
- A Self-Consistent Robust Error (ICML 2022)☆68Jun 25, 2023Updated 2 years ago
- ☆160Mar 30, 2026Updated 2 months ago
- Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.☆621May 11, 2026Updated 2 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The official github repo for "Diffusion Language Models are Super Data Learners".☆228Nov 6, 2025Updated 6 months ago
- [ASE2024] Mutual Learning-Based Framework for Enhancing Robustness of Code Models via Adversarial Training☆11Sep 13, 2024Updated last year
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆110Sep 18, 2025Updated 8 months ago
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆64Mar 5, 2026Updated 2 months ago
- Flexible and Pluggable Serving Engine for Diffusion LLMs☆69May 2, 2026Updated 3 weeks ago
- ☆20Mar 14, 2022Updated 4 years ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆47Apr 15, 2025Updated last year
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆17Feb 15, 2025Updated last year
- Implementation of <Symbolic Graphics Programming with Large Language Models>☆38Sep 14, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆15Dec 9, 2021Updated 4 years ago
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆56Apr 28, 2026Updated last month
- Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily☆10May 18, 2026Updated last week
- SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis☆69Jul 24, 2025Updated 10 months ago
- ☆18Oct 17, 2024Updated last year
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆23Mar 2, 2025Updated last year
- 🧮 Algebraic Positional Encodings.☆20Aug 20, 2025Updated 9 months ago
- This repository includes the data and scripts utilized in the study titled "Improving LLM-based Verilog Code Generation with Data Augment…☆14Mar 24, 2025Updated last year
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆116Sep 26, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Internal utility libraries for Pkl☆16May 14, 2026Updated 2 weeks ago
- [ICLR 2024] Towards Robust Multi-Modal Reasoning via Model Selection☆14Mar 7, 2024Updated 2 years ago
- ☆68Feb 4, 2026Updated 3 months ago
- GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…☆334Nov 11, 2025Updated 6 months ago
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆26Apr 4, 2026Updated last month
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆17Jan 12, 2026Updated 4 months ago
- [CVPR'26] VisPlay: Self-Evolving Vision-Language Models☆57Feb 25, 2026Updated 3 months ago