apple / ml-diffucoderLinks
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
☆776Updated 5 months ago
Alternatives and similar repositories for ml-diffucoder
Users that are interested in ml-diffucoder are comparing it to the libraries listed below
Sorting:
- Dream 7B, a large diffusion language model☆1,099Updated 3 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆353Updated 5 months ago
- ☆1,229Updated 3 weeks ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆756Updated 2 months ago
- dLLM: Simple Diffusion Language Modeling☆1,261Updated last week
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆927Updated 6 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆730Updated last week
- Simple & Scalable Pretraining for Neural Architecture Research☆304Updated last week
- [ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆907Updated 5 months ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B☆532Updated 3 weeks ago
- Pretraining and inference code for a large-scale depth-recurrent language model☆852Updated last month
- ☆342Updated last month
- GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…☆296Updated last month
- Scaling RL on advanced reasoning models☆641Updated last month
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆347Updated 6 months ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆691Updated last month
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆462Updated 6 months ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆376Updated 5 months ago
- ☆1,339Updated 3 weeks ago
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆346Updated 11 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆359Updated last year
- Large multi-modal models (L3M) pre-training.☆222Updated 2 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆136Updated 4 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆502Updated this week
- ☆564Updated 2 months ago
- ☆712Updated last week
- The official github repo for "Diffusion Language Models are Super Data Learners".☆208Updated last month
- ☆933Updated last month
- Esoteric Language Models☆108Updated 2 weeks ago
- Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research☆456Updated last week