apple / ml-diffucoder
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
☆735Updated 2 months ago
Alternatives and similar repositories for ml-diffucoder
Users who are interested in ml-diffucoder are comparing it to the libraries listed below.
- Dream 7B, a large diffusion language model☆984Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆343Updated 3 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆296Updated last month
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆609Updated last week
- Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆635Updated 3 weeks ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆873Updated 3 months ago
- Scaling RL on advanced reasoning models☆591Updated last month
- Post-training with Tinker☆550Updated this week
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆261Updated last week
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆443Updated 4 months ago
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆832Updated 2 months ago
- ☆816Updated 2 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆343Updated 9 months ago
- ☆773Updated 3 weeks ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆751Updated this week
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆309Updated 4 months ago
- Large multi-modal models (L3M) pre-training.☆170Updated last week
- Pretraining and inference code for a large-scale depth-recurrent language model☆829Updated 3 weeks ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆125Updated last month
- Self-Adapting Language Models☆800Updated 2 months ago
- Esoteric Language Models☆99Updated 2 months ago
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)☆461Updated last week
- Code for the paper: "Learning to Reason without External Rewards"☆357Updated 2 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆472Updated 2 months ago
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆265Updated last month
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆602Updated 6 months ago
- Tina: Tiny Reasoning Models via LoRA☆284Updated last week
- Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models☆220Updated 2 weeks ago
- Benchmark environment for evaluating vision-language models (VLMs) on popular video games!☆305Updated 4 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆552Updated 3 months ago