apple / ml-diffucoder
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
☆544 Updated this week
Alternatives and similar repositories for ml-diffucoder
Users who are interested in ml-diffucoder are comparing it to the repositories listed below.
- Dream 7B, a large diffusion language model ☆816 Updated 3 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆295 Updated 3 weeks ago
- Scaling RL on advanced reasoning models ☆392 Updated this week
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input ☆798 Updated last month
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆314 Updated 8 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling ☆412 Updated last month
- Pretraining code for a large-scale depth-recurrent language model ☆793 Updated last month
- GRadient-INformed MoE ☆263 Updated 9 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆341 Updated 7 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache ☆112 Updated this week
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models ☆726 Updated this week
- Releases from OpenAI Preparedness ☆792 Updated last month
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆566 Updated 3 months ago
- Benchmark environment for evaluating vision-language models (VLMs) on popular video games! ☆280 Updated last month
- Tina: Tiny Reasoning Models via LoRA ☆266 Updated last month
- TPI-LLM: Serving 70b-scale LLMs Efficiently on Low-resource Edge Devices ☆185 Updated last month
- ☆485 Updated this week
- DFloat11: Lossless LLM Compression for Efficient GPU Inference ☆446 Updated last month
- Reverse Engineering Gemma 3n: Google's New Edge-Optimized Language Model ☆222 Updated last month
- Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models ☆216 Updated 2 weeks ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning" ☆235 Updated last week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc. ☆404 Updated this week
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Models ☆1,176 Updated last month
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆527 Updated last month
- PyTorch implementation of models from the Zamba2 series. ☆183 Updated 5 months ago
- ☆179 Updated 7 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling ☆298 Updated this week
- Self-Adapting Language Models ☆697 Updated 3 weeks ago
- Build your own visual reasoning model ☆395 Updated this week
- Muon is Scalable for LLM Training ☆1,093 Updated 3 months ago