ML-GSAI / LLaDA
Official PyTorch implementation for "Large Language Diffusion Models"
☆1,458Updated last week
Alternatives and similar repositories for LLaDA:
Users that are interested in LLaDA are comparing it to the libraries listed below
- Dream 7B, a large diffusion language model☆526Updated last week
- Muon is Scalable for LLM Training☆1,020Updated 3 weeks ago
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆519Updated this week
- Pretraining code for a large-scale depth-recurrent language model☆734Updated this week
- Training Large Language Model to Reason in a Continuous Latent Space☆1,051Updated 2 months ago
- MoBA: Mixture of Block Attention for Long-Context LLMs☆1,733Updated 2 weeks ago
- Code for BLT research paper☆1,445Updated last week
- An Open Large Reasoning Model for Real-World Solutions☆1,482Updated last month
- Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper☆588Updated 3 weeks ago
- Next-Token Prediction is All You Need☆2,076Updated last month
- Official Repo for Open-Reasoner-Zero☆1,850Updated last week
- OLMoE: Open Mixture-of-Experts Language Models☆713Updated last month
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,099Updated last week
- Witness the aha moment of VLM with less than $3.☆3,522Updated last month
- Understanding R1-Zero-Like Training: A Critical Perspective☆845Updated this week
- ☆1,351Updated 4 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,037Updated 2 months ago
- Recipes to scale inference-time compute of open models☆1,051Updated last month
- Democratizing Reinforcement Learning for LLMs☆2,976Updated this week
- Large Reasoning Models☆800Updated 4 months ago
- Muon optimizer: +>30% sample efficiency with <3% wallclock overhead☆575Updated 3 weeks ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆1,359Updated last week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆1,847Updated this week
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆1,980Updated 8 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,684Updated 8 months ago
- LIMO: Less is More for Reasoning☆905Updated last week
- ☆662Updated this week
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆860Updated last month
- A fork to add multimodal model training to open-r1☆1,181Updated 2 months ago
- O1 Replication Journey☆1,983Updated 3 months ago