ByteDance-Seed / Stable-DiffCoderLinks
Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, developed by ByteDance Seed.
☆50Updated last week
Alternatives and similar repositories for Stable-DiffCoder
Users that are interested in Stable-DiffCoder are comparing it to the libraries listed below
Sorting:
- LIMI: Less is More for Agency☆159Updated 3 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆33Updated 4 months ago
- ☆19Updated 10 months ago
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆89Updated 3 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆36Updated 3 months ago
- Official Repository of Native Parallel Reasoner☆100Updated last week
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆49Updated this week
- ☆29Updated 2 months ago
- Official Project Page for Web World Models (https://arxiv.org/abs/2512.23676)☆80Updated 3 weeks ago
- XmodelLM☆38Updated last year
- The open-source code of MetaStone-S1.☆105Updated 5 months ago
- The official implementation of Cross-Task Experience Sharing (COPS)☆29Updated last year
- ☆29Updated last month
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆102Updated 4 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆60Updated last year
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆64Updated this week
- SSRL: Self-Search Reinforcement Learning☆205Updated 5 months ago
- The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"☆135Updated 4 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆88Updated 10 months ago
- ☆29Updated 2 months ago
- This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents wit…☆50Updated 2 weeks ago
- ☆95Updated last week
- Implementation of the paper "Improving Multi-step RAG with Hypergraph-based Memory for Long-context Complex Relational Modeling"☆103Updated last week
- Code for Bolmo: Byteifying the Next Generation of Language Models☆115Updated last month
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆45Updated 6 months ago
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge☆94Updated last week
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆36Updated 2 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆251Updated 2 months ago
- ☆66Updated this week