ByteDance-Seed / seed-oss
☆852 Updated 3 months ago
Alternatives and similar repositories for seed-oss
Users who are interested in seed-oss are comparing it to the repositories listed below.
- ☆1,233 Updated last month
- A Scientific Multimodal Foundation Model ☆620 Updated 2 months ago
- Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004) ☆710 Updated last week
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆716 Updated 6 months ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation. ☆769 Updated 3 months ago
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning" ☆244 Updated last month
- OpenCUA: Open Foundations for Computer-Use Agents ☆608 Updated last week
- ☆1,383 Updated last month
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents ☆514 Updated last month
- Scaling RL on advanced reasoning models ☆647 Updated 2 months ago
- ☆818 Updated 6 months ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression" ☆539 Updated last month
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B ☆553 Updated last month
- ☆449 Updated this week
- 🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets ☆874 Updated last month
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow. ☆836 Updated 4 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines ☆871 Updated this week
- Official implementation of "Continuous Autoregressive Language Models" ☆677 Updated 3 weeks ago
- Next paradigm for LLM Agent. Unify plan and action through recursive code generation for adaptive, human-like decision-making. ☆515 Updated 3 weeks ago
- ☆424 Updated last week
- A construction kit for reinforcement learning environment management. ☆252 Updated this week
- ☆299 Updated 3 months ago
- ☆1,164 Updated 2 months ago
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows. ☆407 Updated last week
- 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers. ☆476 Updated last month
- MiniMax-M2, a model built for Max coding & agentic workflows. ☆2,089 Updated last month
- MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers ☆412 Updated 2 months ago
- Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud. ☆388 Updated 2 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL. ☆508 Updated 3 months ago
- open-source coding LLM for software engineering tasks ☆1,073 Updated 2 months ago