ByteDance-Seed / seed-ossLinks
☆816Updated 2 weeks ago
Alternatives and similar repositories for seed-oss
Users that are interested in seed-oss are comparing it to the libraries listed below
Sorting:
- A Scientific Multimodal Foundation Model☆574Updated last month
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆635Updated 3 weeks ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆751Updated this week
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆440Updated last week
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆552Updated 3 months ago
- open-source coding LLM for software engineering tasks☆955Updated 3 months ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆609Updated last week
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆186Updated 2 weeks ago
- OpenCUA: Open Foundations for Computer-Use Agents☆488Updated 2 weeks ago
- ☆683Updated this week
- ☆816Updated 3 months ago
- ☆966Updated last week
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆692Updated 2 months ago
- ☆202Updated 2 weeks ago
- MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.☆387Updated last week
- All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.☆462Updated this week
- Scaling RL on advanced reasoning models☆591Updated last month
- MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers☆333Updated this week
- 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.☆442Updated last month
- The official repository of the dots.llm1 base and instruct models proposed by rednote-hilab.☆465Updated last month
- ☆293Updated 4 months ago
- Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.☆752Updated 2 months ago
- Speed Always Wins: A Survey on Efficient Architectures for Large Language Models☆337Updated last month
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆443Updated 4 months ago
- Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…☆2,339Updated last week
- MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning.☆236Updated last month
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆432Updated 3 weeks ago
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆409Updated 3 weeks ago
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆735Updated 2 months ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆353Updated last month