ByteDance-Seed / VeOmniLinks

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

☆1,374

Alternatives and similar repositories for VeOmni

Users that are interested in VeOmni are comparing it to the libraries listed below

Sorting:

feifeibear / long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
☆605Updated last month
zhuzilin / ring-flash-attention
Ring attention implementation with flash attention
☆923Updated 2 months ago
stepfun-ai / Step3
☆439Updated 3 months ago
NVlabs / Fast-dLLM
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
☆713Updated last week
fla-org / native-sparse-attention
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
☆928Updated 8 months ago
ByteDance-Seed / Seed-Thinking-v1.5
☆819Updated 5 months ago
SandAI-org / MagiAttention
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
☆570Updated this week
alibaba / ROLL
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
☆2,414Updated this week
alibaba / Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
☆1,457Updated 3 weeks ago
MoonshotAI / Moonlight
Muon is Scalable for LLM Training
☆1,372Updated 4 months ago
openpsi-project / ReaLHF
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
☆326Updated 7 months ago
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆2,612Updated last week
flagos-ai / FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
☆416Updated last week
step-law / steplaw
☆207Updated last month
yongliang-wu / DFT
[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.
☆503Updated last month
Open-Reasoner-Zero / Open-Reasoner-Zero
Official Repo for Open-Reasoner-Zero
☆2,069Updated 6 months ago
InternLM / InternEvo
InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…
☆414Updated 3 months ago
sgl-project / SpecForge
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
☆523Updated this week
feifeibear / LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
☆859Updated last year
MoonshotAI / Kimi-VL
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
☆1,119Updated 4 months ago
ByteDance-Seed / ByteCheckpoint
ByteCheckpoint: An Unified Checkpointing Library for LFMs
☆254Updated this week
thu-ml / SpargeAttn
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
☆798Updated 2 weeks ago
BytedTsinghua-SIA / DAPO
An Open-source RL System from ByteDance Seed and Tsinghua AIR
☆1,653Updated 6 months ago
NVlabs / Long-RL
Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
☆669Updated 2 months ago
mit-han-lab / duo-attention
[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
☆507Updated 9 months ago
xlite-dev / Awesome-DiT-Inference
📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉
☆454Updated last week
volcengine / veScale
Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs
☆893Updated last week
bytedance / flux
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
☆1,182Updated 3 months ago
vllm-project / vllm-omni
A framework for efficient model inference with omni-modality models
☆466Updated this week
EvolvingLMMs-Lab / open-r1-multimodal
A fork to add multimodal model training to open-r1
☆1,423Updated 9 months ago