FlexFusion/FlexFusion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FlexFusion/FlexFusion)

FlexFusion / FlexFusion

The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221

☆31

Alternatives and similar repositories for FlexFusion

Users that are interested in FlexFusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Tele-AI / TeleTron
View on GitHub
To pioneer training long-context multi-modal transformer models
☆75Aug 8, 2025Updated 11 months ago
ChandlerGuan / mercury_artifact
View on GitHub
☆27Oct 1, 2025Updated 9 months ago
muriloboratto / NVSHEMEM
View on GitHub
Sample Codes using NVSHMEM on Multi-GPU
☆30Jan 22, 2023Updated 3 years ago
infinigence / HamiltonAttention
View on GitHub
☆45Oct 15, 2025Updated 9 months ago
leepoly / sm-profiler
View on GitHub
☆83Feb 5, 2026Updated 5 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
TransferQueue / TransferQueue
View on GitHub
[Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…
☆16Jan 16, 2026Updated 6 months ago
KuangjuX / AttnLink
View on GitHub
An experimental communicating attention kernel based on DeepEP.
☆34Jul 29, 2025Updated 11 months ago
TheSnowfield / MergeTempestissimo
View on GitHub
Arcaea魔改版XD 合成大風暴
☆18Feb 1, 2021Updated 5 years ago
inclusionAI / asystem-amem
View on GitHub
A NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.
☆110Dec 17, 2025Updated 7 months ago
oliverYoung2001 / UltraAttn
View on GitHub
SC'25 UltraAttn: Efficiently Parallelizing Attention through Hierarchical Context-Tiling
☆16Aug 14, 2025Updated 11 months ago
CGCL-codes / streambox
View on GitHub
☆18May 28, 2024Updated 2 years ago
Junchao-cs / LIVE
View on GitHub
[ICML 2026] "LIVE: Long-horizon Interactive Video World ModEling"
☆35Jul 15, 2026Updated last week
vipulSharma18 / NCCL-From-First-Principles
View on GitHub
NCCL communication API layer, and transport layer created from first principles.
☆16Aug 20, 2025Updated 11 months ago
ByteDance-Seed / StragglerAnalysis
View on GitHub
☆56Apr 30, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ChandlerGuan / kperfir_artifact
View on GitHub
☆19May 9, 2025Updated last year
pku-minic / next-gen-ir-proposal
View on GitHub
Proposal for the next generation of course-oriented IR.
☆10Dec 24, 2021Updated 4 years ago
chenyu-jiang / dcp
View on GitHub
Code repository for the SOSP'25 paper DCP: Addressing Input Dynamism In Long-Context Training via Dynamic Context Parallelism.
☆21Nov 28, 2025Updated 7 months ago
NVIDIA / nccl-extensions
View on GitHub
Communication patterns for AI, built on top of NCCL device and host APIs
☆18Updated this week
TJU-NSL / awesome-papers
View on GitHub
☆37Updated this week
fzyzcjy / torch_memory_saver
View on GitHub
Allow torch tensor memory to be released and resumed later
☆260Updated this week
radixark / miles_diffusion
View on GitHub
[Experimental] Miles-diffusion is an post-training framework for large-scale diffusion model training and production workloads, forked fr…
☆22Updated this week
foundation-model-stack / vllm-triton-backend
View on GitHub
A Triton-only attention backend for vLLM
☆27Jul 14, 2026Updated last week
nex-agi / NexVenusCL
View on GitHub
Nex Venus Communication Library
☆75Nov 17, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tilde-research / comp-muon-release
View on GitHub
Compositional Muon release
☆23Jun 5, 2026Updated last month
AI-Infra-Team / awesome-papers
View on GitHub
Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc.
☆69Mar 4, 2026Updated 4 months ago
eesast / web
View on GitHub
Web front for EESAST
☆11Updated this week
wassemgtk / MegaScale-Infer-Prototyp
View on GitHub
Prototyp MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism
☆31Apr 4, 2025Updated last year
Xilinx / libdfx
View on GitHub
☆13Jun 14, 2026Updated last month
GeeeekExplorer / transformers-patch
View on GitHub
patches for huggingface transformers to save memory
☆36May 9, 2026Updated 2 months ago
Oneflow-Inc / dfccl
View on GitHub
☆26Feb 17, 2025Updated last year
antgroup / DeepXTrace
View on GitHub
DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.
☆101Jan 16, 2026Updated 6 months ago
nex-agi / NexRL
View on GitHub
NexRL is an ultra-loosely-coupled LLM post-training framework.
☆114May 13, 2026Updated 2 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
kungfu-team / tenplex
View on GitHub
Dynamic resources changes for multi-dimensional parallelism training
☆31Aug 22, 2025Updated 11 months ago
sgl-project / DeepGEMM
View on GitHub
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
☆32Updated this week
lsds / Tempo
View on GitHub
Tempo is a system for declarative, efficient, end-to-end compiled dynamic deep learning
☆30Oct 21, 2025Updated 9 months ago
ISEEKYAN / mbridge
View on GitHub
Bridge Megatron-Core to Hugging Face/Reinforcement Learning
☆226Jun 15, 2026Updated last month
ut-osa / boki-benchmarks
View on GitHub
Benchmark workloads of Boki
☆11Sep 8, 2021Updated 4 years ago
TrustAIRLab / HarmfulSkillBench
View on GitHub
The Official Repository for Paper "HarmfulSkillBench: How Do Harmful Skills Weaponize Your Agents?"
☆15May 2, 2026Updated 2 months ago
PKUFlyingPig / Hadoop_vs_Spark
View on GitHub
研究生课《网络大数据管理理论和应用》大作业项目代码
☆13Dec 31, 2022Updated 3 years ago