The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221
☆31Apr 22, 2025Updated 11 months ago
Alternatives and similar repositories for FlexFusion
Users that are interested in FlexFusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- To pioneer training long-context multi-modal transformer models☆74Aug 8, 2025Updated 8 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆22Mar 25, 2026Updated 2 weeks ago
- ☆34Updated this week
- ☆61Feb 5, 2026Updated 2 months ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆174Feb 11, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.☆95Jan 16, 2026Updated 2 months ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆13Jan 16, 2026Updated 2 months ago
- Cute layout visualization☆32Jan 18, 2026Updated 2 months ago
- GenericMusicClient covering QQMusic, Netease, KuGou and etc.一个泛型的,集成了qq音乐,网易云,酷狗等在内的音乐NuGet库☆12Feb 2, 2023Updated 3 years ago
- Docker image with PHP-FPM 8.1 & caddy on Alpine Linux. A fork of TrafeX/docker-php-nginx☆19Jan 26, 2025Updated last year
- WHU.sb Calendar (initiated by @ParaParty)☆12Updated this week
- Arcaea魔改版XD 合成大風暴☆18Feb 1, 2021Updated 5 years ago
- ☆18May 28, 2024Updated last year
- Sample Codes using NVSHMEM on Multi-GPU☆30Jan 22, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- vLLM Daily Summarization of Merged PRs☆49Updated this week
- Proposal for the next generation of course-oriented IR.☆10Dec 24, 2021Updated 4 years ago
- Vortex: A Flexible and Efficient Sparse Attention Framework☆50Updated this week
- ☆13Jan 28, 2026Updated 2 months ago
- patches for huggingface transformers to save memory☆36Jun 2, 2025Updated 10 months ago
- NexRL is an ultra-loosely-coupled LLM post-training framework.☆104Mar 23, 2026Updated 2 weeks ago
- gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling☆56Updated this week
- Pipeline Parallelism Emulation and Visualization☆81Jan 8, 2026Updated 3 months ago
- ☆26Feb 17, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Asynchronous pipeline parallel optimization☆20Feb 2, 2026Updated 2 months ago
- ☆52Apr 30, 2025Updated 11 months ago
- TaskMet Task-driven Metric Learning for Model Learning☆19Feb 9, 2024Updated 2 years ago
- GPU-accelerated LLM Training Simulator☆51Jun 26, 2025Updated 9 months ago
- Benchmark workloads of Boki☆11Sep 8, 2021Updated 4 years ago
- Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation☆124Feb 27, 2026Updated last month
- ☆29Feb 3, 2026Updated 2 months ago
- This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.☆14May 2, 2024Updated last year
- Multi-task end-to-end predict-then-optimize☆13Apr 28, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- DeepSeek-V3/R1 inference performance simulator☆193Mar 27, 2025Updated last year
- Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.☆471Updated this week
- DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit☆95Updated this week
- a simple API to use CUPTI☆10Aug 19, 2025Updated 7 months ago
- Triton-based Symmetric Memory operators and examples☆94Mar 28, 2026Updated 2 weeks ago
- Github mirror of trition-lang/triton repo.☆154Updated this week
- A parser for PTX 6.5☆13Jun 19, 2023Updated 2 years ago