microsoft / tokenweaveLinks
Accepted to MLSys 2026
☆70Updated last week
Alternatives and similar repositories for tokenweave
Users that are interested in tokenweave are comparing it to the libraries listed below
Sorting:
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆161Updated 4 months ago
- A lightweight design for computation-communication overlap.☆219Updated 2 weeks ago
- DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit☆92Updated last week
- DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.☆92Updated 3 weeks ago
- DeeperGEMM: crazy optimized version☆73Updated 9 months ago
- Stateful LLM Serving☆95Updated 10 months ago
- ☆47Updated last year
- ☆51Updated 9 months ago
- Aims to implement dual-port and multi-qp solutions in deepEP ibrc transport☆73Updated 8 months ago
- ☆65Updated 9 months ago
- Tile-based language built for AI computation across all scales☆119Updated last week
- ☆84Updated 3 months ago
- ☆81Updated last week
- Nex Venus Communication Library☆72Updated 2 months ago
- gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling☆53Updated 3 weeks ago
- A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.☆123Updated last month
- SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs