DeepLink-org/DLSlime

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DeepLink-org/DLSlime)

DeepLink-org / DLSlime

Composable and Embeddable Communication Runtime for Distributed AI Services

☆102

Alternatives and similar repositories for DLSlime

Users that are interested in DLSlime are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JimyMa / FuncTs
View on GitHub
[DAC2024] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning
☆15Jan 13, 2024Updated 2 years ago
DeepLink-org / DLCompiler
View on GitHub
triton for dsa
☆68Jul 10, 2026Updated last week
DeepLink-org / DIOPI
View on GitHub
☆76Nov 22, 2024Updated last year
KuangjuX / NVSHMEM-Tutorial
View on GitHub
NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer
☆195Feb 11, 2026Updated 5 months ago
DeepLink-org / deeplink.framework
View on GitHub
☆76Oct 31, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
stepfun-ai / StepMesh
View on GitHub
☆377Jan 28, 2026Updated 5 months ago
DeepLink-org / DeepLinkExt
View on GitHub
☆13May 23, 2025Updated last year
TransferQueue / TransferQueue
View on GitHub
[Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…
☆16Jan 16, 2026Updated 6 months ago
sii-research / VCCL
View on GitHub
Venus Collective Communication Library, supported by SII and Infrawaves.
☆151Jun 24, 2026Updated 3 weeks ago
inclusionAI / asystem-amem
View on GitHub
A NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.
☆110Dec 17, 2025Updated 7 months ago
infinigence / FlashOverlap
View on GitHub
A lightweight design for computation-communication overlap.
☆242Jan 20, 2026Updated 6 months ago
DeepLink-org / dlinfer
View on GitHub
☆74Updated this week
cat538 / SKVQ
View on GitHub
[COLM 2024] SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models
☆24Oct 5, 2024Updated last year
Oneflow-Inc / dfccl
View on GitHub
☆26Feb 17, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ByteDance-Seed / Triton-distributed
View on GitHub
Distributed Compiler based on Triton for Parallel Systems
☆1,494Updated this week
kvcache-ai / TrEnv-X
View on GitHub
☆95Sep 15, 2025Updated 10 months ago
DeepLink-org / AIChipBenchmark
View on GitHub
☆35Mar 27, 2026Updated 3 months ago
cherichy / tilecute
View on GitHub
☆32Jul 2, 2025Updated last year
flagos-ai / FlagCX
View on GitHub
FlagCX is a scalable and adaptive cross-chip communication library.
☆218Jul 8, 2026Updated last week
KuangjuX / AttnLink
View on GitHub
An experimental communicating attention kernel based on DeepEP.
☆34Jul 29, 2025Updated 11 months ago
infinigence / FUSCO
View on GitHub
High-performance distributed data shuffling (all-to-all) library for MoE training and inference
☆123Mar 7, 2026Updated 4 months ago
perplexityai / pplx-kernels
View on GitHub
Perplexity GPU Kernels
☆591Nov 7, 2025Updated 8 months ago
DeepLink-org / DLBlas
View on GitHub
DLBlas: clean and efficient kernels
☆43Jul 7, 2026Updated last week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
inclusionAI / AState
View on GitHub
☆41Dec 9, 2025Updated 7 months ago
DeepLink-org / DLOP-Bench
View on GitHub
A benchmark suited especially for deep learning operators
☆42Feb 13, 2023Updated 3 years ago
aibrix / PrisKV
View on GitHub
High Performance KV Cache Store for LLM
☆59May 20, 2026Updated 2 months ago
NVIDIA / nvshmem
View on GitHub
NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process com…
☆560Updated this week
tlc-pack / cutlass_fpA_intB_gemm
View on GitHub
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
☆96Jun 21, 2026Updated 3 weeks ago
DeepLink-org / CVFusion
View on GitHub
CVFusion is an open-source deep learning compiler to fuse the OpenCV operators.
☆33Aug 31, 2022Updated 3 years ago
ai-dynamo / nixl
View on GitHub
NVIDIA Inference Xfer Library (NIXL)
☆1,139Updated this week
aikitoria / nanotrace
View on GitHub
Low overhead tracing library and trace visualizer for pipelined CUDA kernels
☆137Updated this week
meta-pytorch / torchcomms
View on GitHub
torchcomms: a modern PyTorch communications API
☆377Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
uccl-project / uccl
View on GitHub
UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g…
☆1,467Updated this week
nex-agi / NexVenusCL
View on GitHub
Nex Venus Communication Library
☆75Nov 17, 2025Updated 8 months ago
DeepLink-org / DLRouter
View on GitHub
☆20Jun 11, 2026Updated last month
axio-project / FuseLink
View on GitHub
Efficient GPU communication over multiple NICs.
☆29Nov 20, 2025Updated 8 months ago
foundry-org / foundry
View on GitHub
Foundry materializes CUDA graphs along with its execution context to disk to support fast cold start of serving engines.
☆45Jul 8, 2026Updated last week
foundation-model-stack / vllm-triton-backend
View on GitHub
A Triton-only attention backend for vLLM
☆27Updated this week
jiazhihao / attention_superoptimizer
View on GitHub
An Attention Superoptimizer
☆22Jan 20, 2025Updated last year