UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)
☆1,373May 22, 2026Updated this week
Alternatives and similar repositories for uccl
Users that are interested in uccl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NVIDIA Inference Xfer Library (NIXL)☆1,041Updated this week
- ☆112Oct 16, 2025Updated 7 months ago
- 📚 TG-EDU综合教育平台 | 支持作业提交📝、批量评分✅、补交申请🔄、团队协作👥、成绩统计📊☆112Mar 24, 2026Updated 2 months ago
- Distributed Compiler based on Triton for Parallel Systems☆1,440Apr 22, 2026Updated last month
- ☆365Jan 28, 2026Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆72Oct 18, 2025Updated 7 months ago
- Attention-based Deep Reinforcement Learning framework for portfolio allocation on S&P 500 equities. Includes custom environment, policy a…☆164Oct 16, 2025Updated 7 months ago
- To help everyone to build their blog to learn☆48Nov 5, 2025Updated 6 months ago
- MSCCL++: A GPU-driven communication stack for scalable AI applications☆520Updated this week
- A fast communication-overlapping library for tensor/expert parallelism on GPUs.☆1,309Aug 28, 2025Updated 8 months ago
- Perplexity GPU Kernels☆576Nov 7, 2025Updated 6 months ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆190Feb 11, 2026Updated 3 months ago
- The config panel for ai sdk.☆97Nov 2, 2025Updated 6 months ago
- gauss-awesome-recommender-system-engine☆122Oct 6, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Interactively browse multimodal tabular data☆110May 6, 2026Updated 2 weeks ago
- FlashInfer: Kernel Library for LLM Serving☆5,666Updated this week
- ☆45Oct 15, 2025Updated 7 months ago
- This project frames the zoning problem as a mixed-integer linear program (MILP) defined over a spatial grid of planning units.☆81Oct 29, 2025Updated 6 months ago
- For dynamic target tracking in flight videos, applicable to various types of unmanned aerial vehicle systems☆86Dec 4, 2025Updated 5 months ago
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.☆5,401Updated this week
- DeeperGEMM: crazy optimized version☆86May 5, 2025Updated last year
- KV cache store for distributed LLM inference☆419Nov 13, 2025Updated 6 months ago
- ☆257Dec 25, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [CVPR 2026] PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.☆3,686May 18, 2026Updated last week
- 开源 AI 命令行工具,将多模型 AI 智能体、智能工作流和规格驱动开发带入您的终端。(An open-source AI command-line tool that brings multi-model AI agents, intelligent workflows,…☆124Nov 23, 2025Updated 6 months ago
- TVM Documentation in Chinese Simplified / TVM 中文文档☆3,761Updated this week
- MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems☆44Oct 17, 2025Updated 7 months ago
- 🔥minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,矿池抽水,矿池中转,矿场运维专用☆3,560May 18, 2026Updated last week
- Mirage Persistent Kernel: Compiling LLMs into a MegaKernel☆2,271Updated this week
- ☆140Nov 19, 2025Updated 6 months ago
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆6,278Updated this week
- Fulling is an AI-powered Full-stack Engineer Agent. Built with Next.js, Claude, shadcn/ui, and PostgreSQL. Use kubernetes as infra.☆2,424Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.☆9,395Dec 4, 2025Updated 5 months ago
- The next generation deep reinforcement learning tookit☆3,464Jun 16, 2023Updated 2 years ago
- A collection of paper and code for chain of thought finetuning (CoT-Finetuning)☆121Dec 14, 2025Updated 5 months ago
- A lightweight design for computation-communication overlap.☆234Jan 20, 2026Updated 4 months ago
- Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs☆1,014Mar 3, 2026Updated 2 months ago
- A Versatile Point Cloud Processing Framework☆172Sep 30, 2025Updated 7 months ago
- Astron-xmod-shim — Lightweight, declarative middleware for reliably converging AI service workloads.☆102Nov 3, 2025Updated 6 months ago