UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)
☆1,338Apr 30, 2026Updated this week
Alternatives and similar repositories for uccl
Users that are interested in uccl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NVIDIA Inference Xfer Library (NIXL)☆1,011Updated this week
- ☆112Oct 16, 2025Updated 6 months ago
- 📚 TG-EDU综合教育平台 | 支持作业提交📝、批量评分✅、补交申请🔄、团队协作👥、成绩统计📊☆111Mar 24, 2026Updated last month
- Distributed Compiler based on Triton for Parallel Systems☆1,420Apr 22, 2026Updated 2 weeks ago
- ☆362Jan 28, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆72Oct 18, 2025Updated 6 months ago
- Attention-based Deep Reinforcement Learning framework for portfolio allocation on S&P 500 equities. Includes custom environment, policy a…☆164Oct 16, 2025Updated 6 months ago
- To help everyone to build their blog to learn☆49Nov 5, 2025Updated 6 months ago
- MSCCL++: A GPU-driven communication stack for scalable AI applications☆507Updated this week
- A fast communication-overlapping library for tensor/expert parallelism on GPUs.☆1,297Aug 28, 2025Updated 8 months ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆181Feb 11, 2026Updated 2 months ago
- Perplexity GPU Kernels☆570Nov 7, 2025Updated 5 months ago
- The config panel for ai sdk.☆97Nov 2, 2025Updated 6 months ago
- gauss-awesome-recommender-system-engine☆122Oct 6, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Interactively browse multimodal tabular data☆109Updated this week
- FlashInfer: Kernel Library for LLM Serving☆5,544Updated this week
- ☆44Oct 15, 2025Updated 6 months ago
- This project frames the zoning problem as a mixed-integer linear program (MILP) defined over a spatial grid of planning units.☆81Oct 29, 2025Updated 6 months ago
- For dynamic target tracking in flight videos, applicable to various types of unmanned aerial vehicle systems☆86Dec 4, 2025Updated 5 months ago
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.☆5,242Updated this week
- DeeperGEMM: crazy optimized version☆86May 5, 2025Updated last year
- KV cache store for distributed LLM inference☆416Nov 13, 2025Updated 5 months ago
- ☆251Dec 25, 2025Updated 4 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [CVPR 2026] PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.☆3,671Jan 26, 2026Updated 3 months ago
- 开源 AI 命令行工具,将多模型 AI 智能体、智能工作流和规格驱动开发带入您的终端。(An open-source AI command-line tool that brings multi-model AI agents, intelligent workflows,…☆121Nov 23, 2025Updated 5 months ago
- TVM Documentation in Chinese Simplified / TVM 中文文档☆3,723Mar 12, 2026Updated last month
- MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems☆43Oct 17, 2025Updated 6 months ago
- 🔥minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,矿池抽水,矿池中转,矿场运维专用☆3,463Apr 9, 2026Updated 3 weeks ago
- Mirage Persistent Kernel: Compiling LLMs into a MegaKernel☆2,234Updated this week
- ☆138Nov 19, 2025Updated 5 months ago
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆5,928Updated this week
- Fulling is an AI-powered Full-stack Engineer Agent. Built with Next.js, Claude, shadcn/ui, and PostgreSQL. Use kubernetes as infra.☆2,416Apr 2, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.☆9,391Dec 4, 2025Updated 5 months ago
- The next generation deep reinforcement learning tookit☆3,464Jun 16, 2023Updated 2 years ago
- A collection of paper and code for chain of thought finetuning (CoT-Finetuning)☆121Dec 14, 2025Updated 4 months ago
- A lightweight design for computation-communication overlap.☆229Jan 20, 2026Updated 3 months ago
- Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs☆1,009Mar 3, 2026Updated 2 months ago
- A Versatile Point Cloud Processing Framework☆172Sep 30, 2025Updated 7 months ago
- Astron-xmod-shim — Lightweight, declarative middleware for reliably converging AI service workloads.☆101Nov 3, 2025Updated 6 months ago