UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)
☆1,414Jun 12, 2026Updated this week
Alternatives and similar repositories for uccl
Users that are interested in uccl are comparing it to the libraries listed below.
- NVIDIA Inference Xfer Library (NIXL)☆1,079Updated this week
- ☆112Oct 16, 2025Updated 8 months ago
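- 📚 TG-EDU comprehensive education platform | Supports assignment submission 📝, batch grading ✅, late-submission requests 🔄, team collaboration 👥, and grade statistics 📊☆112Mar 24, 2026Updated 2 months ago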
- Distributed Compiler based on Triton for Parallel Systems☆1,459Apr 22, 2026Updated last month
- ☆367Jan 28, 2026Updated 4 months ago
- ☆72Oct 18, 2025Updated 7 months ago
- Attention-based Deep Reinforcement Learning framework for portfolio allocation on S&P 500 equities. Includes custom environment, policy a…☆164Oct 16, 2025Updated 7 months ago
- Helps everyone build a blog for learning☆48Nov 5, 2025Updated 7 months ago
- MSCCL++: A GPU-driven communication stack for scalable AI applications☆532Updated this week
- A fast communication-overlapping library for tensor/expert parallelism on GPUs.☆1,323Aug 28, 2025Updated 9 months ago
- Perplexity GPU Kernels☆586Nov 7, 2025Updated 7 months ago
- The config panel for ai sdk.☆97Nov 2, 2025Updated 7 months ago
- gauss-awesome-recommender-system-engine☆122Oct 6, 2025Updated 8 months ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆192Feb 11, 2026Updated 4 months ago
- Interactively browse multimodal tabular data☆112May 6, 2026Updated last month
- FlashInfer: Kernel Library for LLM Serving☆5,791Updated this week
- ☆45Oct 15, 2025Updated 8 months ago
- This project frames the zoning problem as a mixed-integer linear program (MILP) defined over a spatial grid of planning units.☆81Oct 29, 2025Updated 7 months ago
- For dynamic target tracking in flight videos, applicable to various types of unmanned aerial vehicle systems☆86Dec 4, 2025Updated 6 months ago
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.☆5,569Updated this week
- DeeperGEMM: crazy optimized version☆86May 5, 2025Updated last year
- KV cache store for distributed LLM inference☆421Nov 13, 2025Updated 7 months ago
- ☆261Dec 25, 2025Updated 5 months ago
- [CVPR 2026] PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.☆3,705Updated this week
- TVM Documentation in Simplified Chinese☆3,800May 20, 2026Updated 3 weeks ago
- MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems☆44Oct 17, 2025Updated 7 months ago
- An open-source AI command-line tool that brings multi-model AI agents, intelligent workflows, and spec-driven development to your terminal.☆125Nov 23, 2025Updated 6 months ago
- 🔥minerproxy: mining pool fee skimming, mining pool relay, built for mining farm operations☆3,632May 22, 2026Updated 3 weeks ago
- Mirage Persistent Kernel: Compiling LLMs into a MegaKernel☆2,305Updated this week
- ☆140Nov 19, 2025Updated 6 months ago
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆6,489Updated this week
- Fulling is an AI-powered Full-stack Engineer Agent. Built with Next.js, Claude, shadcn/ui, and PostgreSQL. Use kubernetes as infra.☆2,422May 22, 2026Updated 3 weeks ago
- OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.☆9,401Dec 4, 2025Updated 6 months ago
- The next generation deep reinforcement learning toolkit☆3,463Jun 16, 2023Updated 2 years ago
- A collection of paper and code for chain of thought finetuning (CoT-Finetuning)☆122Dec 14, 2025Updated 6 months ago
- A lightweight design for computation-communication overlap.☆237Jan 20, 2026Updated 4 months ago
- Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs☆1,024Mar 3, 2026Updated 3 months ago
- A Versatile Point Cloud Processing Framework☆172Sep 30, 2025Updated 8 months ago
- 💰The only official version💰 minerproxy: mining pool fee skimming, mining pool proxy, mining pool relay, mining pool…☆3,872Updated this week