Optimized primitives for collective multi-GPU communication
☆25Apr 17, 2024Updated 2 years ago
Alternatives and similar repositories for nccl
Users that are interested in nccl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20Jan 13, 2026Updated 4 months ago
- 华为集合通信性能测试☆16May 27, 2024Updated 2 years ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆155May 6, 2026Updated 3 weeks ago
- Parallel Computing -- Validation Suite: Validation engine for Exascale project benchmarks☆16Mar 26, 2026Updated 2 months ago
- A PyTorch native library for training speculative decoding models☆116Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆44Dec 31, 2021Updated 4 years ago
- Spectre variant 1 exploitation via PRIME+PROBE☆10May 22, 2019Updated 7 years ago
- ☆17Mar 8, 2020Updated 6 years ago
- A flexible and high-performance training framework designed for large-scale foundation model training on AMD GPUs☆98Updated this week
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 10 months ago
- ☆36Apr 1, 2026Updated last month
- go based implementation of BGP's BMP protocol☆123May 19, 2026Updated last week
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆17Oct 11, 2021Updated 4 years ago
- Resources for conference program chairs, especially in systems/PL areas of computer science.☆12May 14, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Separate from hardware and used to learn some NCCL mechanisms☆27Apr 19, 2024Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Training v1.1 benchmark.☆23May 18, 2023Updated 3 years ago
- SKFAC Preconditioner for MindSpore☆12Jul 2, 2021Updated 4 years ago
- A tool to generate slurm topology configuration from infiniband network discovery.☆23Dec 7, 2016Updated 9 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- ¿How to solve logic games using an FPGA? Let's do some experiments!☆11May 30, 2017Updated 8 years ago
- ☆47Dec 13, 2024Updated last year
- Alloy models for automatic synthesis of memory model litmus test suites (from ASPLOS 2017)☆16Jan 26, 2024Updated 2 years ago
- UE4 + Varest + PHP + MySQL实现的一个用户登陆系统☆11Jun 29, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Fast and easy distributed model training examples.☆12Nov 26, 2024Updated last year
- ☆11Dec 20, 2022Updated 3 years ago
- SRv6 IETF 104 Hackathon☆11Dec 8, 2022Updated 3 years ago
- MLIR+EqSat☆26Jan 10, 2026Updated 4 months ago
- Effective Attention Sheds Light On Interpretability - Findings of ACL2021☆11May 16, 2021Updated 5 years ago
- ☆39Feb 28, 2019Updated 7 years ago
- ☆22Dec 15, 2023Updated 2 years ago
- Solving Logic Grid Puzzles with Part-of-Speech Tagging and First-Order Logic☆11Dec 18, 2016Updated 9 years ago
- Microsoft Collective Communication Library☆390Sep 20, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official Implementation of Knowledge Flow Prompting☆35Oct 20, 2025Updated 7 months ago
- ☆17Nov 1, 2023Updated 2 years ago
- a rust port of the lox interpreter described in Crafting Interpreters by Robert Nystrom☆14Jan 10, 2025Updated last year
- K Junior is an MIT licensed open source array language written by Arthur Whitney.☆23Jun 1, 2024Updated last year
- Developing a legal research tool leveraging ChatGPT / GPT-4☆14Mar 10, 2024Updated 2 years ago
- PyTorch distributed training acceleration framework☆55Aug 13, 2025Updated 9 months ago
- MLIR grammar for tree-sitter☆19May 8, 2026Updated 2 weeks ago