Optimized primitives for collective multi-GPU communication
☆25Apr 17, 2024Updated last year
Alternatives and similar repositories for nccl
Users that are interested in nccl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python package for compressing floating-point PyTorch tensors☆13Jul 22, 2024Updated last year
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆154Updated this week
- ☆29Jun 17, 2025Updated 9 months ago
- Fork of Flame repo for training of some new stuff in development☆19Updated this week
- libcaca library to emscripten☆24Mar 10, 2014Updated 12 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Spectre variant 1 exploitation via PRIME+PROBE☆10May 22, 2019Updated 6 years ago
- ☆17Mar 8, 2020Updated 6 years ago
- Unity + TensorRT integration☆15Nov 27, 2018Updated 7 years ago
- A web implementation of cinemagraph☆18Dec 9, 2013Updated 12 years ago
- Helper for handling PySpark DataFrame partition size 📑🎛️☆12Mar 8, 2024Updated 2 years ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 8 months ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆16Oct 11, 2021Updated 4 years ago
- A Clojure natural language generator built on top of University of Aberdeen’s SimpleNLG library.☆32Aug 24, 2017Updated 8 years ago
- Introduction to MLIR and xDSL training course☆19Oct 2, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Meta project around MLIR☆30Updated this week
- Resources for conference program chairs, especially in systems/PL areas of computer science.☆12May 14, 2023Updated 2 years ago
- Separate from hardware and used to learn some NCCL mechanisms☆25Apr 19, 2024Updated last year
- An experimental library for adding attributes to threads (without rewriting the whole thread interface), for C11 and similar.☆18Nov 17, 2025Updated 4 months ago
- This repository contains the results and code for the MLPerf™ Training v1.1 benchmark.☆23May 18, 2023Updated 2 years ago
- A tool to generate slurm topology configuration from infiniband network discovery.☆23Dec 7, 2016Updated 9 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- ¿How to solve logic games using an FPGA? Let's do some experiments!☆11May 30, 2017Updated 8 years ago
- ☆47Dec 13, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Alloy models for automatic synthesis of memory model litmus test suites (from ASPLOS 2017)☆16Jan 26, 2024Updated 2 years ago
- UE4 + Varest + PHP + MySQL实现的一个用户登陆系统☆11Jun 29, 2020Updated 5 years ago
- 基于libjpeg-turbo封装的库,并提供更快速的图片缩放和剪裁功能。☆10Mar 4, 2024Updated 2 years ago
- Generalized Method of Moments estimation☆14Mar 23, 2025Updated last year
- SRv6 IETF 104 Hackathon☆11Dec 8, 2022Updated 3 years ago
- A simple multiple hands tracking implementation based on OpenCV library☆30Apr 1, 2013Updated 13 years ago
- This is a script using webkit to batch download the google satlite maps. As long as providing proper geo information, it can download mul…☆20Oct 20, 2014Updated 11 years ago
- A Python-based desktop client that uses Facebook Messaging as a cloud storage service.☆17Jun 29, 2016Updated 9 years ago
- Implementation of Consistency Models (Song et al 2023) for few-step image generation in Jax.☆19Jun 11, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆22Dec 15, 2023Updated 2 years ago
- A PyTorch native library for training speculative decoding models☆76Updated this week
- Java Virtual Machine in UnrealEngine 4 via JNI☆13Jan 18, 2018Updated 8 years ago
- let a million languages bloom☆22Apr 20, 2025Updated 11 months ago
- Solving Logic Grid Puzzles with Part-of-Speech Tagging and First-Order Logic☆11Dec 18, 2016Updated 9 years ago
- Microsoft Collective Communication Library☆389Sep 20, 2023Updated 2 years ago
- ☆23Aug 21, 2025Updated 7 months ago