BaguaSys/bagua-net

BaguaSys / bagua-net

High performance NCCL plugin for Bagua.

☆15

Alternatives and similar repositories for bagua-net

Users who are interested in bagua-net are comparing it to the libraries listed below.

BaguaSys / bagua-core
Core communication lib for Bagua.
☆48 · Sep 15, 2021 · Updated 4 years ago
BaguaSys / tutorials
Bagua tutorials.
☆13 · Sep 4, 2022 · Updated 3 years ago
google / nccl-fastsocket
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
☆125 · Nov 15, 2023 · Updated 2 years ago
llnl / Aluminum
High-performance, GPU-aware communication library
☆90 · Dec 16, 2025 · Updated 7 months ago
BaguaSys / operator
Kubernetes operator for Bagua distributed training job.
☆13 · Feb 7, 2023 · Updated 3 years ago
Funatiq / gossip
gossip: Efficient Communication Primitives for Multi-GPU Systems
☆62 · Jul 1, 2022 · Updated 4 years ago
byteps / examples
BytePS examples (Vision, NLP, GAN, etc)
☆19 · Nov 24, 2022 · Updated 3 years ago
magic3007 / MiniJava-Compiler
🕹 Implementation for the lesson Compiling Engineering (2020 Spring) in Peking University, adjusted from UCLA CS 132 Project.
☆10 · Jun 21, 2020 · Updated 6 years ago
exoshuffle / raysort
☆16 · Sep 4, 2023 · Updated 2 years ago
PatrickGuo / Mistify
☆10 · May 16, 2021 · Updated 5 years ago
NVIDIA / atex
A TensorFlow Extension: GPU performance tools for TensorFlow.
☆26 · Jul 27, 2023 · Updated 2 years ago
tgangwani / RegAlloc
Chaitin-Briggs register-allocation algorithm (LLVM back-end)
☆12 · Jan 6, 2016 · Updated 10 years ago
PersiaML / PERSIA
High performance distributed framework for training deep learning recommendation models based on PyTorch.
☆414 · Updated this week
egraphs-good / egraphs-good.github.io
egraphs-good website
☆18 · Mar 10, 2026 · Updated 4 months ago
stepbuystep / LightNAS
You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms
☆12 · Apr 17, 2023 · Updated 3 years ago
ray-project / ray_shuffling_data_loader
A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…
☆18 · Jan 5, 2023 · Updated 3 years ago
kimihe / Falcon
A computation-parallel deep learning architecture.
☆13 · Sep 25, 2019 · Updated 6 years ago
Ther-nullptr / Awesome-Transformer-Accleration
Paper list for acceleration of transformers
☆14 · Jul 1, 2023 · Updated 3 years ago
ezyang / SMT-LIB-benchmarks-pytorch-shapes
SMT-LIB benchmarks for shape computations from deep learning models in PyTorch
☆18 · Dec 21, 2022 · Updated 3 years ago
microsoft / SwitchML
Switch-based Training Acceleration for Machine Learning (SwitchML)
☆16 · Apr 13, 2021 · Updated 5 years ago
oscomp / proj6-user-level-interrupt
Implementing a user-level interrupt hardware mechanism on FPGA and optimizing the operating system kernel
☆10 · Apr 1, 2025 · Updated last year
lzhangbv / dear_pytorch
[ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining
☆12 · Dec 4, 2023 · Updated 2 years ago
MashPlant / decaf-rs
Framework of pa code for THU compiler principle course.
☆13 · Dec 18, 2019 · Updated 6 years ago
markcty / rMiniK8s
A simple containerized application management system like Kubernetes, but written in Rust
☆19 · Jun 25, 2022 · Updated 4 years ago
HKBU-HPML / OMGS-SGD
Layer-wise Sparsification of Distributed Deep Learning
☆10 · Jul 6, 2020 · Updated 6 years ago
clojurists-together / clojuriststogether.org
Clojurists Together site
☆13 · Jul 17, 2026 · Updated last week
Jason-cs18 / Awesome-AI-Systems
Resources for recent AI systems (deployment concerns, cost and accessibility). -- closed
☆12 · May 29, 2021 · Updated 5 years ago
stanford-futuredata / gavel
Code for "Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆139 · Jul 25, 2024 · Updated 2 years ago
Youhe-Jiang / IJCAI2023-OptimalShardedDataParallel
[IJCAI2023] An automated parallel training system that combines the advantages from both data and model parallelism. If you have any inte…
☆52 · May 31, 2023 · Updated 3 years ago
bytedance / QSync
Official repository for "IPDPS '24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".
☆20 · Feb 23, 2024 · Updated 2 years ago
ROCm / rdc
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆30 · Updated this week
GMAP / NPB-GPU
NAS Parallel Benchmarks for evaluating GPU and APIs
☆32 · Sep 29, 2025 · Updated 9 months ago
MengjiaoZhang / DBCL
Code for Double Blind Collaborative Learning (DBCL)
☆14 · May 14, 2021 · Updated 5 years ago
Crisescode / distributed-training-dl
Distributed training with various deep learning (DL) frameworks, including Tensorflow, Tensorflow2, Pytorch, Chainer, Caffe, Mxnet ...
☆22 · Aug 8, 2020 · Updated 5 years ago
xinjin / course-net-seminar
Selected Topics in Computer Networks @ Johns Hopkins University
☆19 · Dec 17, 2020 · Updated 5 years ago
summertriangle-dev / hatedelay
Operating tools for texture bank files.
☆11 · Nov 2, 2016 · Updated 9 years ago
In-Network-Machine-Learning / QCMP
☆19 · Feb 8, 2024 · Updated 2 years ago
S-Lab-System-Group / Primo
Primo: Practical Learning-Augmented Systems with Interpretable Models
☆19 · Dec 26, 2023 · Updated 2 years ago
robclewley / ieee754_simulation
Complete simulation of IEEE 754 fixed and floating point specification to any precision
☆13 · Aug 26, 2020 · Updated 5 years ago