hwang595/PyTorch-parameter-server

Implementation of Parameter Server using PyTorch communication lib

☆41

Alternatives and similar repositories for PyTorch-parameter-server

Users that are interested in PyTorch-parameter-server are comparing it to the libraries listed below.

hwang595 / ps_pytorch
Implement distributed machine learning with PyTorch + OpenMPI
☆53 · Mar 22, 2019 · Updated 7 years ago
srQ-cpc / D-PSGD
Algorithm: Decentralized Parallel Stochastic Gradient Descent
☆48 · Sep 2, 2018 · Updated 7 years ago
hwang595 / Draco
DRACO: Byzantine-resilient Distributed Training via Redundant Gradients
☆23 · Dec 9, 2018 · Updated 7 years ago
jiazhihao / sosp19ae
Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions
☆21 · Apr 15, 2022 · Updated 4 years ago
abhijangda / nextdoor-experiments
☆12 · Dec 8, 2022 · Updated 3 years ago
facebookresearch / stochastic_gradient_push
Stochastic Gradient Push for Distributed Deep Learning
☆172 · Apr 5, 2023 · Updated 3 years ago
xbfu / PyTorch-ParameterServer
An implementation of parameter server framework in PyTorch RPC.
☆12 · Nov 12, 2021 · Updated 4 years ago
congxie1108 / iclr2020_zeno_async
Source code of ICLR2020 submission: Zeno++: Robust Fully Asynchronous SGD
☆14 · Feb 2, 2020 · Updated 6 years ago
byteps / examples
BytePS examples (Vision, NLP, GAN, etc)
☆19 · Nov 24, 2022 · Updated 3 years ago
in-ATP / ATP
☆87 · Dec 13, 2021 · Updated 4 years ago
syshensyshen / pva-faster-rcnn-pytorch-1.0
☆12 · Nov 8, 2019 · Updated 6 years ago
michaelfarrell76 / Distributed-SGD
Parallel SGD, done locally and remote
☆14 · May 19, 2016 · Updated 10 years ago
upenn-acg / gpudrano-static-analysis_v1.0
GPU Drano Static Analysis for GPU programs.
☆25 · Nov 16, 2018 · Updated 7 years ago
synxlin / deep-gradient-compression
[ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
☆226 · Jul 10, 2024 · Updated 2 years ago
YukeWang96 / DSXplore_IPDPS21
Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.
☆13 · Apr 6, 2021 · Updated 5 years ago
deepinsight / mxnet-operator
Tools for ML/MXNet on Kubernetes.
☆43 · Feb 11, 2018 · Updated 8 years ago
HKBU-HPML / ddl-benchmarks
ddl-benchmarks: Benchmarks for Distributed Deep Learning
☆36 · May 29, 2020 · Updated 6 years ago
vals / PhD-Thesis
☆17 · Aug 31, 2017 · Updated 8 years ago
deepx-youjunkim / yj4889-Optimized-Quantization-for-Convolutional-Deep-Neural-Networks-in-Federated-Learning
Federated learning is a distributed learning method that trains a deep network on user devices without collecting data from central serve…
☆13 · Jul 7, 2020 · Updated 6 years ago
hwang595 / ATOMO
Atomo: Communication-efficient Learning via Atomic Sparsification
☆29 · Dec 9, 2018 · Updated 7 years ago
dywsjtu / apparate
Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]
☆24 · Nov 21, 2024 · Updated last year
Mellanox / hw_offload_api_examples
Examples of usage for Mellanox HW offloads
☆17 · Jan 18, 2022 · Updated 4 years ago
petuum / autodist
Simple Distributed Deep Learning on TensorFlow
☆136 · Feb 5, 2026 · Updated 5 months ago
msr-fiddle / pipedream
☆394 · Nov 4, 2022 · Updated 3 years ago
uw-mad-dash / Accordion
Code for reproducing experiments performed for Accordion
☆13 · Jun 11, 2021 · Updated 5 years ago
yandex-research / moshpit-sgd
"Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation
☆30 · Feb 4, 2025 · Updated last year
faniyamokhayyeri / C-GAN
☆12 · Apr 6, 2021 · Updated 5 years ago
loudinthecloud / dpwa
Distributed Learning by Pair-Wise Averaging
☆52 · Oct 31, 2017 · Updated 8 years ago
czkkkkkk / gccl
☆13 · Jan 23, 2021 · Updated 5 years ago
iotb415 / DDP
PyTorch DDP
☆10 · Nov 12, 2019 · Updated 6 years ago
jiahuanluo / Federated-Benchmark
A Benchmark of Real-world Image Dataset for Federated Learning
☆42 · Oct 9, 2019 · Updated 6 years ago
gudiandian / ElasticFlow
☆17 · May 10, 2024 · Updated 2 years ago
l-nic / chipyard
An Agile Chisel-Based SoC Design Framework
☆25 · Dec 29, 2021 · Updated 4 years ago
UofT-EcoSystem / BPPSA-open
The (open-source part of) code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm".
☆13 · Jun 7, 2021 · Updated 5 years ago
IBM / FedMA
Code for Federated Learning with Matched Averaging, ICLR 2020.
☆342 · Dec 5, 2021 · Updated 4 years ago
amplab / cyclades
Cyclades
☆28 · Apr 7, 2018 · Updated 8 years ago
opensmartnic / panic_with_cocotb
☆13 · May 15, 2023 · Updated 3 years ago
HKBU-HPML / MG-WFBP
MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning
☆12 · Apr 26, 2021 · Updated 5 years ago
Ribosome-Packet-Processor / Ribosome
High-Speed Stateful Packet Processor for Programmable Switches
☆13 · Dec 18, 2022 · Updated 3 years ago