QSGD-TF
☆21 · May 15, 2019 · Updated 7 years ago
Alternatives and similar repositories for QSGD-TF
Users who are interested in QSGD-TF are comparing it to the libraries listed below.
- gTop-k S-SGD: A Communication-Efficient Distributed Synchronous SGD Algorithm for Deep Learning ☆37 · Aug 19, 2019 · Updated 6 years ago
- Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727 ☆151 · Oct 29, 2024 · Updated last year
- ☆77 · Jun 7, 2019 · Updated 7 years ago
- Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow) ☆182 · Nov 19, 2018 · Updated 7 years ago
- Layer-wise Sparsification of Distributed Deep Learning ☆10 · Jul 6, 2020 · Updated 5 years ago
- PyTorch distributed backend extension with compression support ☆17 · Mar 24, 2025 · Updated last year
- A Deep Learning Meta-Framework and HPC Benchmarking Library ☆81 · May 23, 2022 · Updated 4 years ago
- Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356 ☆74 · Sep 10, 2020 · Updated 5 years ago
- Simple Hierarchical Count Sketch in Python ☆21 · Jun 3, 2021 · Updated 5 years ago
- GRACE - GRAdient ComprEssion for distributed deep learning ☆141 · Jul 23, 2024 · Updated last year
- MPI for Torch ☆60 · May 22, 2017 · Updated 9 years ago
- Proximal Asynchronous SAGA ☆13 · Nov 30, 2017 · Updated 8 years ago
- ☆10 · Apr 29, 2024 · Updated 2 years ago
- AN EFFICIENT AND GENERAL FRAMEWORK FOR LAYERWISE-ADAPTIVE GRADIENT COMPRESSION ☆16 · Oct 27, 2023 · Updated 2 years ago
- Code for reproducing experiments performed for Accordion ☆13 · Jun 11, 2021 · Updated 5 years ago
- Audio Keyword Search ☆12 · May 5, 2019 · Updated 7 years ago
- Ouroboros: On Accelerating Training of Transformer-Based Language Models ☆10 · Nov 7, 2019 · Updated 6 years ago
- CIKM 2021 Full Paper: FedMatch: Federated Learning Over Heterogeneous Question Answering Data ☆12 · Dec 14, 2021 · Updated 4 years ago
- Arrow Matrix Decomposition - Communication-Efficient Distributed Sparse Matrix Multiplication ☆15 · Mar 25, 2024 · Updated 2 years ago
- Sampled Softmax Implementation for PyTorch ☆44 · Mar 7, 2018 · Updated 8 years ago
- Dynamic Weighted Majority for Imbalance Learning ☆15 · Nov 4, 2019 · Updated 6 years ago
- Code for LIT, ICML 2019 ☆22 · Jun 11, 2019 · Updated 7 years ago
- MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning ☆12 · Apr 26, 2021 · Updated 5 years ago
- A simple script to plot the Roofline model for given HW platforms and applications ☆10 · Mar 17, 2026 · Updated 3 months ago
- Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning" ☆17 · Aug 4, 2020 · Updated 5 years ago
- Code for paper: Variance Reduced Local SGD with Lower Communication Complexity ☆12 · May 20, 2020 · Updated 6 years ago
- Data Driven Dynamic Hybrid Renewable Energy design and simulation framework ☆12 · May 5, 2020 · Updated 6 years ago
- https://arxiv.org/abs/1706.04972 ☆45 · Dec 6, 2018 · Updated 7 years ago
- Implement distributed machine learning with PyTorch + OpenMPI ☆53 · Mar 22, 2019 · Updated 7 years ago
- Convolutional 3D autoencoder ☆14 · Aug 21, 2016 · Updated 9 years ago
- High-Performance Machine Learning Primitives ☆13 · Apr 17, 2021 · Updated 5 years ago
- ZOSVRG-BlackBox-Adv ☆13 · Oct 30, 2018 · Updated 7 years ago
- Sparse Matrix-Matrix Multiplication Benchmark on Intel Xeon and Xeon Phi (KNC, KNL) from blog post: ☆12 · Sep 25, 2016 · Updated 9 years ago
- This repo contains the code used for NeurIPS 2019 paper "Asymmetric Valleys: Beyond Sharp and Flat Local Minima". ☆14 · Oct 25, 2019 · Updated 6 years ago
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training ☆226 · Jul 10, 2024 · Updated last year
- The code for "Improved Deep Leakage from Gradients" (iDLG). ☆165 · Mar 4, 2021 · Updated 5 years ago
- Dark channel Haze removal algorithm with CUDA acceleration (typically 10x or more speedup using an NVIDIA GPU) ☆14 · Dec 7, 2017 · Updated 8 years ago
- ☆19 · Apr 20, 2018 · Updated 8 years ago
- ☆52 · Sep 5, 2020 · Updated 5 years ago