Peidong-Wang / Distributed-TensorFlow-Using-MPILinks
Template for Deploying Distributed TensorFlow on Clusters Using MPI
☆15Updated 6 years ago
Alternatives and similar repositories for Distributed-TensorFlow-Using-MPI
Users that are interested in Distributed-TensorFlow-Using-MPI are comparing it to the libraries listed below
Sorting:
- AdaBatch: Adaptive Batch Sizes for Training Deep Neural Networks☆42Updated 8 years ago
- This repository contains the results and code for the MLPerf™ Training v0.6 benchmark.☆42Updated 2 years ago
- ☆16Updated 3 years ago
- Unofficial pytorch implementation of ReZero in ResNet☆24Updated 5 years ago
- FluidNet re-written with ATen tensor lib☆52Updated 6 years ago
- ☆14Updated 3 years ago
- Tensorflow implementation of preconditioned stochastic gradient descent☆34Updated 2 years ago
- Personal collection of references for high performance mixed precision training.☆41Updated 6 years ago
- Code release to reproduce ASHA experiments from "Random Search and Reproducibility for NAS."☆22Updated 6 years ago
- PyProf2: PyTorch Profiling tool☆82Updated 5 years ago
- ☆70Updated 2 years ago
- Deep learning with a multiplication budget☆47Updated 7 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …☆39Updated 2 years ago
- Introduction to CUDA programming☆129Updated 8 years ago
- Use TensorFlow efficiently☆96Updated 4 years ago
- Example repository for custom C++/CUDA operators for TorchScript☆114Updated 3 years ago
- ☆23Updated 6 years ago
- AutoGrow: Automatic Layer Growing in Deep Convolutional Networks (KDD 2020)☆40Updated 6 years ago
- diffGrad: An Optimization Method for Convolutional Neural Networks☆55Updated 3 years ago
- Code for paper "SWALP: Stochastic Weight Averaging forLow-Precision Training".☆62Updated 6 years ago
- Example code to create and train a Pytorch model using the new C++ frontend.☆17Updated 6 years ago
- NVIDIA GPU tools - monitoring on CLI & web app with multiple agents☆90Updated last year
- Bugfixing fork of Python bindings for the NVIDIA GPU Management Library☆51Updated 8 years ago
- Wasserstein / earth mover's distance visualizations☆66Updated 8 years ago
- The latest version of Net-Trim which solves the actual constrained problem☆23Updated 5 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 8 years ago
- ☆13Updated 8 years ago
- This tool dumps images in tensorboard☆17Updated 5 years ago
- Cost-Effective Object Detection: Active Sample Mining with Switchable Selection Criteria☆12Updated 7 years ago
- implement distributed machine learning with Pytorch + OpenMPI☆53Updated 6 years ago