baidu-research / tensorflow-allreduce
☆374 Updated 7 years ago
Related projects
Alternatives and complementary repositories for tensorflow-allreduce
- ☆568 Updated 6 years ago
- [Deprecated] The TensorFlow Profiler (TFProf) UI provides a visual interface for profiling TensorFlow models. ☆136 Updated 5 years ago
- Deep learning system course ☆218 Updated 5 years ago
- Reliable Allreduce and Broadcast Interface for distributed machine learning ☆507 Updated 4 years ago
- GPU-specialized parameter server for GPU machine learning. ☆100 Updated 6 years ago
- Distributed Factorization Machines ☆296 Updated 8 years ago
- PMLS-Caffe: Distributed Deep Learning Framework for Parallel ML System ☆194 Updated 6 years ago
- Distributed TensorFlow basics and examples of training algorithms ☆643 Updated 6 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory ☆291 Updated 5 years ago
- Benchmarking State-of-the-Art Deep Learning Software Tools ☆170 Updated 7 years ago
- Scripts with example usage of tensorflow profiler ☆83 Updated 7 years ago
- Documentation for StreamExecutor open source proposal ☆83 Updated 8 years ago
- Collective communications library with various primitives for multi-machine training. ☆1,227 Updated this week
- ☆127 Updated 6 years ago
- auto-tuning momentum SGD optimizer ☆423 Updated 6 years ago
- Papers and blogs related to distributed deep learning ☆97 Updated 6 years ago
- moved to https://github.com/dmlc/ps-lite ☆649 Updated 9 years ago
- A common bricks library for building scalable and portable distributed machine learning. ☆865 Updated 5 months ago
- LR and FM models solved with FTRL and SGD, parallelized on MPI ☆111 Updated 6 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines ☆126 Updated 7 years ago
- CS294; AI For Systems and Systems For AI ☆221 Updated 5 years ago
- Minimal numerical computation library with TensorFlow APIs ☆301 Updated 5 years ago
- Distributed LR and FM models on a parameter server, with FTRL and SGD optimization algorithms. ☆221 Updated 6 years ago
- CVPR 2017 Tutorial ☆330 Updated 7 years ago
- HugeCTR is a high-efficiency GPU framework designed for training Click-Through-Rate (CTR) estimation models ☆946 Updated last month
- DAWNBench: An End-to-End Deep Learning Benchmark and Competition ☆262 Updated 4 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets ☆308 Updated 7 years ago
- Assignment 1: automatic differentiation ☆475 Updated 5 years ago
- a parameter server for distributed machine learning applications ☆105 Updated 8 years ago
- A lightweight parameter server interface ☆1,539 Updated last year