bytedance / byteps
A high performance and generic framework for distributed DNN training
☆3,676Updated last year
Alternatives and similar repositories for byteps:
Users that are interested in byteps are comparing it to the libraries listed below
- A lightweight parameter server interface☆1,544Updated 2 years ago
- Kubernetes-native Deep Learning Framework☆740Updated last year
- It is open source ebook about TensorFlow kernel and implementation mechanism.☆2,893Updated last year
- A distributed graph deep learning framework.☆2,898Updated last year
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,265Updated last year
- An industrial deep learning framework for high-dimension sparse data☆4,279Updated 7 months ago
- Collective communications library with various primitives for multi-machine training.☆1,289Updated this week
- Resource scheduling and cluster management for AI☆2,659Updated 10 months ago
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,868Updated 2 years ago
- 腾讯高性能分布式图计算框架Plato☆1,913Updated 3 years ago
- An Industrial Graph Neural Network Framework☆1,304Updated 9 months ago
- A Flexible and Powerful Parameter Server for large-scale machine learning☆6,748Updated last year
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆2,013Updated 6 years ago
- Bagua Speeds up PyTorch☆881Updated 8 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,449Updated this week
- ☆582Updated 7 years ago
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,519Updated 2 weeks ago
- MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.☆5,012Updated 10 months ago
- Largest multi-label image database; ResNet-101 model; 80.73% top-1 acc on ImageNet☆3,068Updated 3 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆12,227Updated this week
- Lightweight and Scalable framework that combines mainstream algorithms of Click-Through-Rate prediction based computational DAG, philosop…☆674Updated 5 years ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,539Updated 5 years ago
- Reference implementations of MLPerf™ training benchmarks☆1,660Updated 2 weeks ago
- An implementation of a deep learning recommendation model (DLRM)☆3,857Updated 6 months ago
- High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and …☆3,091Updated last year
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training☆992Updated last month
- PyTorch extensions for high performance and large scale training.☆3,306Updated 2 weeks ago
- A common bricks library for building scalable and portable distributed machine learning.☆868Updated last week
- Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.☆6,921Updated 3 months ago
- 图解tensorflow 源码☆2,177Updated 8 years ago