bytedance / byteps
A high performance and generic framework for distributed DNN training
☆3,661Updated last year
Alternatives and similar repositories for byteps:
Users that are interested in byteps are comparing it to the libraries listed below
- An industrial deep learning framework for high-dimension sparse data☆4,273Updated 5 months ago
- It is open source ebook about TensorFlow kernel and implementation mechanism.☆2,895Updated last year
- A Flexible and Powerful Parameter Server for large-scale machine learning☆6,754Updated last year
- Kubernetes-native Deep Learning Framework☆737Updated last year
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,254Updated last year
- A lightweight parameter server interface☆1,544Updated 2 years ago
- Collective communications library with various primitives for multi-machine training.☆1,266Updated this week
- An implementation of a deep learning recommendation model (DLRM)☆3,833Updated 4 months ago
- A distributed graph deep learning framework.☆2,897Updated last year
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,801Updated last year
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,510Updated last year
- 腾讯高性能分布式图计算框架Plato☆1,906Updated 3 years ago
- Bagua Speeds up PyTorch☆878Updated 7 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,403Updated last month
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,536Updated 5 years ago
- Resource scheduling and cluster management for AI☆2,648Updated 8 months ago
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training☆973Updated 5 months ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆12,057Updated this week
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆2,009Updated 6 years ago
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.☆2,709Updated last year
- Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.☆3,124Updated 3 weeks ago
- An Industrial Graph Neural Network Framework☆1,294Updated 8 months ago
- Generate embeddings from large-scale graph-structured data.☆3,399Updated last year
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,295Updated this week
- ☆577Updated 6 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,264Updated this week
- DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foun…☆1,075Updated last month
- Open-source implementation of Google Vizier for hyper parameters tuning☆1,552Updated 5 years ago
- Baidu Bigflow is an interface that allows for writing distributed computing programs and provides lots of simple, flexible, powerful APIs…☆1,135Updated 2 years ago
- Fast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL☆543Updated 4 years ago