bytedance / bytepsLinks
A high performance and generic framework for distributed DNN training
☆3,700Updated last year
Alternatives and similar repositories for byteps
Users that are interested in byteps are comparing it to the libraries listed below
Sorting:
- An industrial deep learning framework for high-dimension sparse data☆4,298Updated 11 months ago
- Kubernetes-native Deep Learning Framework☆743Updated last year
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,290Updated 2 years ago
- A lightweight parameter server interface☆1,559Updated 2 years ago
- It is open source ebook about TensorFlow kernel and implementation mechanism.☆2,891Updated 2 years ago
- 腾讯高性能分布式图计算框架Plato☆1,913Updated 4 years ago
- Collective communications library with various primitives for multi-machine training.☆1,352Updated 3 weeks ago
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.☆2,741Updated last year
- An implementation of a deep learning recommendation model (DLRM)☆3,961Updated 2 weeks ago
- Bagua Speeds up PyTorch☆883Updated last year
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,532Updated last month
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,910Updated 2 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆2,018Updated 6 years ago
- MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.☆4,610Updated last year
- Resource scheduling and cluster management for AI☆2,675Updated last year
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training☆1,033Updated 5 months ago
- ☆593Updated 7 years ago
- Baidu Bigflow is an interface that allows for writing distributed computing programs and provides lots of simple, flexible, powerful APIs…☆1,133Updated 3 weeks ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,592Updated last month
- Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.☆3,207Updated last month
- MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.☆5,028Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆12,613Updated this week
- 🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Mod…☆3,266Updated last month
- DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projec…☆1,597Updated 5 months ago
- Optimized primitives for collective multi-GPU communication☆4,051Updated this week
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,544Updated 6 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,440Updated this week
- FeatherCNN is a high performance inference engine for convolutional neural networks.☆1,218Updated 5 years ago
- A benchmark framework for Tensorflow☆1,149Updated last year
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆893Updated 8 months ago