bytedance / byteps
A high performance and generic framework for distributed DNN training
☆3,660 Updated last year
Alternatives and similar repositories for byteps:
Users who are interested in byteps are comparing it to the libraries listed below.
- An industrial deep learning framework for high-dimension sparse data ☆4,265 Updated 4 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,344 Updated last month
- Kubernetes-native Deep Learning Framework ☆735 Updated last year
- A Flexible and Powerful Parameter Server for large-scale machine learning ☆6,752 Updated last year
- A lightweight parameter server interface ☆1,543 Updated 2 years ago
- Bagua Speeds up PyTorch ☆877 Updated 5 months ago
- A distributed graph deep learning framework. ☆2,900 Updated last year
- Plato, Tencent's high-performance distributed graph computing framework ☆1,903 Updated 3 years ago
- LightSeq: A High Performance Library for Sequence Processing and Generation ☆3,243 Updated last year
- An open-source ebook about the TensorFlow kernel and its implementation mechanisms. ☆2,893 Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators ☆11,953 Updated this week
- Resource scheduling and cluster management for AI ☆2,646 Updated 7 months ago
- An implementation of a deep learning recommendation model (DLRM) ☆3,819 Updated 3 months ago
- Collective communications library with various primitives for multi-machine training. ☆1,255 Updated 3 weeks ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines ☆2,005 Updated 6 years ago
- A flexible, high-performance serving system for machine learning models ☆6,218 Updated this week
- Optimized primitives for collective multi-GPU communication ☆3,393 Updated last week
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU. ☆1,502 Updated last year
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications. ☆2,795 Updated last year
- Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators. ☆3,112 Updated 3 weeks ago
- ☆578 Updated 6 years ago
- A library for efficient similarity search and clustering of dense vectors. ☆32,556 Updated this week
- High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and … ☆3,086 Updated last year
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions. ☆2,709 Updated last year
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear… ☆5,253 Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch ☆8,513 Updated last month
- An optimizer that trains as fast as Adam and as good as SGD. ☆2,910 Updated last year
- Open standard for machine learning interoperability ☆18,313 Updated this week
- Lingvo ☆2,825 Updated last week
- Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on. ☆6,920 Updated 2 weeks ago