Crisescode / distributed-training-dlLinks
各种深度学习(DL)框架分布式训练,包括:Tensorflow、Tensorflow2、Pytorch、Chainer、Caffe、Mxnet ...
☆22Updated 5 years ago
Alternatives and similar repositories for distributed-training-dl
Users that are interested in distributed-training-dl are comparing it to the libraries listed below
Sorting:
- OneFlow models for benchmarking.☆104Updated last year
- DeepLearning Framework Performance Profiling Toolkit☆296Updated 3 years ago
- ☆35Updated 4 years ago
- FastNN provides distributed training examples that use EPL.☆85Updated 3 years ago
- PyTorch On Angel, arming PyTorch with a powerful Parameter Server, which enable PyTorch to train very big models.☆170Updated 3 months ago
- A Fast Muti-processing BERT-Inference System☆102Updated 3 years ago
- pytorch源码阅读 0.2.0 版本☆91Updated 6 years ago
- MLModelCI is a complete MLOps platform for managing, converting, profiling, and deploying MLaaS (Machine Learning-as-a-Service), bridging…☆198Updated 2 years ago
- alibabacloud-aiacc-demo☆43Updated 2 years ago
- Models and examples built with OneFlow☆101Updated last year
- Vector Search Engine base on BRPC + FAISS☆151Updated 6 years ago
- Distributed DataLoader For Pytorch Based On Ray☆24Updated 4 years ago
- ☆25Updated 2 years ago
- The DGL Operator makes it easy to run Deep Graph Library (DGL) graph neural network training on Kubernetes☆44Updated 4 years ago
- oneflow documentation☆69Updated last year
- Simple Dynamic Batching Inference☆145Updated 3 years ago
- ☆57Updated 2 years ago
- InsNet Runs Instance-dependent Neural Networks with Padding-free Dynamic Batching.☆67Updated 4 years ago
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆271Updated 2 years ago
- CUDA 编程指南学习☆31Updated 7 years ago
- ☆219Updated 2 years ago
- ☆129Updated 4 years ago
- Transformer related optimization, including BERT, GPT☆17Updated 2 years ago
- ☆23Updated 2 years ago
- ☆79Updated 2 years ago
- A small deep-learning framework with C++/Python/CUDA☆54Updated 7 years ago
- implement bert in pure c++☆37Updated 5 years ago
- My solutions to the assignments of dlsys course (CSE599G1: Deep Learning System Spring 2017)☆10Updated 8 years ago
- ml模型分布式服务部署:grpc,flask;docker☆76Updated 5 years ago
- AI模型序列化总结☆51Updated 6 years ago