awslabs / dynamic-training-with-apache-mxnet-on-aws
Dynamic training with Apache MXNet reduces cost and time for training deep neural networks by leveraging AWS cloud elasticity and scale. The system reduces training cost and time by dynamically updating the training cluster size during training, with minimal impact on model training accuracy.
☆56Updated 2 years ago
Alternatives and similar repositories for dynamic-training-with-apache-mxnet-on-aws:
Users that are interested in dynamic-training-with-apache-mxnet-on-aws are comparing it to the libraries listed below
- This is the documentation for AWS Deep Learning AMIs: your one-stop shop for deep learning in the cloud☆46Updated last year
- Toolkit for allowing inference and serving with MXNet in SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https…☆28Updated last year
- ☆117Updated last year
- Toolkit for running MXNet training scripts on SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https://github.c…☆60Updated last month
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆90Updated 2 years ago
- MxNet to ONNX Exporter☆56Updated 6 years ago
- Natural language processing & computer vision models optimized for AWS☆141Updated 2 years ago
- Reference Lambda function that predicts image labels for a image using an MXNet-built deep learning model. The repo also has pre-built MX…☆133Updated 2 years ago
- Distributed Deep Learning on AWS Using CloudFormation (CFN), MXNet and TensorFlow☆254Updated 5 years ago
- Scripts and instructions to facilitate running Deep Learning Tasks on Amazon EMR☆63Updated last year
- A high performance data access library for machine learning tasks☆74Updated last year
- ONNX model format support for Apache MXNet☆96Updated 6 years ago
- Machine Learning Inference Graph Spec☆21Updated 5 years ago
- ☆59Updated 3 years ago
- MXNet Model Serving☆25Updated 7 years ago
- Serving PyTorch Models on AWS Lambda with Caffe2 & ONNX☆47Updated 7 years ago
- Incubating project for xgboost operator☆76Updated 3 years ago
- Distributed training with SageMaker's script mode using Horovod distributed deep learning framework☆32Updated 5 years ago
- Train and Deploy Machine Learning Models on Kubernetes using Amazon EKS☆164Updated 5 years ago
- GraphPipe for python☆41Updated 6 years ago
- Amazon SageMaker Debugger provides functionality to save tensors during training of machine learning jobs and analyze those tensors☆161Updated 10 months ago
- Toolkit for running PyTorch training scripts on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at https://gith…☆202Updated last month
- ☆23Updated 2 years ago
- Amazon Bin Image Dataset Challenge☆54Updated 7 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Updated 7 years ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆109Updated 2 years ago
- WARNING: This package has been deprecated. Please use the SageMaker Training Toolkit for model training and the SageMaker Inference Toolk…☆186Updated 4 years ago
- A Tutorial for Serving Tensorflow Models using Kubernetes☆87Updated 3 weeks ago
- Re:Invent Inf1 Instance Lab☆22Updated 4 years ago
- SageMaker specific extensions to TensorFlow.☆54Updated 8 months ago