Langhalsdino / Kubernetes-GPU-Guide
This guide should help fellow researchers and hobbyists easily automate and accelerate their deep learning training with their own Kubernetes GPU cluster.
☆818 Updated 2 years ago
Alternatives and similar repositories for Kubernetes-GPU-Guide
Users that are interested in Kubernetes-GPU-Guide are comparing it to the libraries listed below
- A batch-optimized scaling manager for Kubernetes ☆870 Updated 6 years ago
- A GPU / device extension framework for Kubernetes ☆364 Updated 2 years ago
- Distributed TensorFlow basics and examples of training algorithms ☆642 Updated 6 years ago
- Fabric for Deep Learning (FfDL, pronounced fiddle) is a Deep Learning Platform offering TensorFlow, Caffe, PyTorch etc. as a Service on K… ☆692 Updated 2 months ago
- PyTorch on Kubernetes ☆309 Updated 3 years ago
- Compilation of Dockerfiles with automated builds enabled on the Docker Registry ☆503 Updated 5 years ago
- Studio: Simplify and expedite model building process ☆382 Updated last year
- Simple wrapper for docker-compose to use GPU enabled docker under nvidia-docker ☆224 Updated 7 years ago
- Annotated notes and summaries of the TensorFlow white paper, along with SVG figures and links to documentation ☆434 Updated 6 years ago
- 👩‍🔬 Train and Serve TensorFlow Models at Scale with Kubernetes and Kubeflow on Azure ☆291 Updated 4 years ago
- Automated Machine Learning on Kubernetes ☆1,607 Updated 2 weeks ago
- Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark. ☆623 Updated 6 years ago
- A TensorBoard plugin for visualizing arbitrary tensors in a video as your network trains. ☆462 Updated 6 years ago
- A CNN visualizer ☆1,002 Updated 7 years ago
- PyTorch elastic training ☆728 Updated 3 years ago
- Integration of TensorFlow with other open-source frameworks ☆1,371 Updated 9 months ago
- Distributed ML Training and Fine-Tuning on Kubernetes ☆1,844 Updated this week
- Machine Learning Model Deployment Made Simple ☆719 Updated 6 years ago
- A REST API for Caffe using Docker and Go ☆419 Updated 6 years ago
- An on-premises, bare-metal solution for deploying GPU-powered applications in containers ☆259 Updated 9 years ago
- Input pipeline framework ☆985 Updated last week
- Descriptive Deep Learning ☆822 Updated last year
- Start Tensorboard in Jupyter Notebook ☆459 Updated 3 years ago
- A multi-user, distributed computing environment for running DL model training experiments on Intel® Xeon® Scalable processor-based system… ☆392 Updated last year
- Example for end-to-end machine learning on Kubernetes using Kubeflow and Seldon Core ☆174 Updated 3 years ago
- Deep Learning Dockerfiles ☆157 Updated 4 years ago
- A domain specific language to express machine learning workloads. ☆1,759 Updated 2 years ago
- A low-latency prediction-serving system ☆1,416 Updated 4 years ago
- Tensorflow Best Practices ☆330 Updated 8 years ago
- A language-agnostic interface to TensorBoard ☆779 Updated 7 years ago