awslabs / s3-connector-for-pytorch
The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.
☆110 · Updated this week
Related projects:
- ☆168 · Updated last year
- ☆94 · Updated this week
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers ☆40 · Updated last year
- Example code for AWS Neuron SDK developers building inference and training applications ☆120 · Updated 2 weeks ago
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips. ☆193 · Updated this week
- ☆62 · Updated 2 months ago
- ☆46 · Updated last week
- Deploying EFA in EKS utilizing GPUDirectRDMA where supported ☆35 · Updated 3 months ago
- A helper library to connect into Amazon SageMaker with AWS Systems Manager and SSH (Secure Shell) ☆217 · Updated last week
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS. ☆177 · Updated this week
- ☆39 · Updated this week
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It … ☆37 · Updated 2 months ago
- A high performance data access library for machine learning tasks ☆74 · Updated 9 months ago
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h… ☆134 · Updated 3 months ago
- ☆21 · Updated 5 months ago
- Amazon SageMaker Debugger provides functionality to save tensors during training of machine learning jobs and analyze those tensors ☆158 · Updated 4 months ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆324 · Updated last week
- ☆13 · Updated last week
- Hosting code-server on Amazon SageMaker ☆50 · Updated 11 months ago
- ☆14 · Updated 5 months ago
- Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i… ☆442 · Updated this week
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads ☆200 · Updated 9 months ago
- Module, Model, and Tensor Serialization/Deserialization ☆175 · Updated 3 weeks ago
- Infrastructure as code for GPU accelerated managed Kubernetes clusters. ☆45 · Updated 4 months ago
- Staging area for ongoing enhancements to Ray focused on improving integration with AWS and other Amazon technologies. ☆66 · Updated 8 months ago
- KubeFlow on AWS ☆164 · Updated last month
- This is the Docker container based on open source framework XGBoost (https://xgboost.readthedocs.io/en/latest/) to allow customers use th… ☆123 · Updated last week
- Amazon SageMaker Managed Spot Training Examples ☆50 · Updated 2 months ago
- CLI for building Docker images in SageMaker Studio using AWS CodeBuild. ☆57 · Updated 2 years ago
- Implementations of Amazon SageMaker-compatible custom containers for training. ☆25 · Updated 3 years ago