aws-samples / training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed
☆25 Updated last year
Alternatives and similar repositories for training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed
Users who are interested in training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed are comparing it to the repositories listed below
- ☆44 Updated 7 months ago
- AWS Generative AI Conversational RAG Reference (Galileo) ☆76 Updated this week
- ☆54 Updated last year
- ☆44 Updated last year
- Use two different methods (DeepSpeed and the SageMaker model parallelism library) to fine-tune a Llama model on SageMaker. Then deploy the … ☆24 Updated last year
- SageMaker Studio Docker CLI Extension ☆13 Updated last year
- This is a sample about how to run stanford_alpaca on Amazon SageMaker, only for demo use. ☆14 Updated last year
- ☆28 Updated last month
- ☆72 Updated 11 months ago
- ☆14 Updated last year
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example… ☆59 Updated last week
- ☆22 Updated 2 years ago
- ☆45 Updated 3 months ago
- Example code for AWS Neuron SDK developers building inference and training applications ☆146 Updated last week
- ☆39 Updated last month
- Hands-on workshop for distributed training and hosting on SageMaker ☆139 Updated last week
- ☆20 Updated last year
- Sample solution to build a deployment pipeline for Amazon SageMaker. ☆13 Updated 2 years ago
- Create and manage Amazon SageMaker HyperPod clusters, run distributed model training ☆21 Updated 2 weeks ago
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It … ☆43 Updated last week
- MLOps End-to-End Example using Amazon SageMaker Pipeline, AWS CodePipeline and AWS CDK ☆143 Updated last month
- This repository is part of a blog post that guides users through creating a visual search application using Amazon SageMaker and Amazon E… ☆11 Updated last year
- ☆23 Updated 2 months ago
- ☆24 Updated this week
- Deploy and scale distributed python applications on Amazon EKS using Ray ☆14 Updated 2 weeks ago
- ☆57 Updated 3 years ago
- ☆47 Updated last month
- CLI for building Docker images in SageMaker Studio using AWS CodeBuild. ☆56 Updated 3 years ago
- Create an Amazon EKS cluster and run a distributed training example ☆28 Updated 9 months ago
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h… ☆142 Updated 7 months ago