aws-samples / training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed
☆26 Updated last year
Alternatives and similar repositories for training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed
Users who are interested in training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed are comparing it to the libraries listed below.
- ☆45 Updated last year
- ☆44 Updated 2 months ago
- Use LLMs for building real-world apps ☆112 Updated 9 months ago
- Use the two different methods (deepspeed and SageMaker model parallelism library) to fine tune llama model on Sagemaker. Then deploy the … ☆24 Updated 2 years ago
- Example code for AWS Neuron SDK developers building inference and training applications ☆149 Updated 2 weeks ago
- ☆53 Updated last year
- This repository is part of a blog post that guides users through creating a visual search application using Amazon SageMaker and Amazon E… ☆11 Updated 2 years ago
- ☆22 Updated 2 years ago
- MLOps End-to-End Example using Amazon SageMaker Pipeline, AWS CodePipeline and AWS CDK ☆149 Updated 6 months ago
- ☆89 Updated 2 years ago
- Hands-on workshop for distributed training and hosting on SageMaker ☆148 Updated 2 weeks ago
- Large Language Model Hosting Container ☆90 Updated 3 weeks ago
- Create an Amazon EKS cluster and run a distributed training example ☆29 Updated last year
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h… ☆140 Updated last year
- ☆72 Updated this week
- ☆64 Updated last year
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example… ☆61 Updated last week
- ☆55 Updated 4 months ago
- ☆73 Updated last year
- ☆110 Updated 9 months ago
- CLI for building Docker images in SageMaker Studio using AWS CodeBuild. ☆56 Updated 3 years ago
- ☆63 Updated 6 months ago
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment ☆31 Updated 2 years ago
- ☆268 Updated 6 months ago
- ☆45 Updated 8 months ago
- AWS Generative AI Conversational RAG Reference (Galileo) ☆80 Updated 2 weeks ago
- ☆24 Updated 4 months ago
- Amazon SageMaker Managed Spot Training Examples ☆50 Updated last year
- ☆20 Updated 9 months ago
- ☆57 Updated 3 years ago