aws-samples / training-llm-on-sagemaker-for-multiple-nodes-with-deepspeedLinks

☆25

Alternatives and similar repositories for training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed

Users that are interested in training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed are comparing it to the libraries listed below

Sorting:

aws-samples / llm-evaluation-methodology
☆44Updated 8 months ago
aws-samples / sagemaker-hosting
☆44Updated last year
yuhuiaws / finetuning-and-deploying-llama-on-Sagemaker
Use the two different methods (deepspeed and SageMaker model parallelism library) to fine tune llama model on Sagemaker. Then deploy the …
☆24Updated last year
aws-neuron / transformers-neuronx
☆111Updated 6 months ago
aws-neuron / aws-neuron-samples
Example code for AWS Neuron SDK developers building inference and training applications
☆148Updated last month
aws-samples / sagemaker-distributed-training-workshop
Hands-on workshop for distributed training and hosting on SageMaker
☆143Updated last week
aws-samples / aiml-genai-multimodal-agent
☆54Updated last year
aws-samples / mlops-e2e
MLOps End-to-End Example using Amazon SageMaker Pipeline, AWS CodePipeline and AWS CDK
☆147Updated 2 months ago
snowolf / alpaca-on-amazon-sagemaker
This is a sample about how to run stanford_alpaca on Amazon SageMaker, only for demo use.
☆14Updated 2 years ago
aws / sagemaker-huggingface-inference-toolkit
☆264Updated 2 months ago
awslabs / extending-the-context-length-of-open-source-llms
☆56Updated 3 weeks ago
aws-samples / aws-samples-for-ray
☆72Updated last year
cohere-ai / cohere-aws
☆62Updated 2 months ago
philschmid / deepspeed-sagemaker-example
☆22Updated 2 years ago
huggingface / optimum-neuron
Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
☆233Updated this week
aws-samples / aws-genai-conversational-rag-reference
AWS Generative AI Conversational RAG Reference (Galileo)
☆77Updated last week
philschmid / sagemaker-huggingface-llama-2-samples
☆88Updated last year
aws / sagemaker-pytorch-inference-toolkit
Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h…
☆141Updated 9 months ago
aws-samples / amazon-sagemaker-bert-pytorch
☆64Updated last year
awslabs / llm-hosting-container
Large Language Model Hosting Container
☆89Updated 2 weeks ago
build-on-aws / bedrock-agent-txt2sql
Use natural language to Generate Amazon Athena SQL queries to fetch data.
☆88Updated 8 months ago
aws-samples / amazon-sagemaker-managed-spot-training
Amazon SageMaker Managed Spot Training Examples
☆51Updated last year
awslabs / agent-evaluation
A generative AI-powered framework for testing virtual agents.
☆263Updated 3 months ago
aws-samples / sagemaker-ssh-helper
A helper library to connect into Amazon SageMaker with AWS Systems Manager and SSH (Secure Shell)
☆247Updated last week
aws-samples / sagemaker-studio-image-build-cli
CLI for building Docker images in SageMaker Studio using AWS CodeBuild.
☆56Updated 3 years ago
aws-samples / aws-do-eks
Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…
☆60Updated last month
aws-samples / llm-apps-workshop
Use LLMs for building real-world apps
☆115Updated 6 months ago
aws-samples / amazon-sagemaker-secure-mlops
☆88Updated 2 years ago
aws / fmeval
Foundation Model Evaluations Library
☆258Updated 2 weeks ago
philschmid / cdk-samples
☆57Updated 3 years ago