aws-samples / coldstart-recs-on-aws-trainium
End-to-end solution for cold-start recommendations using vLLM, DeepSeek Llama (8B & 70B), and FAISS on AWS Trainium (Trn1) with the Neuron SDK and NeuronX Distributed. Includes LLM-based interest expansion, embedding comparisons (T5 & SentenceTransformers), and scalable retrieval workflows.
☆7 Updated last month
Alternatives and similar repositories for coldstart-recs-on-aws-trainium
Users who are interested in coldstart-recs-on-aws-trainium are comparing it to the repositories listed below.
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example… ☆59 Updated last week
- Kubeflow workshop on EKS. Mainly focus on AWS integration examples. Please go check kubeflow website http://kubeflow.org for other exampl… ☆98 Updated 4 years ago
- MLOps on Amazon EKS ☆87 Updated this week
- Amazon SageMaker operator for Kubernetes ☆149 Updated last year
- ☆47 Updated last month
- ACK service controller for Amazon SageMaker ☆47 Updated last week
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It … ☆43 Updated last week
- ☆24 Updated this week
- ☆108 Updated 4 months ago
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers ☆43 Updated last year
- Deploying EFA in EKS utilizing GPUDirectRDMA where supported ☆37 Updated 7 months ago
- Create and manage Amazon SageMaker HyperPod clusters, run distributed model training ☆21 Updated 2 weeks ago
- Deep learning benchmark utility and optimization tips on EKS. ☆48 Updated 5 years ago
- Argoflow-AWS has been superseded by deployKF ☆44 Updated last year
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both… ☆45 Updated 3 weeks ago
- Toolkit for allowing inference and serving with MXNet in SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https… ☆28 Updated last year
- CLI for building Docker images in SageMaker Studio using AWS CodeBuild. ☆56 Updated 3 years ago
- ☆57 Updated 2 weeks ago
- ☆72 Updated 11 months ago
- A Data Platform built for AWS, powered by Kubernetes. ☆148 Updated last year
- Example code for AWS Neuron SDK developers building inference and training applications ☆146 Updated last week
- ☆145 Updated 2 years ago
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads ☆205 Updated last year
- ☆24 Updated last year
- KubeFlow on AWS ☆183 Updated 2 months ago
- SageMaker specific extensions to TensorFlow. ☆54 Updated 10 months ago
- ☆44 Updated last year
- CloudFormation to setup Kubeflow and Sagemaker Operators on EKS ☆25 Updated 2 years ago
- Toolkit for running MXNet training scripts on SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https://github.c… ☆60 Updated 3 months ago
- A TensorFlow Serving solution for use in SageMaker. This repo is now deprecated. ☆173 Updated last year