aws-samples / coldstart-recs-on-aws-trainium
End-to-end solution for cold-start recommendations using vLLM, DeepSeek Llama (8B & 70B), and FAISS on AWS Trainium (Trn1) with the Neuron SDK and NeuronX Distributed. Includes LLM-based interest expansion, embedding comparisons (T5 & SentenceTransformers), and scalable retrieval workflows.
☆6Updated this week
Alternatives and similar repositories for coldstart-recs-on-aws-trainium:
Users that are interested in coldstart-recs-on-aws-trainium are comparing it to the libraries listed below
- Amazon SageMaker operator for Kubernetes☆149Updated last year
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…☆58Updated last week
- Kubeflow workshop on EKS. Mainly focus on AWS integration examples. Please go check kubeflow website http://kubeflow.org for other exampl…☆98Updated 4 years ago
- Distributed training using Kubeflow on Amazon EKS☆87Updated this week
- ACK service controller for Amazon SageMaker☆45Updated last week
- ☆145Updated 2 years ago
- ☆46Updated last week
- Deep learning benchmark utility and optimization tips on EKS.☆48Updated 5 years ago
- ☆44Updated last year
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆43Updated last year
- ☆70Updated 10 months ago
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h…☆139Updated 7 months ago
- SageMaker specific extensions to TensorFlow.☆54Updated 9 months ago
- This is the Docker container based on open source framework XGBoost (https://xgboost.readthedocs.io/en/latest/) to allow customers use th…☆137Updated 2 months ago
- WARNING: This package has been deprecated. Please use the SageMaker Training Toolkit for model training and the SageMaker Inference Toolk…☆186Updated 4 years ago
- CLI for building Docker images in SageMaker Studio using AWS CodeBuild.☆56Updated 3 years ago
- Experiment tracking and metric logging for Amazon SageMaker notebooks and model training.☆127Updated last year
- Toolkit for allowing inference and serving with MXNet in SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https…☆28Updated last year
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆42Updated 3 months ago
- ☆107Updated 3 months ago
- ☆24Updated 3 months ago
- This repository contains examples of Docker images that can be used as custom images for KernelGateway Apps in SageMaker Studio☆132Updated 2 years ago
- Example templates for the delivery of custom ML solutions to production so you can get started quickly without having to make too many de…☆71Updated 10 months ago
- Hosting code-server on Amazon SageMaker☆54Updated last year
- CSI Driver of Amazon FSx for Lustre https://aws.amazon.com/fsx/lustre/☆135Updated this week
- A high performance data access library for machine learning tasks☆74Updated last year
- The SageMaker Spark Container is a Docker image used to run data processing workloads with the Spark framework on Amazon SageMaker.☆37Updated 3 months ago
- ☆24Updated last year
- KubeFlow on AWS☆183Updated last month
- Serverless application to monitor an AWS Batch architecture through dashboards.☆61Updated 5 months ago