☆57Feb 5, 2026Updated 3 weeks ago
Alternatives and similar repositories for awsome-inference
Users that are interested in awsome-inference are comparing it to the libraries listed below
Sorting:
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆396Updated this week
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac…☆255Apr 11, 2025Updated 10 months ago
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆46May 29, 2025Updated 9 months ago
- ☆20Dec 9, 2025Updated 2 months ago
- This Guidance shows how to build an Amazon Elastic Compute Cloud (Amazon EC2) Spot placement score tracker to monitor unused Amazon EC2 S…☆48Feb 12, 2026Updated 2 weeks ago
- Scripts to customize AWS ParallelCluster☆28Sep 5, 2025Updated 5 months ago
- ☆12Nov 6, 2024Updated last year
- ☆12Oct 8, 2021Updated 4 years ago
- ☆133Updated this week
- Notebooks and sample code for Build On Trainium☆47Jan 14, 2026Updated last month
- ☆12Nov 28, 2024Updated last year
- ☆14Dec 20, 2025Updated 2 months ago
- Some crazy experiments☆35Sep 3, 2025Updated 6 months ago
- ☆18Oct 8, 2024Updated last year
- ☆84Feb 17, 2026Updated 2 weeks ago
- ☆18Nov 13, 2023Updated 2 years ago
- ☆19Jun 9, 2024Updated last year
- Examples showing use of NGC containers and models withing Amazon SageMaker☆17Oct 4, 2022Updated 3 years ago
- ☆18Feb 5, 2025Updated last year
- Mistral on AWS examples for Bedrock & SageMaker☆89Updated this week
- This project contains the webapp sample integrated with AWS HealthOmics, which allows users such as admin and bioinformaticians to operat…☆19Nov 20, 2024Updated last year
- ☆30Feb 2, 2026Updated last month
- Distributed preprocessing and data loading for language datasets☆40Apr 10, 2024Updated last year
- A hardware-agnostic (NVIDIA's GPUs and AWS Inferentia accelerators) deployment of computer-vision models (e.g., YOLO, ViT), generate tex…☆27Jul 1, 2025Updated 8 months ago
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…☆224Feb 21, 2026Updated last week
- Infrastructure as code for GPU accelerated managed Kubernetes clusters.☆57Apr 30, 2025Updated 10 months ago
- A rust client for the Crossref-API☆20Mar 22, 2024Updated last year
- A CLI tool that helps manage training jobs on the SageMaker HyperPod clusters orchestrated by Amazon EKS☆33Feb 25, 2026Updated last week
- Use the two different methods (deepspeed and SageMaker model parallelism library) to fine tune llama model on Sagemaker. Then deploy the …☆24Aug 1, 2023Updated 2 years ago
- Hands-on workshop for distributed training and hosting on SageMaker☆152Nov 4, 2025Updated 3 months ago
- Zero administration inference with AWS Lambda for 🤗☆63Feb 21, 2022Updated 4 years ago
- Hosts the files for a Red Hat OpenShift Service on AWS workshop.☆31Aug 22, 2024Updated last year
- ☆28Mar 17, 2025Updated 11 months ago
- Manage AWS ParallelCluster through an easy to use web interface☆67Mar 13, 2023Updated 2 years ago
- A Next.js sample app utilizing AWS Amplify, AWS AppSync, and Amazon Bedrock to develop an AI-powered Recipe Generator☆29Jan 29, 2025Updated last year
- ☆33Jul 7, 2022Updated 3 years ago
- ☆33Feb 22, 2024Updated 2 years ago
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…☆64Feb 20, 2026Updated last week
- Create an Amazon EKS cluster and run a distributed training example☆29Aug 19, 2024Updated last year