aws-neuron / upstreaming-to-vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆12 Updated this week
Alternatives and similar repositories for upstreaming-to-vllm:
Users who are interested in upstreaming-to-vllm are comparing it to the libraries listed below:
- ☆44 Updated last month
- Example code for AWS Neuron SDK developers building inference and training applications ☆140 Updated last month
- ☆87 Updated this week
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac… ☆231 Updated 3 weeks ago
- This repository aims to showcase how to finetune an FM model in Amazon EKS cluster using JupyterHub to provision notebooks and craft both… ☆44 Updated 9 months ago
- ☆59 Updated 9 months ago
- ☆103 Updated 2 months ago
- Mistral on AWS examples for Bedrock & SageMaker ☆60 Updated this week
- ☆41 Updated 4 months ago
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example… ☆54 Updated this week
- ☆76 Updated last week
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS. ☆268 Updated this week
- ☆52 Updated last month
- ☆38 Updated last week
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It … ☆42 Updated last month
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers ☆42 Updated last year
- ☆23 Updated last month
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips. ☆221 Updated this week
- Foundation Model Evaluations Library ☆239 Updated 2 weeks ago
- Secure and scalable MLOps platform on AWS using Terraform. ☆40 Updated this week
- Hands-on workshop for distributed training and hosting on SageMaker ☆133 Updated last month
- ☆13 Updated 2 months ago
- AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Models (FM), Large L… ☆68 Updated this week
- ☆53 Updated this week
- ☆14 Updated 4 months ago
- ☆33 Updated last year
- ☆28 Updated 10 months ago
- A generative AI-powered framework for testing virtual agents. ☆209 Updated 2 weeks ago
- ☆37 Updated 4 months ago
- ☆20 Updated 9 months ago