philschmid / huggingface-inferentia2-samples
☆10Updated last year
Alternatives and similar repositories for huggingface-inferentia2-samples
Users who are interested in huggingface-inferentia2-samples are comparing it to the libraries listed below
- Examples showing use of NGC containers and models within Amazon SageMaker☆17Updated 2 years ago
- ☆24Updated 3 weeks ago
- Article about deploying machine learning models using gRPC, PyTorch and asyncio☆28Updated 2 years ago
- CMP314 Optimizing NLP models with Amazon EC2 Inf1 instances in Amazon SageMaker☆14Updated last year
- ☆23Updated last month
- A do-framework project to simplify deployment of Kubeflow on Amazon EKS☆21Updated 4 months ago
- ☆24Updated last year
- Sample code for parallelizing across multiple CPUs/GPUs on a single machine to speed up deep learning inference☆33Updated 5 years ago
- Large Language Model Hosting Container☆89Updated this week
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.☆12Updated 4 years ago
- Cortex-compatible model server for Python and TensorFlow☆17Updated 2 years ago
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service☆25Updated 6 months ago
- ☆14Updated last year
- Example code for AWS Neuron SDK developers building inference and training applications☆149Updated 2 weeks ago
- The backend behind the LLM-Perf Leaderboard☆10Updated last year
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆44Updated 3 weeks ago
- ☆44Updated 7 months ago
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning).☆17Updated last year
- This repository is part of a blog post that guides users through creating a visual search application using Amazon SageMaker and Amazon E…☆11Updated last year
- ☆13Updated last year
- Deploy and Scale LLM-based applications☆26Updated 2 years ago
- Deploy and scale distributed Python applications on Amazon EKS using Ray☆15Updated last week
- Sentence Embedding as a Service☆15Updated last year
- ☆18Updated last year
- Examples for using Amazon SageMaker components in Kubeflow Pipelines☆22Updated 5 years ago
- ☆47Updated last month
- ☆44Updated last year
- ☆21Updated 2 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated last month
- ☆28Updated last year