philschmid / huggingface-inferentia2-samples
☆10 Updated 7 months ago
Alternatives and similar repositories for huggingface-inferentia2-samples:
Users who are interested in huggingface-inferentia2-samples are comparing it to the repositories listed below
- Large Language Model Hosting Container ☆80 Updated last week
- Examples showing use of NGC containers and models within Amazon SageMaker ☆17 Updated 2 years ago
- ☆11 Updated last month
- Tools for merging pretrained large language models. ☆19 Updated 7 months ago
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning). ☆17 Updated last year
- Question Answering application with Large Language Models (LLMs) and Amazon Postgresql using pgvector ☆14 Updated last month
- ☆20 Updated 2 months ago
- Deploy and Scale LLM-based applications ☆26 Updated last year
- Article about deploying machine learning models using grpc, pytorch and asyncio ☆27 Updated 2 years ago
- LLMPerf is a library for validating and benchmarking LLMs ☆10 Updated 5 months ago
- 🚀 End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam ☆27 Updated 9 months ago
- Fine-tune Mistral 7B to generate fashion style suggestions ☆33 Updated last year
- ☆19 Updated last year
- ☆12 Updated last week
- ☆17 Updated last year
- A repository of PyTorch examples ☆10 Updated last year
- TAO Toolkit deep learning networks with TensorFlow 1.x backend ☆13 Updated 11 months ago
- Example code for AWS Neuron SDK developers building inference and training applications ☆132 Updated this week
- End-to-End LLM Guide ☆99 Updated 6 months ago
- 💙 Unstructured Data Connectors for Haystack 2.0 ☆16 Updated last year
- ☆18 Updated last week
- A do-framework project to simplify deployment of Kubeflow on Amazon EKS ☆20 Updated last week
- A complete (grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR… ☆32 Updated 5 months ago
- serving a torch model using Celery, Redis and RabbitMQ to serve users asynchronously ☆20 Updated 11 months ago
- ☆28 Updated last year
- PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie… ☆31 Updated last year
- ☆12 Updated last year
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems ☆10 Updated last year
- A collection of examples demonstrating how to use dstack ☆26 Updated 7 months ago
- Examples of using Evidently to evaluate, test and monitor ML models. ☆18 Updated last month