philschmid / huggingface-inferentia2-samples
☆10 Updated 8 months ago
Alternatives and similar repositories for huggingface-inferentia2-samples:
Users who are interested in huggingface-inferentia2-samples are comparing it to the libraries listed below.
- ☆12 Updated last month
- Article about deploying machine learning models using gRPC, PyTorch and asyncio ☆27 Updated 2 years ago
- Tools for merging pretrained large language models. ☆19 Updated 8 months ago
- End-to-End LLM Guide ☆101 Updated 7 months ago
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training. ☆12 Updated 4 years ago
- Examples showing use of NGC containers and models within Amazon SageMaker ☆17 Updated 2 years ago
- Question Answering application with Large Language Models (LLMs) and Amazon PostgreSQL using pgvector ☆14 Updated 2 months ago
- ☆53 Updated last month
- Large Language Model Hosting Container ☆83 Updated last week
- This repository is part of a blog post that guides users through creating a visual search application using Amazon SageMaker and Amazon E… ☆10 Updated last year
- ☆18 Updated 4 months ago
- Fine-tune Mistral 7B to generate fashion style suggestions ☆34 Updated last year
- Core Utilities for NVIDIA Merlin ☆19 Updated 6 months ago
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022) ☆13 Updated last year
- Retrieval Augmented Generation applications ☆26 Updated last year
- Cortex-compatible model server for Python and TensorFlow ☆17 Updated 2 years ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers. ☆29 Updated 2 years ago
- 💙 Unstructured Data Connectors for Haystack 2.0 ☆16 Updated last year
- ☆14 Updated last year
- ☆12 Updated 2 months ago
- ☆40 Updated 3 months ago
- Serving a torch model using Celery, Redis and RabbitMQ to serve users asynchronously ☆20 Updated last year
- 🚀 End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam ☆27 Updated 10 months ago
- ☆22 Updated 2 months ago
- LLM Workshop 2024 ☆12 Updated 4 months ago
- A complete (gRPC service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR… ☆36 Updated 6 months ago
- Shows how to do parameter ensembling using differential evolution. ☆10 Updated 3 years ago
- ☆14 Updated last year
- vLLM adapter for a TGIS-compatible gRPC server. ☆21 Updated this week
- ☆24 Updated last year