philschmid / huggingface-inferentia2-samples
☆10Updated 9 months ago
Alternatives and similar repositories for huggingface-inferentia2-samples:
Users that are interested in huggingface-inferentia2-samples are comparing it to the libraries listed below
- Cortex-compatible model server for Python and TensorFlow☆17Updated 2 years ago
- The backend behind the LLM-Perf Leaderboard☆10Updated 10 months ago
- ☆12Updated last week
- A repository of PyTorch example☆9Updated last year
- Examples for using Amazon SageMaker components in Kubeflow Pipelines☆22Updated 4 years ago
- ☆16Updated 2 months ago
- ☆21Updated 2 months ago
- Experimentation on google's gemma model☆16Updated last year
- Sentence Embedding as a Service☆15Updated last year
- Deploy and Scale LLM-based applications☆26Updated last year
- Examples showing use of NGC containers and models withing Amazon SageMaker☆17Updated 2 years ago
- Large Language Model Hosting Container☆85Updated this week
- End-to-End LLM Guide☆104Updated 8 months ago
- Estimating hardware and cloud costs of LLMs and transformer projects☆14Updated last year
- ☆44Updated 10 months ago
- ☆14Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated 2 weeks ago
- ☆56Updated this week
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022)☆13Updated last year
- The collection of bulding blocks building fine-tunable metric learning models☆32Updated 2 months ago
- vLLM adapter for a TGIS-compatible gRPC server.☆25Updated this week
- Example code for AWS Neuron SDK developers building inference and training applications☆140Updated last month
- Tools for merging pretrained large language models.☆19Updated 9 months ago
- Rust bindings for CTranslate2☆14Updated last year
- ☆18Updated 2 weeks ago
- ☆13Updated 2 years ago
- Article about deploying machine learning models using grpc, pytorch and asyncio☆28Updated 2 years ago
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.☆12Updated 4 years ago
- ☆20Updated 3 years ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆30Updated 2 years ago