bentoml / aws-ec2-deploy
Fast model deployment on AWS EC2
☆14Updated 11 months ago
Alternatives and similar repositories for aws-ec2-deploy:
Users that are interested in aws-ec2-deploy are comparing it to the libraries listed below
- Fast model deployment on AWS Lambda☆14Updated 11 months ago
- ☆12Updated last year
- Simple dependency injection framework for Python☆20Updated 8 months ago
- Fast model deployment on AWS Sagemaker☆15Updated 11 months ago
- BIG: Back In the Game of Creative AI☆26Updated last year
- ☆84Updated last year
- ☆30Updated last year
- Run LLMs on Replicate with vLLM☆15Updated 3 months ago
- Local emulator for Hugging Face Inference Endpoints customer handlers☆25Updated last year
- ☆28Updated last year
- A lightweight Python library for running TTS models with a unified API.☆16Updated 2 weeks ago
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆22Updated 2 months ago
- Cortex-compatible model server for Python and TensorFlow☆17Updated 2 years ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆28Updated 3 weeks ago
- Using modal.com to process FineWeb-edu data☆19Updated last month
- Evaluate your LLM apps, RAG pipeline, any generated text, and more!Updated 8 months ago
- GitHub action that'll sync files from a GitHub Repo with the Hugging Face Hub 🤗☆69Updated 3 months ago
- Explore vector similarity in Redis☆115Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated last year
- The backend behind the LLM-Perf Leaderboard☆10Updated 8 months ago
- This is the repo for the container that holds the models for the text2vec-transformers module☆43Updated last week
- Command Line Interface for Hugging Face Inference Endpoints☆67Updated 9 months ago
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated last year
- ☆12Updated last year
- Testing methods for GPU deployment☆20Updated 2 years ago
- Prefect integrations for working with OpenAI.☆36Updated 9 months ago
- DiffusionWithAutoscaler☆29Updated 9 months ago
- utilities for loading and running text embeddings with onnx☆43Updated 5 months ago
- ☆20Updated last year