bentoml / aws-ec2-deploy
Fast model deployment on AWS EC2
☆14Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for aws-ec2-deploy
- Fast model deployment on AWS Lambda☆14Updated 8 months ago
- ☆12Updated last year
- Fast model deployment on AWS Sagemaker☆15Updated 8 months ago
- BIG: Back In the Game of Creative AI☆25Updated last year
- Demo FastAPI WebSocket Audio☆35Updated 4 years ago
- Command Line Interface for Hugging Face Inference Endpoints☆66Updated 7 months ago
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆19Updated this week
- 🤝 Trade any tensors over the network☆30Updated last year
- Evaluate your LLM apps, RAG pipeline, any generated text, and more!☆0Updated 6 months ago
- **ARCHIVED** Filesystem interface to 🤗 Hub☆56Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆49Updated last year
- ☆84Updated last year
- ☆19Updated 3 years ago
- Demos of some issues with LangChain.☆30Updated last year
- Plugin for deploying MLflow models to TorchServe☆106Updated last year
- Cortex-compatible model server for Python and TensorFlow☆16Updated last year
- Run LLMs on Replicate with vLLM☆15Updated last month
- ☆30Updated last year
- Simple dependency injection framework for Python☆20Updated 6 months ago
- API serving for your diffusers models☆10Updated 10 months ago
- BentoML Example Projects 🎨☆134Updated 2 years ago
- Explore vector similarity in Redis☆116Updated last year
- CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search☆53Updated 10 months ago
- Self-host LLMs with vLLM and BentoML☆74Updated last week
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆33Updated last year
- Embedding models from Jina AI☆56Updated 10 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆36Updated 7 months ago
- Build reliable, secure, and production-ready AI apps easily.☆46Updated 2 weeks ago
- ✅ Pytest-style test runner for langchain projects☆24Updated last year
- ☆25Updated 10 months ago