bentoml / aws-ec2-deploy
Fast model deployment on AWS EC2
☆14 Updated last year
Alternatives and similar repositories for aws-ec2-deploy
Users who are interested in aws-ec2-deploy are comparing it to the repositories listed below.
- Fast model deployment on AWS Lambda ☆14 Updated last year
- ☆13 Updated last year
- BIG: Back In the Game of Creative AI ☆27 Updated 2 years ago
- Fast model deployment on AWS Sagemaker ☆16 Updated last year
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍 ☆26 Updated 6 months ago
- ☆28 Updated last year
- Cortex-compatible model server for Python and TensorFlow ☆17 Updated 2 years ago
- Plugin for deploying MLflow models to TorchServe ☆109 Updated 2 years ago
- Like picoGPT but for BERT. ☆49 Updated 2 years ago
- Command Line Interface for Hugging Face Inference Endpoints ☆66 Updated last year
- ☆41 Updated 10 months ago
- **ARCHIVED** Filesystem interface to 🤗 Hub ☆58 Updated 2 years ago
- Run LLMs on Replicate with vLLM ☆17 Updated 7 months ago
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API. ☆50 Updated 2 years ago
- ☆20 Updated 4 years ago
- Evaluate your LLM apps, RAG pipeline, any generated text, and more! ☆1 Updated last year
- BentoML Example Projects 🎨 ☆138 Updated 5 months ago
- ☆48 Updated last year
- Plugin for LLM adding support for Google's PaLM 2 model ☆14 Updated last year
- Testing methods for GPU deployment ☆20 Updated 2 years ago
- Dataset registry DVC project ☆73 Updated last year
- A complete (grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR… ☆39 Updated 9 months ago
- Machine learning tool-set for Paperspace VMs ☆57 Updated last year
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow ☆63 Updated last year
- Demos of some issues with LangChain. ☆31 Updated last year
- Deploy and Scale LLM-based applications ☆26 Updated last year
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients ☆36 Updated last year
- Simple dependency injection framework for Python ☆21 Updated last year
- Explore vector similarity in Redis ☆115 Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module ☆51 Updated 2 months ago