bentoml / aws-lambda-deploy
Fast model deployment on AWS Lambda
☆14Updated 11 months ago
Alternatives and similar repositories for aws-lambda-deploy:
Users that are interested in aws-lambda-deploy are comparing it to the libraries listed below
- Fast model deployment on AWS Sagemaker☆15Updated 11 months ago
- ☆20Updated 3 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 3 years ago
- ☆12Updated last year
- Cortex-compatible model server for Python and TensorFlow☆17Updated 2 years ago
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.☆30Updated 3 years ago
- MLFlow Deployment Plugin for Ray Serve☆43Updated 2 years ago
- ☆57Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 2 months ago
- ☆13Updated 3 years ago
- Examples of using Evidently to evaluate, test and monitor ML models.☆19Updated this week
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆32Updated 2 years ago
- Codes, scripts, and notebooks on various aspects of transformer models.☆27Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated 10 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated 10 months ago
- A PaaS End-to-End ML Setup with Metaflow, Serverless and SageMaker.☆37Updated 4 years ago
- An efficient, to-the-point, and easy-to-use checklist to following when deploying an ML model into production.☆30Updated 2 years ago
- 🤝 Trade any tensors over the network☆30Updated last year
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆36Updated last year
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆57Updated last year
- ☆28Updated last year
- Metaflow flows for analyzing topics and sentiments in Hacker News☆21Updated 6 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- No Teacher BART distillation experiment for NLI tasks☆27Updated 4 years ago
- Machine learning utilities for model conversion, serialization, loading etc☆27Updated 2 years ago
- Command Line Interface for Hugging Face Inference Endpoints☆67Updated 10 months ago
- ☆13Updated last year
- Fast model deployment on Google Cloud Run☆15Updated 11 months ago