bentoml / aws-lambda-deploy
Fast model deployment on AWS Lambda
β14Updated last year
Alternatives and similar repositories for aws-lambda-deploy:
Users that are interested in aws-lambda-deploy are comparing it to the libraries listed below
- A PaaS End-to-End ML Setup with Metaflow, Serverless and SageMaker.β37Updated 4 years ago
- π€ Trade any tensors over the networkβ30Updated last year
- β18Updated 2 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy modβ¦β15Updated last year
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.β30Updated 3 years ago
- β20Updated 3 years ago
- Fast model deployment on Google Cloud Runβ15Updated last year
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.β25Updated 3 years ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systemsβ10Updated last year
- spaCy match and replace, maintaining conjugationβ35Updated 2 years ago
- A repository that showcases how you can use ZenML with Gitβ69Updated 7 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.β38Updated 11 months ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or fβ¦β24Updated 4 years ago
- This is the repo for the container that holds the models for the text2vec-transformers moduleβ49Updated last month
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP modelsβ¦β37Updated 2 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.β36Updated last year
- β13Updated 3 years ago
- π€ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)β17Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeedβ34Updated last year
- Research notes and extra resources for all the work at explodinggradients.comβ23Updated last week
- β13Updated 2 years ago
- Cortex-compatible model server for Python and TensorFlowβ17Updated 2 years ago
- β9Updated 4 years ago
- Sentence Embedding as a Serviceβ15Updated last year
- Framework for building and maintaining self-updating prompts for LLMsβ61Updated 9 months ago
- π οΈ Tools for Transformers compression using PyTorch Lightning β‘β82Updated 4 months ago
- β12Updated 9 months ago
- β12Updated 11 months ago
- βοΈ Parallel and distributed training with spaCy and Rayβ53Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β34Updated 3 months ago