jpmorganchase / inference-server
Deploy your AI/ML model to Amazon SageMaker for Real-Time Inference and Batch Transform using your own Docker container image.
☆48Updated last month
Alternatives and similar repositories for inference-server:
Users that are interested in inference-server are comparing it to the libraries listed below
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆19Updated 2 months ago
- ☆66Updated 6 months ago
- Question Answering application with Large Language Models (LLMs) and Amazon Postgresql using pgvector☆14Updated last month
- Tools and utilities for operating Metaflow in production☆49Updated 2 weeks ago
- This Guidance demonstrates how to configure a proxy in a virtual private cloud (VPC) to connect external services to your Amazon VPC Latt…☆11Updated 3 months ago
- This solution helps you deploy ETL processes and data storage resources to create an Insurance Lake using Amazon S3 buckets for storage, …☆13Updated 3 months ago
- A repository that showcases how you can use ZenML with Git☆69Updated 5 months ago
- Build a directory full of files into a SQLite database☆12Updated last year
- Optimising MySQL Performance with ProxySQL to replace MySQL Query Cache☆12Updated 8 months ago
- ☆25Updated last year
- ☆18Updated 4 months ago
- A pipeline to convert contextual knowledge stored in documents and databases into text embeddings, and store them in a vector store☆15Updated last year
- Customizable GitOps template for Kubeflow on AWS EKS☆10Updated 4 years ago
- This Guidance provides a set of artifacts that will guide customers in building a production monitoring architecture with AWS IoT TwinMak…☆10Updated 3 months ago
- The repository guides you through generating a synthetic dataset for a QA-RAG application using the Bedrock API, Python and Langchain.☆13Updated 4 months ago
- ☆11Updated this week
- This solution helps you deploy ETL processes and data storage resources to create an Insurance Lake using Amazon S3 buckets for storage, …☆23Updated 2 months ago
- A set of Docker images that include popular frameworks for machine learning, data science and visualization.☆102Updated this week
- Create an LLM XML context document from an llms.txt file☆14Updated 4 months ago
- Flyte Documentation 📖☆77Updated this week
- Deploy production-grade Metaflow cloud infrastructure on AWS☆60Updated 2 weeks ago
- BigBertha is an architecture design that demonstrates how automated LLMOps (Large Language Models Operations) can be achieved on any Kube…☆27Updated last year
- Explore and experiment with large language models (LLMs) available in Amazon Bedrock☆16Updated 4 months ago
- aws-solutions-library-samples / guidance-for-text-generation-using-embeddings-from-enterprise-data-on-awsThis Guidance demonstrates question answering using Retrieval Augmented Generation (RAG) with foundation models in Amazon SageMaker JumpS…☆10Updated 2 months ago
- A python client used to interact with the Private AI's API☆21Updated last month
- InGen is a command line tool written on top of pandas and great_expectations to perform small scale data transformations and validations …☆14Updated last month
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service☆20Updated last month
- Repo to experiment with Graph RAG strategies using Kùzu☆42Updated last month
- Sample Scripts to Customize SageMaker Notebook Instance☆22Updated 8 months ago