microsoft / azureml-inference-server
The AzureML Inference Server is a Python package that allows users to easily expose machine learning models as HTTP endpoints. The server is included by default in AzureML's pre-built Docker images for inference.
☆25Updated 3 weeks ago
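As a rough illustration of what "exposing a model as an HTTP endpoint" involves, the sketch below follows the `init()` / `run()` entry-script convention used by AzureML scoring scripts. The stand-in model, the `data` key, and the response shape are hypothetical and chosen only for illustration. A real script would load a trained model in `init()`.

```python
import json

# Set by init(); stands in for a real model object (hypothetical).
model = None


def init():
    """Called once when the server starts; load the model here."""
    global model
    # Hypothetical stand-in model: returns the sum of each input row.
    model = lambda rows: [sum(row) for row in rows]


def run(raw_data):
    """Called for each scoring request with the request body as a string."""
    payload = json.loads(raw_data)          # assumed shape: {"data": [[...], ...]}
    predictions = model(payload["data"])
    return {"predictions": predictions}
```

Saved as `score.py`, a script like this would typically be served locally with a command along the lines of `azmlinfsrv --entry_script score.py`. Check the package's own documentation for the exact options.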
Alternatives and similar repositories for azureml-inference-server
Users who are interested in azureml-inference-server are comparing it to the libraries listed below.
- Guided accelerator consolidating best practice patterns, IaaC and AML code artefacts to provide a reference approach to implementing MLOp…☆50Updated last year
- This sample demonstrates how to load balance requests between multiple Azure OpenAI Services using Azure API Management.☆35Updated last year
- Azure OpenAI benchmarking tool☆148Updated last year
- Scalable solution for ML Observability☆36Updated last year
- Monitors and processes traffic to and from Azure OpenAI endpoints.☆109Updated last week
- Smart load balancing for Azure OpenAI endpoints☆54Updated last year
- ☆32Updated last year
- This sample shows how to take a ChatGPT prompt as HTTP Get or Post input, calculates the completions using OpenAI ChatGPT service, all ho…☆19Updated 9 months ago
- Smart load balancing for OpenAI endpoints and Azure API Management☆92Updated last year
- Scaling AOAI using APIM, PTUs and TPMs☆109Updated last year
- Enterprise Azure OpenAI Hub provides prescriptive architecture and guidance to accelerate Generative AI on Azure for all organisations, i…☆85Updated 8 months ago
- Azure Machine Learning Cheat Sheets☆37Updated 2 years ago
- Automated Retrieval and GPT Understanding System by utilizing Azure Document Intelligence in combination with GPT models.☆114Updated 2 months ago
- A simple chat application that integrates Microsoft Entra ID for user authentication. Designed for deployment on Azure Container Apps wit…☆60Updated last week
- Azure MLOps (v2) solution accelerators. Enterprise ready templates to deploy your machine learning models on the Azure Platform.☆80Updated last year
- ☆183Updated last year
- The GPT-RAG Data Ingestion service automates processing of diverse documents—PDFs, images, spreadsheets, transcripts, and SharePoint—read…☆141Updated last week
- This article shows how to deploy an Azure Kubernetes Service(AKS) cluster and Azure OpenAI Service via Terraform and how to deploy a Terr…☆50Updated last year
- Unleash the power of Azure AI to your application developers in a secure & manageable way with Azure API Management and Azure Developer C…☆101Updated 4 months ago
- Samples to demonstrate pathways for Retrieval Augmented Generation (RAG) for Azure Data☆171Updated 11 months ago
- ☆54Updated last year
- This repo contains the reference implementation for the Microsoft Learn Azure AI Foundry end to end chat baseline.☆158Updated 3 weeks ago
- An AI Control Centre for monitoring, authenticating, and providing resilient access to multiple Open AI services.☆91Updated last month
- OpenAI powered document scanner☆12Updated last year
- ☆22Updated last year
- A demo app showcasing Vector Search using Azure AI Search, Azure OpenAI for text embeddings, and Azure AI Vision for image embeddings.☆74Updated 10 months ago
- An extension that adds support for Azure OpenAI/ OpenAI bindings in Azure Functions for LLM (GPT-3.5-Turbo, GPT-4, etc)☆99Updated 3 months ago
- Immersive workshop showcasing the remarkable potential of integrating SoTA foundation models to enhance product experiences and streamlin…☆194Updated 4 months ago
- ☆52Updated 9 months ago
- Lightweight library demonstrating how to create agentic applications without using any specific framework.☆26Updated 8 months ago