microsoft / azureml-inference-serverLinks
The AzureML Inference Server is a python package that allows user to easily expose machine learning models as HTTP Endpoints. The server is included by default in AzureML's pre-built docker images for inference.
☆26Updated last month
Alternatives and similar repositories for azureml-inference-server
Users that are interested in azureml-inference-server are comparing it to the libraries listed below
Sorting:
- Immersive workshop showcasing the remarkable potential of integrating SoTA foundation models to enhance product experiences and streamlin…☆204Updated 2 months ago
- A simple chat application that integrates Microsoft Entra ID for user authentication. Designed for deployment on Azure Container Apps wit…☆65Updated last week
- Contains the Azure Media Services Video Indexer samples☆182Updated last week
- Scalable solution for ML Observability☆37Updated last year
- Smart load balancing for OpenAI endpoints and Azure API Management☆98Updated last year
- Monitors and processes traffic to and from Azure OpenAI endpoints.☆109Updated this week
- Smart load balancing for Azure OpenAI endpoints☆58Updated last year
- This sample demonstrates how to load balance requests between multiple Azure OpenAI Services using Azure API Management.☆35Updated last year
- Azure MLOps (v2) solution accelerators. Enterprise ready templates to deploy your machine learning models on the Azure Platform.☆82Updated 2 months ago
- This repo contains the reference implementation for the Microsoft Foundry chat baseline reference architecture on Microsoft Learn.☆184Updated last month
- Guided accelerator consolidating best practice patterns, IaaC and AML code artefacts to provide a reference approach to implementing MLOp…☆52Updated last year
- Scaling AOAI using APIM, PTUs and TPMs☆113Updated last year
- Azure OpenAI benchmarking tool☆151Updated last year
- Demo application to show how to use Azure AI Document Intelligence and Azure OpenAI Service to increase the efficiency of document analys…☆59Updated last year
- A repo to accelerate development and testing of GenAI Gateways built with Azure API Management. Includes various capabilities as examples…☆64Updated last year
- ☆32Updated last year
- This sample shows how to take a ChatGPT prompt as HTTP Get or Post input, calculates the completions using OpenAI ChatGPT service, all ho…☆21Updated 3 months ago
- Simple starting point function to host LangChains with LLMs and other tools in an Azure Function.☆73Updated last year
- A demo app showcasing Vector Search using Azure AI Search, Azure OpenAI for text embeddings, and Azure AI Vision for image embeddings.☆75Updated last year
- Demo for a Prompt Flow using OpenAI Assistants☆93Updated last year
- Code for MS Ignite 2023 | Live Breakout Session | BRK203☆32Updated 2 years ago
- An extension that adds support for Azure OpenAI/ OpenAI bindings in Azure Functions for LLM (GPT-3.5-Turbo, GPT-4, etc)☆99Updated 2 weeks ago
- Unleash the power of Azure AI to your application developers in a secure & manageable way with Azure API Management and Azure Developer C…☆108Updated 8 months ago
- Azure Search Python sample code☆170Updated this week
- ☆183Updated last year
- ☆54Updated 2 years ago
- Samples to demonstrate pathways for Retrieval Augmented Generation (RAG) for Azure Data☆175Updated 3 months ago
- Automated Retrieval and GPT Understanding System by utilizing Azure Document Intelligence in combination with GPT models.☆127Updated last week
- Azure OpenAI benchmarking tool☆28Updated 10 months ago
- Enterprise Azure OpenAI Hub provides prescriptive architecture and guidance to accelerate Generative AI on Azure for all organisations, i…☆85Updated last year