microsoft / azureml-inference-server
The AzureML Inference Server is a python package that allows user to easily expose machine learning models as HTTP Endpoints. The server is included by default in AzureML's pre-built docker images for inference.
☆23Updated last month
Related projects: ⓘ
- Solution Accelerator to show how to build your own copilot☆105Updated this week
- Business process automation solution accelerator using Azure services☆166Updated last month
- Immersive workshop showcasing the remarkable potential of integrating SoTA foundation models to enhance product experiences and streamlin…☆156Updated this week
- ☆34Updated this week
- Function calling for vector database lookup based on user question☆20Updated this week
- Learn how to build solutions with Large Language Models.☆111Updated 3 weeks ago
- The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluati…☆174Updated this week
- Assistant API to chat with tabular data and perform analytics in natural language.☆29Updated 3 weeks ago
- This sample shows how to take text documents as a input via BlobTrigger, does Text Summarization processing using the AI Congnitive Langu…☆17Updated 4 months ago
- Reddog Cloud Native Solutions☆56Updated last month
- A demo app showcasing Vector Search using Azure AI Search, Azure OpenAI for text embeddings, and Azure AI Vision for image embeddings.☆66Updated 3 months ago
- Scalable solution for ML Observability☆34Updated 3 months ago
- AI Hub Executive Demo HandsOn☆45Updated 2 weeks ago
- ☆66Updated 11 months ago
- Monitors and processes traffic to and from Azure OpenAI endpoints.☆99Updated this week
- The Azure AI proxy service facilitates easy access to Azure AI resources for workshops and hackathons. It offers a Playground-like interf…☆55Updated last week
- An AI Control Centre for monitoring, authenticating, and providing resilient access to multiple Open AI services.☆80Updated last week
- Samples to demonstrate pathways for Retrieval Augmented Generation (RAG) for Azure Data☆110Updated last month
- Smart load balancing for OpenAI endpoints and Azure API Management☆60Updated 3 months ago
- ☆36Updated this week
- This sample demonstrates how to load balance requests between multiple Azure OpenAI Services using Azure API Management.☆32Updated 5 months ago
- A variation of the Azure OpenAI chat baseline, altered to be deployed specifically into an application landing zone.☆24Updated 4 months ago
- Microsoft Official Build Modern AI Apps reference solutions and content. Demonstrate how to build Copilot applications that incorporate H…☆151Updated last month
- A 1-2 day hackathon to help users learn the concepts and technical skills to build AI-enabled applications and services in Azure.☆57Updated 4 months ago
- Extensible Semantic Kernel Bot Solution Accelerator☆53Updated 5 months ago
- ☆49Updated 3 months ago
- A versatile tool designed to help prototype intelligent assistants, agents and multi-agentic systems☆52Updated this week
- A description of the HTTP protocol used by multiple Azure AI chat solutions.☆30Updated 3 months ago
- ☆15Updated last month
- A simple web application for a OpenAI-enabled document search. This repo uses Azure OpenAI Service for creating embeddings vectors from d…☆44Updated 8 months ago