cgrpa / AzureOAIBalancer
AzureAIOBalancer is a Terraform repository for automating the deployment of a load-balanced Azure OpenAI environment across multiple regions. It aims to increase throughput and manage high request volumes for any OpenAI model by distributing workloads evenly.
☆10Updated last year
Alternatives and similar repositories for AzureOAIBalancer:
Users that are interested in AzureOAIBalancer are comparing it to the libraries listed below
- ☆28Updated 7 months ago
- ☆46Updated this week
- Open AI with Private Endpoints behind APIM and functionality to get tokens consumption for each consumer☆32Updated 8 months ago
- This sample demonstrates how to load balance requests between multiple Azure OpenAI Services using Azure API Management.☆34Updated 11 months ago
- ☆45Updated 2 months ago
- Indexing Sharepoint Online Content to Azure Cognitive Search☆28Updated 4 months ago
- Smart load balancing for Azure OpenAI endpoints☆77Updated 11 months ago
- A demo app showcasing Vector Search using Azure AI Search, Azure OpenAI for text embeddings, and Azure AI Vision for image embeddings.☆72Updated 4 months ago
- ☆16Updated last month
- Reference architecture that provides a set of guidelines and best practices for implementing a central AI API gateway to empower various …☆21Updated 10 months ago
- ☆29Updated last month
- Model Context Protocol Servers for Azure AI Search☆29Updated 2 weeks ago
- ☆35Updated last year
- ☆38Updated last week
- Workbook for Azure OpenAI service☆64Updated 2 months ago
- Demonstrates how to build a streaming enterprise agent using Azure AI Agent Service. This sample integrates local HR and company policy d…☆75Updated last month
- A minimal speech-to-structured output app built with Azure OpenAI Realtime API.☆22Updated 3 months ago
- A description of the HTTP protocol used by multiple Azure AI chat solutions.☆32Updated 9 months ago
- A simple chat application that integrates Microsoft Entra ID for user authentication. Designed for deployment on Azure Container Apps wit…☆53Updated this week
- This repository contains a collection of Bicep modules designed to deploy a secure Azure AI Foundry environment with robust network and i…☆55Updated 3 months ago
- This repo contains the reference implementation for the Microsoft Learn Azure OpenAI end to end chat baseline.☆122Updated 3 weeks ago
- Generative AI Operations Solution Accelerator☆29Updated 5 months ago
- Lightweight library demonstrating how to create agenting application without using any specific framework.☆26Updated 2 months ago
- Build a Retrieval Augmented Generation solution using OpenAI, Azure Functions, Azure Static Web Apps, Azure SQL DB, Data API builder and …☆42Updated last month
- ☆24Updated last week
- ☆37Updated 6 months ago
- A repo to accelerate development and testing of GenAI Gateways built with Azure API Management. Includes various capabilities as examples…☆47Updated 2 months ago
- Streamlit-based UI Demo Kit app to showcase capabilities of Azure AI Foundry's Agent Service framework.☆28Updated 3 months ago
- A variation of the Azure OpenAI chat baseline, altered to be deployed specifically into an application landing zone.☆51Updated 3 weeks ago
- Demo application to show how to use Azure AI Document Intelligence and Azure OpenAI Service to increase the efficiency of document analys…☆57Updated 9 months ago