cgrpa / AzureOAIBalancer
AzureAIOBalancer is a Terraform repository for automating the deployment of a load-balanced Azure OpenAI environment across multiple regions. It aims to increase throughput and manage high request volumes for any OpenAI model by distributing workloads evenly.
☆10Updated last year
Alternatives and similar repositories for AzureOAIBalancer:
Users that are interested in AzureOAIBalancer are comparing it to the libraries listed below
- A demo app showcasing Vector Search using Azure AI Search, Azure OpenAI for text embeddings, and Azure AI Vision for image embeddings.☆72Updated 5 months ago
- Build a Retrieval Augmented Generation solution using OpenAI, Azure Functions, Azure Static Web Apps, Azure SQL DB, Data API builder and …☆42Updated 2 months ago
- Smart load balancing for Azure OpenAI endpoints☆77Updated last year
- ☆46Updated 3 months ago
- Demo application to show how to use Azure AI Document Intelligence and Azure OpenAI Service to increase the efficiency of document analys…☆58Updated 10 months ago
- This sample demonstrates how to load balance requests between multiple Azure OpenAI Services using Azure API Management.☆34Updated last year
- This sample shows how to take a ChatGPT prompt as HTTP Get or Post input, calculates the completions using OpenAI ChatGPT service, all ho…☆19Updated 4 months ago
- ☆23Updated last year
- ☆28Updated 8 months ago
- A sample exploring ergonomic, lightweight multi-agent orchestration in Python using Azure Cosmos DB with OpenAI Swarm☆19Updated 3 weeks ago
- Responses API on Azure OpenAI samples☆15Updated this week
- Session demos for Build AI Apps at Fabric Conference 2024☆9Updated 9 months ago
- Sample chat application using Azure Cosmos DB for NoSQL, Azure OpenAI, Azure Container Apps, and Azure Container Registry☆11Updated last year
- ☆37Updated 7 months ago
- This repository contains a collection of Bicep modules designed to deploy a secure Azure AI Foundry environment with robust network and i…☆54Updated 4 months ago
- ☆36Updated last year
- Open AI with Private Endpoints behind APIM and functionality to get tokens consumption for each consumer☆33Updated 8 months ago
- A simple chat application that integrates Microsoft Entra ID for user authentication. Designed for deployment on Azure Container Apps wit …☆55Updated 2 weeks ago
- Durable Multi Agents is an attempt to make use of Azure Durable Functions with Semantic Kernel to build Multi-Agents workflows.☆39Updated last month
- Quickly generate embeddings from data in Azure SQL☆25Updated 2 months ago
- ☆51Updated 2 weeks ago
- Perform entity extraction using Azure OpenAI structured outputs☆41Updated this week
- A variation of the Azure OpenAI chat baseline, altered to be deployed specifically into an application landing zone.☆51Updated last month
- Secure Dynamic Session Agent for AI generated code execution built using OpenAI, Langchain and Azure Container Apps☆20Updated 4 months ago
- This repo contains the reference implementation for the Microsoft Learn Azure OpenAI end to end chat baseline.☆123Updated last month
- Unleash the power of Azure OpenAI to your application developers in a secure & manageable way with Azure API Management and Azure Develop…☆34Updated last year
- ☆21Updated last week
- Using Azure SQL and Semantic Kernel to chat with your own data using a mix of NL2SQL and RAG☆61Updated last month
- ☆19Updated 2 months ago
- Ecosystem of Microsoft AI Services☆26Updated last year