Azure-Samples / shared-azure-openai-tpm

This example shows how a multitenant service can distribute requests evenly among multiple Azure OpenAI Service instances and manage tokens per minute (TPM) for multiple tenants.
11 · Updated 10 months ago

Related projects

Alternatives and complementary repositories for shared-azure-openai-tpm