Azure / The-LLM-Latency-Guidebook-Optimizing-Response-Times-for-GenAI-Applications
There are many articles that cover the principles of latency optimization for LLMs; however, it is often unclear how to actually implement these principles. This repository provides practical techniques for reducing the latency of GenAI applications.
☆27Updated 11 months ago
Alternatives and similar repositories for The-LLM-Latency-Guidebook-Optimizing-Response-Times-for-GenAI-Applications:
Users that are interested in The-LLM-Latency-Guidebook-Optimizing-Response-Times-for-GenAI-Applications are comparing it to the libraries listed below
- Interactive workflows for creating AI intelligence reports from real-world data sources☆73Updated last month
- This sample shows how to quickly get started with LlamaIndex.ai on Azure🚀☆56Updated last month
- An end-to-end sample of RAG showcasing development, evaluation, experimentation, and deployment using Promptflow, search products like Co…☆51Updated 7 months ago
- Some Python code samples using Azure AI Search for Generative AI stuff☆57Updated 3 months ago
- Multi-modal & multi-domain customer service agent with real time text, voice and soon video☆53Updated last week
- An easy way to deploy the Langfuse observability platform to Azure Container Apps with Entra authentication.☆50Updated 4 months ago
- Service to import data from various sources and index it in AI Search. Increases data relevance and reduces final size by 90%+. Useful fo…☆28Updated 6 months ago
- A multimodal Retrieval Augmented Generation with code execution capabilities. Process multiple complex documents with images, table, char…☆53Updated last month
- Learn how to build solutions with Large Language Models.☆146Updated 7 months ago
- This repo helps you to build a team of AI agents with Autogen☆178Updated last week
- The “Agentic Cookbook for Generative AI Agent usage” is a comprehensive guide designed to empower users with the knowledge and tools to e…☆107Updated last month
- The Multi-Agent Custom Automation Engine Solution Accelerator is an AI-driven orchestration system that manages a group of AI agents to a…☆181Updated this week
- Example for Deploying Chatbot using Streamlit and Azure Web App☆48Updated last year
- GenAIOps with Prompt Flow is a "GenAIOps template and guidance" to help you build LLM-infused apps using Prompt Flow. It offers a range o…☆317Updated 2 weeks ago
- The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluati…☆244Updated 2 weeks ago
- The Azure AI Assistant Tool is an experimental Python application and middleware designed to simplify the development, experimentation, test…☆136Updated last month
- Building LLM-Enabled Multi Agent Applications with AutoGen☆118Updated 2 weeks ago
- Legal Research Copilot Example Solution built with Generative AI capabilities of PostgreSQL on Azure☆69Updated last month
- Model Context Protocol Servers for Azure AI Search☆37Updated last month
- A recipe that will walk you through using either Meta Llama 3.1 405B or GPT-4o deployed on Azure AI to generate a synthetic dataset using…☆59Updated 2 months ago
- Azure OpenAI integration as a custom skillset in Azure Cognitive Search☆33Updated 2 years ago
- AutoGen data analysis☆32Updated last year
- Monitors and processes traffic to and from Azure OpenAI endpoints.☆105Updated this week
- Virtual focus group with custom personas, product details, and final analysis created with AutoGen, Ollama/Llama3, and Streamlit.☆45Updated 9 months ago