Azure / The-LLM-Latency-Guidebook-Optimizing-Response-Times-for-GenAI-Applications
Many articles cover the principles of latency optimization for LLMs; however, it is often unclear how to actually implement these principles. This repository provides practical techniques for reducing the latency of GenAI applications.
☆31 Updated last year
Alternatives and similar repositories for The-LLM-Latency-Guidebook-Optimizing-Response-Times-for-GenAI-Applications
Users that are interested in The-LLM-Latency-Guidebook-Optimizing-Response-Times-for-GenAI-Applications are comparing it to the libraries listed below
- Guide for designing adaptive, scalable, and secure enterprise multi-agent systems ☆136 Updated last week
- ☆28 Updated last year
- An end-to-end sample of RAG showcasing development, evaluation, experimentation, and deployment using Promptflow, search products like Co… ☆55 Updated last year
- This sample shows how to quickly get started with LlamaIndex.ai on Azure🚀 ☆60 Updated 4 months ago
- The GPT-RAG Data Ingestion service automates processing of diverse documents—PDFs, images, spreadsheets, transcripts, and SharePoint—read… ☆155 Updated 2 weeks ago
- An easy way to deploy the Langfuse observability platform to Azure Container Apps with Entra authentication. ☆57 Updated 4 months ago
- Building LLM-Enabled Multi Agent Applications from Scratch ☆274 Updated last week
- Learn how to build solutions with Large Language Models. ☆160 Updated last year
- Interactive workflows for creating AI intelligence reports from real-world data sources ☆97 Updated last month
- This solution converts speech to text and then processes and summarizes the text based on the prompt scenario. ☆37 Updated last year
- The GPT-RAG Orchestrator service is an agentic orchestration layer built on Azure AI Foundry Agent Service and the Semantic Kernel framew… ☆70 Updated this week
- An index of all of our weekly concepts + code events for aspiring AI Engineers and Business Leaders!! ☆93 Updated last week
- ☆29 Updated last year
- Virtual focus group with custom personas, product details, and final analysis created with AutoGen, Ollama/Llama3, and Streamlit. ☆47 Updated last year
- Example for Deploying Chatbot using Streamlit and Azure Web App ☆53 Updated 2 years ago
- The “Agentic Cookbook for Generative AI Agent usage” is a comprehensive guide designed to empower users with the knowledge and tools to e… ☆138 Updated 8 months ago
- Some Python code samples using Azure AI Search for Generative AI stuff ☆66 Updated 10 months ago
- Legal Research Copilot Example Solution built with Generative AI capabilities of PostgreSQL on Azure ☆100 Updated 9 months ago
- ☆124 Updated 2 months ago
- ☆108 Updated 4 months ago
- ☆75 Updated last year
- This hands-on walks you through fine-tuning an open source LLM on Azure and serving the fine-tuned model on Azure. It is intended for Dat… ☆57 Updated 8 months ago
- A fully Python-based Streamlit development harness for ChatGPT hosted in Azure OpenAI Service. ☆53 Updated 2 years ago
- A backend for a chat application written in Python FastAPI framework ☆64 Updated 2 months ago
- Assistant API to chat with tabular data and perform analytics in natural language. ☆54 Updated last year
- Using LlamaIndex with Ray for productionizing LLM applications ☆71 Updated 2 years ago
- GenAIOps with Prompt Flow is a "GenAIOps template and guidance" to help you build LLM-infused apps using Prompt Flow. It offers a range o… ☆347 Updated 7 months ago
- A mixture of Gen AI cookbook recipes for Gen AI applications. ☆225 Updated last year
- Building your first LLM application with OpenAI, and AI-assisted Development, step-by-step! ☆114 Updated 2 weeks ago
- The Azure AI Assistant Tool is an experimental Python application and middleware designed to simplify the development, experimentation, test… ☆139 Updated 8 months ago