Azure / The-LLM-Latency-Guidebook-Optimizing-Response-Times-for-GenAI-ApplicationsLinks
There are many articles that cover the principles of reducing latency optimization for LLMs, however it is often unclear how to actually implement these principles. This repository provides practical techniques for reducing the latency of GenAI applications.
☆30Updated last year
Alternatives and similar repositories for The-LLM-Latency-Guidebook-Optimizing-Response-Times-for-GenAI-Applications
Users that are interested in The-LLM-Latency-Guidebook-Optimizing-Response-Times-for-GenAI-Applications are comparing it to the libraries listed below
Sorting:
- An end-to-end sample of RAG showcasing development, evaluation, experimentation, and deployment using Promptflow, search products like Co…☆53Updated last year
- This sample shows how to quickly get started with LlamaIndex.ai on Azure🚀☆58Updated last month
- An easy way to deploy the Langfuse observability platform to Azure Container Apps with Entra authentication.☆57Updated last month
- Automatically generate github documentation with readthedocs using your openai endpoint☆35Updated 2 years ago
- ☆28Updated last year
- GenAIOps with Prompt Flow is a "GenAIOps template and guidance" to help you build LLM-infused apps using Prompt Flow. It offers a range o…☆341Updated 4 months ago
- Learn how to build solutions with Large Language Models.☆157Updated 11 months ago
- ☆62Updated this week
- Tools for evaluation of RAG Chat Apps using Azure AI Evaluate SDK and OpenAI☆303Updated this week
- A recipe that will walk you through using either Meta Llama 3.1 405B or OpenAI GPT-4o deployed on Azure AI to generate a synthetic datase…☆69Updated last month
- This hands-on walks you through fine-tuning an open source LLM on Azure and serving the fine-tuned model on Azure. It is intended for Dat…☆53Updated 5 months ago
- Some python code samples using Azure AI Search for Generative AI stuff☆66Updated 7 months ago
- ☆129Updated this week
- A backend for a chat application written in Python FastAPI framework☆59Updated 2 weeks ago
- Example for Deploying Chatbot using Streamlit and Azure Web App☆52Updated 2 years ago
- Using Azure OpenAI GPT 4o to extract information such as text, tables and charts from Documents to Markdown☆30Updated 7 months ago
- This solution converts speech to text and then processes and summarizes the text based on the prompt scenario.☆35Updated 10 months ago
- The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluati…☆265Updated 4 months ago
- ☆102Updated last month
- ☆29Updated last year
- ☆121Updated 3 weeks ago
- This repo accelerates development of RAG applications with rich data sources including SQL Warehouses and documents analysed with Azure D…☆86Updated last month
- A multimodal Retrieval Augmented Generation with code execution capabilities. Process multiple complex documents with images, table, char…☆69Updated last month
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Updated 2 months ago
- Assistant API to chat with tabular data and perform analytics in natural language.☆53Updated last year
- A fully python based Streamlit development harness for ChatGPT hosted in Azure OpenAI Service.☆52Updated 2 years ago
- This repo helps you to build a team of AI agents with Autogen☆223Updated last week
- A mixture of Gen AI cookbook recipes for Gen AI applications.☆218Updated 11 months ago
- Azure Search Python sample code☆143Updated 2 months ago
- Azure OpenAI benchmarking tool☆148Updated last year