Azure / The-LLM-Latency-Guidebook-Optimizing-Response-Times-for-GenAI-ApplicationsLinks
There are many articles that cover the principles of reducing latency optimization for LLMs, however it is often unclear how to actually implement these principles. This repository provides practical techniques for reducing the latency of GenAI applications.
☆34Updated last year
Alternatives and similar repositories for The-LLM-Latency-Guidebook-Optimizing-Response-Times-for-GenAI-Applications
Users that are interested in The-LLM-Latency-Guidebook-Optimizing-Response-Times-for-GenAI-Applications are comparing it to the libraries listed below
Sorting:
- ☆28Updated last year
- This sample shows how to quickly get started with LlamaIndex.ai on Azure🚀☆61Updated 6 months ago
- Guide for designing adaptive, scalable, and secure enterprise multi-agent systems☆161Updated 2 months ago
- Building LLM-Enabled Multi Agent Applications from Scratch☆353Updated last week
- ☆30Updated last year
- An easy way to deploy the Langfuse observability platform to Azure Container Apps with Entra authentication.☆58Updated 6 months ago
- This hands-on walks you through fine-tuning an open source LLM on Azure and serving the fine-tuned model on Azure. It is intended for Dat…☆59Updated 10 months ago
- An end-to-end sample of RAG showcasing development, evaluation, experimentation, and deployment using Promptflow, search products like Co…☆56Updated last year
- ☆125Updated last week
- Some python code samples using Azure AI Search for Generative AI stuff☆67Updated last year
- Learn how to build solutions with Large Language Models.☆161Updated last year
- GenAIOps with Prompt Flow is a "GenAIOps template and guidance" to help you build LLM-infused apps using Prompt Flow. It offers a range o…☆352Updated 9 months ago
- The GPT-RAG Data Ingestion service automates processing of diverse documents—PDFs, images, spreadsheets, transcripts, and SharePoint—read…☆163Updated last week
- The GPT-RAG Orchestrator service is an agentic orchestration layer built on Azure AI Foundry Agent Service and the Semantic Kernel framew…☆77Updated last week
- A multimodal Retrieval Augmented Generation with code execution capabilities. Process multiple complex documents with images, table, char…☆79Updated 3 weeks ago
- Build secure LangChain applications on Azure☆113Updated 3 weeks ago
- A mixture of Gen AI cookbook recipes for Gen AI applications.☆232Updated last year
- Example for Deploying Chatbot using Streamlit and Azure Web App☆53Updated 2 years ago
- ☆119Updated last month
- A recipe that will walk you through using either Meta Llama 3.1 405B or OpenAI GPT-4o deployed on Azure AI to generate a synthetic datase…☆78Updated 6 months ago
- This solution converts speech to text and then processes and summarizes the text based on the prompt scenario.☆39Updated last year
- The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluati…☆295Updated 10 months ago
- ☆55Updated 7 months ago
- This repo helps you to build a team of AI agents with Autogen☆230Updated this week
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Updated 7 months ago
- An index of all of our weekly concepts + code events for aspiring AI Engineers and Business Leaders!!☆99Updated this week
- A backend for a chat application written in Python FastAPI framework☆64Updated 4 months ago
- Assistant API to chat with tabular data and perform analytics in natural language.☆56Updated last year
- Automating Advanced Business Analytics with ChatGPT☆68Updated 2 years ago
- Interactive workflows for creating AI intelligence reports from real-world data sources☆102Updated 2 weeks ago