AI-Maker-Space / FastAPI-LLM-Model-ServingLinks
How to quickly serve an LLM using Fast API, Celery, and Redis
☆15Updated last year
Alternatives and similar repositories for FastAPI-LLM-Model-Serving
Users that are interested in FastAPI-LLM-Model-Serving are comparing it to the libraries listed below
Sorting:
- Fine-tune an LLM to perform batch inference and online serving.☆111Updated last week
- A collection of hand on notebook for LLMs practitioner☆47Updated 4 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆105Updated 2 months ago
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆110Updated 10 months ago
- GenAI Experimentation☆57Updated last month
- Example code and notebooks related to mlflow, llmops, etc.☆43Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- 🚀 Use NVIDIA NIMs with Haystack pipelines☆31Updated 9 months ago
- Using LlamaIndex with Ray for productionizing LLM applications☆71Updated last year
- A collection of fine-tuning notebooks!☆27Updated last year
- DSPY on action with OpenSource LLMs.☆70Updated last year
- Building Private Healthcare AI Assistant for Clinics Using Qdrant Hybrid Cloud, DSPy and Groq - Llama3☆22Updated last year
- 3-Pipeline LLMOps Financial advisor. Steaming pipeline deployed on AWS, 24/7 collects, embeds live-data into QdrantDB. Training pipeline …☆23Updated last month
- Document Q&A on Wikipedia articles using LLMs☆78Updated last year
- ☆18Updated 3 weeks ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆11Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆70Updated 7 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆137Updated 10 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆111Updated 8 months ago
- A library for evaluating Retrieval-Augmented Generation (RAG) systems (The traditional ways).☆35Updated 9 months ago
- How far can we go with an LLM for a classification problem☆25Updated 6 months ago
- All code related to medium articles☆17Updated 2 weeks ago
- Set of scripts to finetune LLMs☆37Updated last year
- Build Agentic workflows with function calling using open LLMs☆26Updated this week
- 💻 Decoding ML articles hub: Hands-on articles with code on production-grade ML☆131Updated 3 months ago
- Examples of using Evidently to evaluate, test and monitor ML models.☆28Updated last week
- ☆19Updated 7 months ago
- Retrieval Augmented Generation (RAG) on audio data with LangChain☆14Updated last year
- ☆14Updated last year
- Sample notebooks and prompts for LLM evaluation☆131Updated this week