AI-Maker-Space / FastAPI-LLM-Model-Serving

How to quickly serve an LLM using FastAPI, Celery, and Redis

☆15

Alternatives and similar repositories for FastAPI-LLM-Model-Serving

Users who are interested in FastAPI-LLM-Model-Serving are comparing it to the repositories listed below.

anyscale / e2e-llm-workflows
Fine-tune an LLM to perform batch inference and online serving.
☆112 Updated last month
aishwaryaprabhat / goku
GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling
☆130 Updated 8 months ago
decodingml / articles-code
💻 Decoding ML articles hub: Hands-on articles with code on production-grade ML
☆133 Updated 4 months ago
deep-diver / llamaduo
[ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
☆311 Updated this week
deepset-ai / rag-with-nvidia-nims
🚀 Use NVIDIA NIMs with Haystack pipelines
☆32 Updated 10 months ago
zenml-io / zenml-projects
A repository for all ZenML projects that are specific production use-cases.
☆266 Updated this week
rajshah4 / LLM-Evaluation
Sample notebooks and prompts for LLM evaluation
☆135 Updated last month
jayita13 / GenerativeAI
GenAI Experimentation
☆57 Updated this week
mani-kantap / llm-inference-solutions
A collection of all available inference solutions for LLMs
☆91 Updated 4 months ago
Sakil786 / LLM-PlayLab
This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and…
☆124 Updated 2 months ago
dipanjanS / improving-RAG-systems-dhs2024
This repository will contain the presentation and Python Jupyter notebooks for the DataHack Summit 2024 conference talk, Improving Real-w…
☆117 Updated 9 months ago
AI-Maker-Space / Fine-tuning-LLM-Resources
A collection of fine-tuning notebooks!
☆27 Updated last year
tcapelle / llm_recipes
A set of scripts and notebooks on LLM fine-tuning and dataset creation
☆110 Updated 9 months ago
ksm26 / Finetuning-Large-Language-Models
Unlock the potential of fine-tuning Large Language Models (LLMs). Learn from an industry expert and discover when to apply fine-tuning, data …
☆62 Updated last year
marqo-ai / fine-tuning-embedding-models-course
Marqo's Course on 'Fine-Tuning Embedding Models for Semantic Search'.
☆45 Updated 11 months ago
bentoml / BentoVLLM
Self-host LLMs with vLLM and BentoML
☆134 Updated 2 weeks ago
ibm-self-serve-assets / SuperKnowa
Build Enterprise RAG (Retrieval Augmented Generation) pipelines to tackle various Generative AI use cases with LLMs by simply plugging co…
☆112 Updated 11 months ago
lorenzejay / contract-analysis-use-case
☆34 Updated 2 months ago
Paulescu / testing-llms-in-the-real-world
Test LLMs automatically with Giskard and CI/CD
☆30 Updated 11 months ago
triton-inference-server / vllm_backend
☆274 Updated last month
olonok69 / LLM_Notebooks
Notebooks and code about Generative AI, LLMs, MLOps, NLP, CV, and graph databases
☆118 Updated last week
CVxTz / llm-serve-tutorial
☆20 Updated last year
run-llama / ai-engineer-workshop
☆185 Updated last year
arunpshankar / LLM-Text-to-SQL-Architectures
A collection of architectural patterns leveraging Large Language Models (LLMs) for efficient Text-to-SQL generation.
☆234 Updated last year
vincentclaes / classification-with-llm
How far can we go with an LLM for a classification problem
☆24 Updated 7 months ago
AI-Maker-Space / The-AI-Engineer-Challenge
Building your first LLM application with OpenAI and AI-assisted development, step by step!
☆98 Updated last month
amogkam / llama_index_ray
Using LlamaIndex with Ray for productionizing LLM applications
☆71 Updated last year
alopatenko / LLMEvaluation
A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…
☆123 Updated last week
NVIDIA-AI-Blueprints / rag
This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.
☆164 Updated last week
ytang07 / ai_agents_cookbooks
Cookbooks for AI Agents
☆145 Updated 2 months ago