AI-Maker-Space / FastAPI-LLM-Model-Serving
How to quickly serve an LLM using FastAPI, Celery, and Redis
☆16 · Updated 2 years ago
Alternatives and similar repositories for FastAPI-LLM-Model-Serving
Users who are interested in FastAPI-LLM-Model-Serving are comparing it to the repositories listed below.
- Fine-tune an LLM to perform batch inference and online serving. ☆112 · Updated 3 months ago
- Decoding ML articles hub: Hands-on articles with code on production-grade ML ☆139 · Updated 6 months ago
- GenAI Experimentation ☆57 · Updated 3 weeks ago
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling ☆134 · Updated 10 months ago
- A collection of all available inference solutions for LLMs ☆91 · Updated 6 months ago
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs ☆314 · Updated 2 months ago
- Sample notebooks and prompts for LLM evaluation ☆138 · Updated 3 months ago
- Self-host LLMs with vLLM and BentoML ☆149 · Updated last week
- Building Private Healthcare AI Assistant for Clinics Using Qdrant Hybrid Cloud, DSPy and Groq - Llama3 ☆22 · Updated last year
- Code Repository for Blog - How to Productionize Large Language Models (LLMs) ☆12 · Updated last year
- Find the optimal model serving solution for Hugging Face models ☆44 · Updated last month
- Notebooks and code about Generative AI, LLMs, MLOps, NLP, CV, and graph databases ☆125 · Updated last week
- Sales Conversion Optimization MLOps: Boost revenue with AI-powered insights. Features H2O AutoML, ZenML pipelines, Neptune.ai tracking, d… ☆18 · Updated 5 months ago
- Using LlamaIndex with Ray for productionizing LLM applications ☆71 · Updated 2 years ago
- Various installation guides for Large Language Models ☆74 · Updated 4 months ago
- Build Enterprise RAG (Retrieval-Augmented Generation) pipelines to tackle various Generative AI use cases with LLMs by simply plugging co… ☆113 · Updated last year
- Unlock the potential of fine-tuning Large Language Models (LLMs). Learn from industry experts, and discover when to apply fine-tuning, data … ☆68 · Updated last year
- Examples of using Evidently to evaluate, test and monitor ML models. ☆39 · Updated last month
- This repository will contain the presentation and python jupyter notebooks for the DataHack Summit 2024 conference talk, Improving Real-w… ☆121 · Updated 11 months ago
- Miscellaneous code and writings for MLOps ☆15 · Updated last week
- Use NVIDIA NIMs with Haystack pipelines ☆32 · Updated last year
- Various projects using Large Language Models (GPT & LLAMA) and other open-source models from Hugging Face and OpenAI. OpenAI API required for ru… ☆104 · Updated last month
- A collection of hands-on notebooks for LLM practitioners ☆50 · Updated 8 months ago
- A Hands-on Practical Guide to LlamaIndex ☆33 · Updated 11 months ago
- A repository for all ZenML projects that are specific production use-cases. ☆277 · Updated 3 weeks ago
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and… ☆128 · Updated last month
- ☆20 · Updated last year
- GPT2 fine-tuning pipeline with KerasNLP, TensorFlow, and TensorFlow Extended ☆33 · Updated 2 years ago
- ☆63 · Updated 5 months ago
- ☆58 · Updated last year