AI-Maker-Space / FastAPI-LLM-Model-ServingLinks
How to quickly serve an LLM using Fast API, Celery, and Redis
☆15Updated last year
Alternatives and similar repositories for FastAPI-LLM-Model-Serving
Users that are interested in FastAPI-LLM-Model-Serving are comparing it to the libraries listed below
Sorting:
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated last month
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling☆130Updated 8 months ago
- 💻 Decoding ML articles hub: Hands-on articles with code on production-grade ML☆133Updated 4 months ago
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆311Updated this week
- 🚀 Use NVIDIA NIMs with Haystack pipelines☆32Updated 10 months ago
- A repository for all ZenML projects that are specific production use-cases.☆266Updated this week
- Sample notebooks and prompts for LLM evaluation☆135Updated last month
- GenAI Experimentation☆57Updated this week
- A collection of all available inference solutions for the LLMs☆91Updated 4 months ago
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and…☆124Updated 2 months ago
- This repository will contain the presentation and python jupyter notebooks for the DataHack Summit 2024 conference talk, Improving Real-w…☆117Updated 9 months ago
- A collection of fine-tuning notebooks!☆27Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated 9 months ago
- Unlock the potential of finetuning Large Language Models (LLMs). Learn from industry expert, and discover when to apply finetuning, data …☆62Updated last year
- Marqo's Course on 'Fine-Tuning Embedding Models for Semantic Search'.☆45Updated 11 months ago
- Self-host LLMs with vLLM and BentoML☆134Updated 2 weeks ago
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆112Updated 11 months ago
- ☆34Updated 2 months ago
- Test LLMs automatically with Giskard and CI/CD☆30Updated 11 months ago
- ☆274Updated last month
- Notebooks and Code about Generative Ai, LLMs, MLOPS, NLP , CV and Graph databases☆118Updated last week
- ☆20Updated last year
- ☆185Updated last year
- A collection of architectural patterns leveraging Large Language Models (LLMs) for efficient Text-to-SQL generation.☆234Updated last year
- How far can we go with an LLM for a classification problem☆24Updated 7 months ago
- Building your first LLM application with OpenAI, and AI-assisted Development, step-by-step!☆98Updated last month
- Using LlamaIndex with Ray for productionizing LLM applications☆71Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆123Updated last week
- This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.☆164Updated last week
- Cookbooks for AI Agents☆145Updated 2 months ago