How to quickly serve an LLM using Fast API, Celery, and Redis
☆16Aug 29, 2023Updated 2 years ago
Alternatives and similar repositories for FastAPI-LLM-Model-Serving
Users that are interested in FastAPI-LLM-Model-Serving are comparing it to the libraries listed below
Sorting:
- A demonstration of using LangServe to create an API from a LCEL Chain!☆15Dec 14, 2023Updated 2 years ago
- A collection of fine-tuning notebooks!☆30Oct 5, 2023Updated 2 years ago
- ☆11May 25, 2021Updated 4 years ago
- Machine Learning Preprocessing CLI☆13Jun 17, 2024Updated last year
- CRUD with Authentication and Authorization using Get x cli pattern and Supabase☆12Nov 5, 2023Updated 2 years ago
- Big Data Inventory Management on AWS (Demand Forecasting, Machine Learning, Dashboarding) : Presented at Carlson School of Management dur…☆11Apr 15, 2020Updated 5 years ago
- Docker powered container for using Nginx as reverse-proxy in combination with an OpenVPN Client.☆11Jan 1, 2020Updated 6 years ago
- Hanya dokumentasi bagaimana menggunakan opencv pada python.☆12Updated this week
- Depenency free (so far) Vanilla JS Dashboard UI for the mediamtx streaming server. Dockerized.☆32Feb 2, 2026Updated last month
- cpp rotation album,基于cpp eigen实现的3d旋转相册,GAMES101复现内容☆12Jul 25, 2022Updated 3 years ago
- Internet of Things Programming Projects, 2nd Edition, published by Packt☆12Jun 11, 2024Updated last year
- ☆16Jan 1, 2025Updated last year
- End to End Sales Streaming Pipeline (FastAPI, Kafka, Spark, Cassandra, MySQL, Superset)☆10May 26, 2023Updated 2 years ago
- Implemeting Meta AI's VGGT as a FiftyOne Remote Zoo Model☆20Jun 20, 2025Updated 8 months ago
- CLV prediction with pareto-NBD model☆12Jul 1, 2016Updated 9 years ago
- All-in-one Speech Transcription☆10Jan 25, 2026Updated last month
- ☆18Feb 7, 2026Updated 3 weeks ago
- A starter kit for building secure ai agents on Cloudflare with Auth0☆20Dec 4, 2025Updated 2 months ago
- Accelerating LLM inference with techniques like speculative decoding, quantization, and kernel fusion, focusing on implementing state-of-…☆11Jul 1, 2025Updated 8 months ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- ☆10Apr 12, 2021Updated 4 years ago
- IaC scripts for Microsoft Fabric capacity☆11Jan 17, 2024Updated 2 years ago
- Fast and Computationally efficient Continual Learning for NanoDet anchor-free Object Detector☆12Dec 16, 2024Updated last year
- template repository of CRISP-DM☆14Feb 6, 2023Updated 3 years ago
- Python environment setup and customizations.☆12Apr 27, 2021Updated 4 years ago
- Albumentations Data Augmentation Plugin for FiftyOne!☆14Aug 22, 2024Updated last year
- mobileNet SSD 基于caffe的前向检测☆10Nov 30, 2018Updated 7 years ago
- Champion at Brainhack TIL 2023: Team 10000SGDMRT☆18May 29, 2024Updated last year
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆11Sep 4, 2023Updated 2 years ago
- AZ-204 Developing Solutions for Microsoft Azure, by Packt Publishing☆13Nov 13, 2023Updated 2 years ago
- Jetbot Voice to Action Tools is a set of ROS2 nodes that utilize the Jetson Automatic Speech Recognition (ASR) deep learning interface li…☆13Feb 6, 2026Updated 3 weeks ago
- Streamlit Cookbook, published by Packt☆14Jun 6, 2025Updated 8 months ago
- This is the base starter for kicking off your Nextjs project with Reflexjs.☆10Apr 15, 2021Updated 4 years ago
- This repo is designed to teach an introduction to Node-RED☆13Oct 18, 2019Updated 6 years ago
- ☆13Updated this week
- SODA ERC-721 smart contract☆10Dec 31, 2021Updated 4 years ago
- Scripts to prepare OXFORD VGG Face dataset☆12Mar 29, 2016Updated 9 years ago
- The Purpose of this repository is to create a DeepStream/Triton-Server sample application that utilizes yolov7, yolov7-qat, yolov9 models…☆12Apr 1, 2024Updated last year
- Implementation of Computer Vision Models in JAX (equinox)☆18Jan 15, 2026Updated last month