This repo gives an introduction to how to make full working example to serve your model using asynchronous Celery tasks and FastAPI. 🔥 🔥 🔥 🔥
☆30May 21, 2024Updated 2 years ago
Alternatives and similar repositories for ml-models-in-production
Users that are interested in ml-models-in-production are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple example of FastAPI + Celery + Triton for benchmarking☆64Aug 11, 2022Updated 3 years ago
- This project is the backend engine for a fully autonomous AI-powered call center. It integrates a large language model (LLM), speech reco…☆22Apr 18, 2025Updated last year
- Notes and code for Programming Massively Parallel Processors☆13Mar 29, 2025Updated last year
- ☆15Jan 26, 2023Updated 3 years ago
- ☆24Mar 1, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of Nvidia DeepStream 7 with YOLOv9 Models.☆15Jun 22, 2024Updated last year
- A list of papers and other resources on computer vision and deep learning.☆26Sep 16, 2020Updated 5 years ago
- Hikvision Events is an interface between a Hikvision NVR and SmartThings that allows events, such as line crossings and motion detection …☆14Mar 25, 2021Updated 5 years ago
- How to quickly serve an LLM using Fast API, Celery, and Redis☆17Aug 29, 2023Updated 2 years ago
- Outbound Caller for Eleven Labs Conversational AI Agents☆22Jan 14, 2025Updated last year
- Cải thiện Elasticsearch trong bài toán semantic search sử dụng phương pháp Sentence Embeddings☆25May 27, 2021Updated 5 years ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆18Aug 30, 2024Updated last year
- This repository utilizes the Triton Inference Server Client, which streamlines the complexity of model deployment.☆21Sep 1, 2024Updated last year
- End-to-end ELT data engineering project☆23Dec 24, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Create WebRTC audio or video conference calls on using our Python SDK☆10Jun 18, 2024Updated last year
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Oct 5, 2023Updated 2 years ago
- PaddleOCR + OnnxRuntime☆16Oct 21, 2023Updated 2 years ago
- Code samples from twitch.tv/fhinkel☆11May 21, 2022Updated 4 years ago
- [AAAI 2026] Official implementation of the paper ”SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D F…☆53Jan 8, 2026Updated 4 months ago
- WACV2025☆33Mar 3, 2025Updated last year
- A PyTorch toolkit for 2D Human Pose Estimation.☆13Jan 4, 2019Updated 7 years ago
- ☆11Mar 30, 2025Updated last year
- Correction of spaces with character-based neural language models.☆13Aug 23, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PoC with FastAPI and Celery to ML inference☆96Dec 8, 2022Updated 3 years ago
- ☆12Jun 7, 2021Updated 4 years ago
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- 2D Vector-Quantized Auto-Encoder for compression of Whole-Slide Images in Histopathology☆16Jul 18, 2024Updated last year
- Tiny Agent: Production-Ready LLM Agent SDK for Every Developer☆41Sep 29, 2025Updated 8 months ago
- ☆31Oct 16, 2024Updated last year
- Timeline Summarization based on Event Graph Compression via Time-Aware Optimal Transport☆17Nov 8, 2021Updated 4 years ago
- Deforming 2d images using ARAP☆14Jun 15, 2022Updated 3 years ago
- Dữ liệu thô Điểm thi tốt nghiệp THPT 2022. (The 2022 high school graduation exam score raw data)☆43Feb 7, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Top 2 Solution for Zalo AI Challenge 2022 - Liveness Detection track☆43Dec 3, 2022Updated 3 years ago
- ☆25Dec 19, 2024Updated last year
- Your Game's Pipeline, Automated ⚡☆14May 20, 2026Updated last week
- My own Docker tutorial for self-learning☆94Jul 13, 2022Updated 3 years ago
- ☆31Jun 9, 2025Updated 11 months ago
- [ICME 2021 Oral] CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning☆19Jul 5, 2021Updated 4 years ago
- Demo of Agentic RAG☆38Aug 23, 2025Updated 9 months ago