High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
☆3,700Jun 24, 2026Updated this week
Alternatives and similar repositories for FastDeploy
Users that are interested in FastDeploy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉☆4,412Mar 19, 2026Updated 3 months ago
- PaddleSlim is an open-source library for deep model compression and architecture search.☆1,613Jan 4, 2026Updated 5 months ago
- Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-ti…☆14,276May 28, 2026Updated last month
- ONNX Model Exporter for PaddlePaddle☆933Mar 18, 2026Updated 3 months ago
- 🚀🚀🚀 YOLO series of PaddlePaddle implementation, PP-YOLOE+, RT-DETR, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv10, YOLO11, YOLOX, YOLOv5u, Y…☆667Jan 14, 2026Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Implementation of popular deep learning networks with TensorRT network definition API☆7,806Updated this week
- A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D …☆642Apr 22, 2025Updated last year
- FlyCV is a high-performance library for processing computer visual tasks.☆597Jun 2, 2023Updated 3 years ago
- OpenMMLab Model Deployment Framework☆3,127Sep 30, 2024Updated last year
- All-in-One Development Tool based on PaddlePaddle☆6,172Updated this week
- Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentat…☆9,350Feb 5, 2026Updated 4 months ago
- ☆2,560Apr 9, 2025Updated last year
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆23,427Updated this week
- PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)☆7,260Apr 27, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.☆2,694May 28, 2026Updated last month
- 一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework☆1,832Apr 25, 2026Updated 2 months ago
- C++ library based on tensorrt integration☆2,882May 24, 2023Updated 3 years ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆13,102Updated this week
- ☆269Nov 20, 2025Updated 7 months ago
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,803Mar 28, 2024Updated 2 years ago
- A flexible, high-performance carrier for machine learning models(『飞桨』服务化部署框架)☆921Feb 20, 2026Updated 4 months ago
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆83,904Updated this week
- A treasure chest for visual classification and recognition powered by PaddlePaddle☆5,816Jun 23, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Easy-to-use and powerful LLM and SLM library with awesome model zoo.☆12,955May 23, 2026Updated last month
- PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)☆23,983Jun 23, 2026Updated last week
- Deep learning model converter for PaddlePaddle. (『飞桨』深度学习模型转换工具)☆772Oct 22, 2025Updated 8 months ago
- 飞桨智能标注,让标注快人一步☆298Nov 25, 2024Updated last year
- A primitive library for neural network☆1,367Nov 24, 2024Updated last year
- YOLOv8 using TensorRT accelerate !☆1,790Jun 10, 2026Updated 2 weeks ago
- MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.☆15,556Updated this week
- Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pret…☆725Mar 6, 2026Updated 3 months ago
- YOLOv3、YOLOv4、YOLOv5、YOLOv5-Lite、YOLOv6-v1、YOLOv6-v2、YOLOv7、YOLOX、YOLOX-Lite、PP-YOLOE、PP-PicoDet-Plus、YOLO-Fastest v2、FastestDet、YOLOv5-S…☆767Oct 25, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- YOLOv6: a single-stage object detection framework dedicated to industrial applications.☆5,883Aug 7, 2024Updated last year
- Effortless data labeling with AI support from Segment Anything and other awesome models.☆9,513Jun 20, 2026Updated last week
- PaddleFormers is an easy-to-use library of pre-trained large language model zoo based on PaddlePaddle.☆12,984Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆20,893Jun 23, 2026Updated last week
- ONNX-TensorRT: TensorRT backend for ONNX☆3,212Jun 22, 2026Updated last week
- tensorrt for yolo series (YOLOv11,YOLOv10,YOLOv9,YOLOv8,YOLOv7,YOLOv6,YOLOX,YOLOv5), nms plugin support☆1,163Oct 15, 2025Updated 8 months ago
- TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is …☆4,641May 9, 2025Updated last year