High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
☆3,649Updated this week
Alternatives and similar repositories for FastDeploy
Users that are interested in FastDeploy are comparing it to the libraries listed below
Sorting:
- 🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉☆4,374Updated this week
- ONNX Model Exporter for PaddlePaddle☆901Jan 13, 2026Updated last month
- PaddleSlim is an open-source library for deep model compression and architecture search.☆1,612Jan 4, 2026Updated last month
- Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-ti…☆14,083Feb 13, 2026Updated 2 weeks ago
- 🚀🚀🚀 YOLO series of PaddlePaddle implementation, PP-YOLOE+, RT-DETR, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv10, YOLO11, YOLOX, YOLOv5u, Y…☆659Jan 14, 2026Updated last month
- Implementation of popular deep learning networks with TensorRT network definition API☆7,680Updated this week
- OpenMMLab Model Deployment Framework☆3,100Sep 30, 2024Updated last year
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.☆2,652Jan 22, 2026Updated last month
- All-in-One Development Tool based on PaddlePaddle☆6,038Feb 14, 2026Updated last week
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆22,819Feb 20, 2026Updated last week
- Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentat…☆9,305Feb 5, 2026Updated 3 weeks ago
- 一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework☆1,743Updated this week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,702Feb 13, 2026Updated 2 weeks ago
- ☆2,228Apr 9, 2025Updated 10 months ago
- C++ library based on tensorrt integration☆2,854May 24, 2023Updated 2 years ago
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,785Mar 28, 2024Updated last year
- ☆269Nov 20, 2025Updated 3 months ago
- A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D …☆634Apr 22, 2025Updated 10 months ago
- PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)☆7,227May 22, 2025Updated 9 months ago
- MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM …☆14,248Feb 16, 2026Updated last week
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆71,012Feb 16, 2026Updated last week
- YOLOv6: a single-stage object detection framework dedicated to industrial applications.☆5,876Aug 7, 2024Updated last year
- A primitive library for neural network☆1,367Nov 24, 2024Updated last year
- YOLOv3、YOLOv4、YOLOv5、YOLOv5-Lite、YOLOv6-v1、YOLOv6-v2、YOLOv7、YOLOX、YOLOX-Lite、PP-YOLOE、PP-PicoDet-Plus、YOLO-Fastest v2、FastestDet、YOLOv5-S…☆765Oct 25, 2022Updated 3 years ago
- Easy-to-use and powerful LLM and SLM library with awesome model zoo.☆12,912Dec 17, 2025Updated 2 months ago
- YOLOv8 using TensorRT accelerate !☆1,733Apr 30, 2025Updated 9 months ago
- A treasure chest for visual classification and recognition powered by PaddlePaddle☆5,780Oct 27, 2025Updated 4 months ago
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆19,389Updated this week
- TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is …☆4,619May 9, 2025Updated 9 months ago
- Effortless data labeling with AI support from Segment Anything and other awesome models.☆8,229Updated this week
- A flexible, high-performance carrier for machine learning models(『飞桨』服务化部署框架)☆922Feb 20, 2026Updated last week
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,618Updated this week
- 🔥🔥🔥🔥 (Earlier YOLOv7 not official one) YOLO with Transformers and Instance Segmentation, with TensorRT acceleration! 🔥🔥🔥☆3,120Nov 18, 2023Updated 2 years ago
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,375Updated this week
- NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone…☆6,163Aug 8, 2024Updated last year
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, MNN, PaddlePaddle and PyTorch.☆5,980Feb 13, 2026Updated 2 weeks ago
- PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)☆23,642Updated this week
- ONNX-TensorRT: TensorRT backend for ONNX☆3,188Feb 3, 2026Updated 3 weeks ago
- YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documenta…☆10,341Jun 8, 2025Updated 8 months ago