High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
☆3,661Mar 19, 2026Updated this week
Alternatives and similar repositories for FastDeploy
Users who are interested in FastDeploy are comparing it to the libraries listed below.
- 🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉☆4,387Feb 25, 2026Updated 3 weeks ago
- PaddleSlim is an open-source library for deep model compression and architecture search.☆1,614Jan 4, 2026Updated 2 months ago
- Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-ti…☆14,111Updated this week
- ONNX Model Exporter for PaddlePaddle☆905Jan 13, 2026Updated 2 months ago
- 🚀🚀🚀 YOLO series of PaddlePaddle implementation, PP-YOLOE+, RT-DETR, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv10, YOLO11, YOLOX, YOLOv5u, Y…☆661Jan 14, 2026Updated 2 months ago
- Implementation of popular deep learning networks with TensorRT network definition API☆7,720Mar 7, 2026Updated last week
- A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D …☆637Apr 22, 2025Updated 10 months ago
- OpenMMLab Model Deployment Framework☆3,108Sep 30, 2024Updated last year
- All-in-One Development Tool based on PaddlePaddle☆6,079Updated this week
- ☆2,289Apr 9, 2025Updated 11 months ago
- Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentat…☆9,314Feb 5, 2026Updated last month
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆22,908Updated this week
- PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge☆7,236May 22, 2025Updated 9 months ago
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.☆2,661Jan 22, 2026Updated last month
- An Easy-to-Use and High-Performance AI Deployment Framework☆1,762Updated this week
- C++ library based on TensorRT integration☆2,862May 24, 2023Updated 2 years ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,800Mar 9, 2026Updated last week
- ☆268Nov 20, 2025Updated 4 months ago
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,787Mar 28, 2024Updated last year
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆72,234Updated this week
- A flexible, high-performance carrier for machine learning models (PaddlePaddle serving deployment framework)☆925Feb 20, 2026Updated 3 weeks ago
- A treasure chest for visual classification and recognition powered by PaddlePaddle☆5,785Oct 27, 2025Updated 4 months ago
- Easy-to-use and powerful LLM and SLM library with awesome model zoo.☆12,930Dec 17, 2025Updated 3 months ago
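- PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)☆23,762Updated this week
- Deep learning model converter for PaddlePaddle.☆770Oct 22, 2025Updated 4 months ago
- PaddlePaddle intelligent labeling: get your annotation done a step faster☆294Nov 25, 2024Updated last year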
- YOLOv8 accelerated with TensorRT!☆1,747Apr 30, 2025Updated 10 months ago
- MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.☆14,533Updated this week
- Effortless data labeling with AI support from Segment Anything and other awesome models.☆8,420Mar 9, 2026Updated last week
- YOLOv6: a single-stage object detection framework dedicated to industrial applications.☆5,875Aug 7, 2024Updated last year
- YOLOv3, YOLOv4, YOLOv5, YOLOv5-Lite, YOLOv6-v1, YOLOv6-v2, YOLOv7, YOLOX, YOLOX-Lite, PP-YOLOE, PP-PicoDet-Plus, YOLO-Fastest v2, FastestDet, YOLOv5-S…☆765Oct 25, 2022Updated 3 years ago
- A primitive library for neural network☆1,367Nov 24, 2024Updated last year
- Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pret…☆717Mar 6, 2026Updated last week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆19,568Updated this week
- PaddleFormers is an easy-to-use library of pre-trained large language model zoo based on PaddlePaddle.☆12,988Updated this week
- TensorRT for the YOLO series (YOLOv11, YOLOv10, YOLOv9, YOLOv8, YOLOv7, YOLOv6, YOLOX, YOLOv5), with NMS plugin support☆1,146Oct 15, 2025Updated 5 months ago
- ONNX-TensorRT: TensorRT backend for ONNX☆3,194Feb 3, 2026Updated last month
- TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop and server. TNN is …☆4,626May 9, 2025Updated 10 months ago
- NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone…☆6,171Aug 8, 2024Updated last year