Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
☆718Mar 6, 2026Updated 2 weeks ago
Alternatives and similar repositories for PaddleMIX
Users that are interested in PaddleMIX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PaddlePaddle Code Convert Toolkit. 『飞桨』深度学习代码转换工具☆125Updated this week
- Paddle Automatically Diff Precision Toolkits.☆54Dec 5, 2025Updated 3 months ago
- High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle☆3,661Updated this week
- ERNIE Bot Agent is a Large Language Model (LLM) Agent Framework, powered by the advanced capabilities of ERNIE Bot and the platform resou…☆376Aug 20, 2024Updated last year
- 🚀🚀🚀 YOLO series of PaddlePaddle implementation, PP-YOLOE+, RT-DETR, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv10, YOLO11, YOLOX, YOLOv5u, Y…☆661Jan 14, 2026Updated 2 months ago
- Easy-to-use and powerful LLM and SLM library with awesome model zoo.☆12,934Dec 17, 2025Updated 3 months ago
- An experimental project for paddle python IR.☆15Dec 4, 2023Updated 2 years ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆307Sep 10, 2024Updated last year
- PFCC 社区博客☆14Mar 16, 2026Updated last week
- All-in-One Development Tool based on PaddlePaddle☆6,084Updated this week
- ONNX Model Exporter for PaddlePaddle☆905Jan 13, 2026Updated 2 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,825Updated this week
- 飞桨智能标注,让标注快人一步☆294Nov 25, 2024Updated last year
- AnyTrans: Translate AnyText in the Image with Large Scale Models (EMNLP2024 Findings)☆24Dec 11, 2024Updated last year
- A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D …☆637Apr 22, 2025Updated 11 months ago
- PaddlePaddle Developer Community☆137Updated this week
- 飞桨大模型开发套件,提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。☆477May 24, 2024Updated last year
- [ICML 2024] Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization☆23Dec 20, 2024Updated last year
- PASSL包含 SimCLR,MoCo v1/v2,BYOL,CLIP,PixPro,simsiam, SwAV, BEiT,MAE 等图像自监督算法以及 Vision Transformer,DEiT,Swin Transformer,CvT,T2T-ViT,MLP-…☆287Aug 1, 2023Updated 2 years ago
- paddle code convert toolkit☆22Mar 19, 2023Updated 3 years ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101May 17, 2024Updated last year
- 【HACKATHON 预备营】飞桨启航计划集训营☆17Mar 10, 2026Updated last week
- Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-ti…☆14,130Updated this week
- Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based a…☆1,691Feb 12, 2025Updated last year
- ☆15Jan 7, 2022Updated 4 years ago
- PaddleSlim is an open-source library for deep model compression and architecture search.☆1,613Jan 4, 2026Updated 2 months ago
- ☆16Dec 25, 2025Updated 2 months ago
- 整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-…☆933Aug 3, 2025Updated 7 months ago
- ☆25Apr 16, 2021Updated 4 years ago
- ☆268Nov 20, 2025Updated 4 months ago
- 视觉预训练基础模型仓库☆501Apr 12, 2023Updated 2 years ago
- 深度学习入门课、资深课、特色课、学术案例、产业实践案例、深度学习知识百科及面试题库The course, case and knowledge of Deep Learning and AI☆3,594Jul 25, 2024Updated last year
- Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)☆1,950Jan 24, 2026Updated last month
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识 别算法。☆415Sep 4, 2025Updated 6 months ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆9,904Sep 22, 2025Updated 6 months ago
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research☆22Dec 11, 2024Updated last year
- Awesome Easy-to-Use Deep Time Series Modeling based on PaddlePaddle, including comprehensive functionality modules like TSDataset, Analys…☆549Jul 11, 2025Updated 8 months ago
- 2025.01:从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两…☆33Feb 10, 2026Updated last month
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,104Feb 10, 2025Updated last year