PaddlePaddle / PaddleMIX
Paddle Multimodal Integration and eXploration, supporting mainstream multimodal tasks, including end-to-end large-scale multimodal pretrained models and a diffusion model toolbox, with high performance and flexibility.
☆541Updated this week
Alternatives and similar repositories for PaddleMIX:
Users who are interested in PaddleMIX compare it to the libraries listed below
- Applications of large language models and multimodal models, mainly covering RAG, small models, agents, cross-modal search, OCR, and more☆157Updated 4 months ago
- Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conv…☆421Updated this week
- ☆103Updated 11 months ago
- A toolbox of YOLO models and algorithms based on MindSpore☆120Updated last week
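- PaddlePaddle Code Convert Toolkit: a deep learning code conversion tool for PaddlePaddle (飞桨)☆95Updated last week
- An open-source, commercially usable multimodal model supporting bilingual (Chinese and English) visual-text dialogue.☆363Updated last year
- Demo of inference deployment for Tongyi Qianwen (Qwen) with vLLM☆541Updated 11 months ago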
- huggingface mirror download☆566Updated 3 months ago
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking☆574Updated this week
- ☆252Updated last month
- ERNIE Bot Agent is a Large Language Model (LLM) Agent Framework, powered by the advanced capabilities of ERNIE Bot and the platform resou…☆360Updated 6 months ago
- A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.☆794Updated 2 weeks ago
- Repository of visual pre-trained foundation models☆497Updated last year
- PaddlePaddle Developer Community☆98Updated this week
- Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)☆616Updated 2 months ago
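- PaddlePaddle large model development suite, providing a full-workflow development toolchain for large language models, cross-modal large models, biocomputing large models, and other domains.☆461Updated 9 months ago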
- A toolbox of OCR models and algorithms based on MindSpore☆253Updated last week
- Enhance LLM agents with rich tool APIs☆378Updated 6 months ago
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge☆114Updated 4 months ago
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.☆1,817Updated 2 months ago
- ☆770Updated this week
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,065Updated 2 months ago
- ONNX Model Exporter for PaddlePaddle☆772Updated last week
- The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".☆237Updated last year
- Multimodal Chinese LLaMA & Alpaca large language model (VisualCLA)☆441Updated last year
- Multimodal (MM) + Chat collection☆247Updated 3 weeks ago
- Train a 1B LLM on 1T tokens from scratch as an individual☆569Updated this week