baidubce / Qianfan-VL
Qianfan-VL: Domain-Enhanced Universal Vision-Language Models
☆158 · Updated 3 weeks ago
Alternatives and similar repositories for Qianfan-VL
Users interested in Qianfan-VL are comparing it to the repositories listed below.
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…" ☆377 · Updated last week
- ☆186 · Updated 8 months ago
- ☆92 · Updated 3 weeks ago
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval ☆226 · Updated 4 months ago
- A toolkit for knowledge distillation of large language models ☆171 · Updated last week
- The official repository of the dots.vlm1 instruct models proposed by rednote-hilab ☆260 · Updated 3 weeks ago
- Dataset and code for our ACL 2024 paper "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train … ☆218 · Updated 4 months ago
- [ACM MM25] The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs" ☆93 · Updated 2 months ago
- PDF parsing toolkit: a vLLM-accelerated implementation of GOT, using MinerU for layout detection and cropping and GOT for table and formula parsing, to enable PDF parsing for RAG ☆63 · Updated 11 months ago
- MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources ☆198 · Updated 3 weeks ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models ☆63 · Updated 11 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, image, and video data ☆252 · Updated 2 months ago
- OpenSeek aims to unite the global open-source community to drive collaborative innovation in algorithms, data, and systems to develop next… ☆232 · Updated last month
- Max's awesome datasets ☆49 · Updated last month
- a-m-team's exploration in large language modeling ☆189 · Updated 4 months ago
- Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding" ☆155 · Updated last year
- Ling is a MoE LLM provided and open-sourced by InclusionAI ☆215 · Updated 5 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too… ☆332 · Updated last month
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent ☆383 · Updated 5 months ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token" ☆228 · Updated 6 months ago
- Research code for the Multimodal-Cognition Team at Ant Group ☆167 · Updated last week
- [arXiv] PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling ☆126 · Updated 4 months ago
- This project collects and collates various datasets for multimodal large-model training, including but not limited to pre-training … ☆57 · Updated 5 months ago
- DeepSpeed tutorials, annotated examples, and study notes (efficient training of large models) ☆178 · Updated 2 years ago
- ☆296 · Updated 4 months ago
- [ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text ☆396 · Updated 5 months ago
- ☆68 · Updated 2 months ago
- SUS-Chat: Instruction tuning done right ☆49 · Updated last year
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation ☆389 · Updated last week
- The official repo of "One RL to See Them All: Visual Triple Unified Reinforcement Learning" ☆318 · Updated 4 months ago