OpenBMB / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

☆10

Related projects ⓘ

Alternatives and complementary repositories for vllm

yh-hust / PDF-Wukong
【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
☆93Updated 3 weeks ago
large-ocr-model / large-ocr-model.github.io
☆156Updated 8 months ago
alipay / Ant-Multi-Modal-Framework
Research Code for Multimodal-Cognition Team in Ant Group
☆122Updated 4 months ago
bytedance / TextHarmony
The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation
☆65Updated last month
ucaslcl / Fox
official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"
☆127Updated 5 months ago
yuyq96 / TextHawk
Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models
☆51Updated last week
Ucas-HaoranWei / Vary-tiny-600k
Vary-tiny codebase upon LAVIS （for training from scratch）and a PDF image-text pairs data (about 600k including English/Chinese)
☆68Updated last month
Ucas-HaoranWei / Vary-family
☆55Updated 9 months ago
LDLINGLINGLING / adan_application
个人项目地址，一些大语言模型和多模态模型的应用
☆117Updated last week
LayTextLLM / LayTextLLM
☆64Updated this week
xverse-ai / XVERSE-V-13B
☆77Updated 6 months ago
WatchTower-Liu / VLM-learning
Building a VLM model starts from the basic module.
☆10Updated 7 months ago
LinWeizheDragon / FLMR
The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.
☆68Updated 2 months ago
360CVGroup / SEEChat
Multimodal chatbot with computer vision capabilities integrated
☆98Updated 5 months ago
MonolithFoundation / Bumblebee
A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.
☆36Updated 2 months ago
bzluan / TextCoT
The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.
☆32Updated last month
harrytea / Awesome-Document-Understanding
Document Artifical Intelligence
☆127Updated last month
LingyvKong / OneChart
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
☆194Updated 3 weeks ago
opendatalab / VIGC
AAAI 2024: Visual Instruction Generation and Correction
☆90Updated 9 months ago
SCUT-DLVCLab / GPT-4V_OCR
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
☆120Updated last year
RLHF-V / RLHF-V
[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
☆230Updated 2 months ago
ggg0919 / cantor
☆67Updated 6 months ago
UniModal4Reasoning / StructEqTable-Deploy
A High-efficiency Open-source Toolkit for Table-to-Latex Task
☆143Updated last week
codefuse-ai / CodeFuse-MFT-VLM
☆33Updated 3 weeks ago
360CVGroup / 360VL
☆30Updated 5 months ago
LukeForeverYoung / UReader
☆112Updated 8 months ago
Chunchunwumu / SEMv3
The official PyTorch implementation of SEMv3.
☆27Updated 5 months ago
hhaAndroid / awesome-mm-chat
多模态 MM +Chat 合集
☆204Updated last week
yfzhang114 / SliME
✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models
☆137Updated this week
thu-ml / zh-clip
☆66Updated last year