zhangfaen / finetune-InternVL2
☆30 Updated last year
Alternatives and similar repositories for finetune-InternVL2
Users who are interested in finetune-InternVL2 are comparing it to the repositories listed below
- Building a VLM model starts from the basic module. ☆18 Updated last year
- Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0) ☆18 Updated 2 years ago
- ☆186 Updated last year
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed ☆106 Updated last year
- Toward Universal Multimodal Embedding ☆69 Updated 4 months ago
- Train a LLaVA model with better Chinese support, and open-source the training code and data. ☆77 Updated last year
- Research Code for Multimodal-Cognition Team in Ant Group ☆169 Updated last month
- 【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling ☆128 Updated 6 months ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM ☆101 Updated last year
- Fine-tuning Qwen2.5-VL for vision-language tasks | Optimized for Vision understanding | LoRA & PEFT support. ☆145 Updated 10 months ago
- Multimodal MM + Chat collection ☆279 Updated 3 months ago
- ☆57 Updated last year