360CVGroup / SEEChat
Multimodal chatbot with computer vision capabilities integrated
☆98Updated 6 months ago
Related projects
Alternatives and complementary repositories for SEEChat
- ☆66Updated last year
- Open-source multimodal large language model based on baichuan-7b☆72Updated 11 months ago
- Research Code for Multimodal-Cognition Team in Ant Group☆123Updated 4 months ago
- ☆77Updated 6 months ago
- ☆156Updated 8 months ago
- Chinese CLIP models with SOTA performance.☆48Updated last year
- ☆55Updated 9 months ago
- TaiSu (太素): a large-scale Chinese multimodal dataset (a hundred-million-scale Chinese vision-language pre-training dataset)☆175Updated last year
- Vary-tiny codebase built upon LAVIS (for training from scratch) and a PDF image-text pair dataset (about 600k, including English/Chinese)☆68Updated 2 months ago
- Chinese OFA model in the transformers structure☆123Updated last year
- A simple MLLM that surpasses QwenVL-Max using only open-source data, with a 14B LLM.☆36Updated 2 months ago
- Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks☆284Updated 10 months ago
- ☆84Updated 4 months ago
- ☆30Updated 6 months ago
- An open-source, commercially usable multimodal model that supports bilingual (Chinese and English) vision-text dialogue.