XiPotatonium / LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for LAVIS
- ☆63Updated 10 months ago
- Chinese CLIP models with SOTA performance.☆48Updated last year
- ☆19Updated 2 years ago
- ☆25Updated 3 years ago
- WuDaoMM this is a data project☆66Updated 2 years ago
- ☆57Updated last year
- ☆157Updated last year
- ☆32Updated 2 years ago
- Bling's Object detection tool☆55Updated last year
- CLIP中文encoder☆21Updated 2 years ago
- ☆66Updated last year
- Tensorflow implementation for Dash☆26Updated 2 years ago
- Building a VLM model starts from the basic module.☆10Updated 7 months ago
- multi-task classifier☆21Updated last year
- 2019 CCF 大数据与计算智能大赛 视频版权检测算法 复赛第8名方案 | 8th place solution of Video Copyright Detection Algorithm Track, 2019 CCF Big Data & Computing Int…☆30Updated 4 years ago
- ☆29Updated 2 years ago
- Multimodal chatbot with computer vision capabilities integrated☆98Updated 5 months ago
- ☆59Updated last year
- 国内外数据竞赛资讯整理☆18Updated 3 years ago
- Product1M☆86Updated 2 years ago
- HHH☆33Updated 2 years ago
- 基于baichuan-7b的开源多模态大语言模型☆72Updated 11 months ago
- ☆11Updated 2 months ago
- ☆16Updated 2 years ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆164Updated 2 years ago
- ☆14Updated 7 months ago
- A new video text spotting framework with Transformer☆77Updated 2 years ago
- Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks☆179Updated last year