XiPotatonium / LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
☆10Updated last year
Alternatives and similar repositories for LAVIS:
Users that are interested in LAVIS are comparing it to the libraries listed below
- ☆65Updated last year
- WuDaoMM this is a data project☆70Updated 2 years ago
- Chinese CLIP models with SOTA performance.☆53Updated last year
- ☆158Updated last year
- ☆67Updated last year
- ☆32Updated 2 years ago
- Source code and checkpoints for legal pre-trained language models.☆15Updated 3 years ago
- Bling's Object detection tool☆56Updated 2 years ago
- Multimodal chatbot with computer vision capabilities integrated☆100Updated 8 months ago
- source codes of CCF BDCI champion team☆27Updated 5 years ago
- ☆19Updated 2 years ago
- ☆57Updated 2 years ago
- multi-task classifier☆22Updated last year
- 2019 CCF 大数据与计算智能大赛 视频版权检测算法 复赛第8名方案 | 8th place solution of Video Copyright Detection Algorithm Track, 2019 CCF Big Data & Computing Int…☆30Updated 5 years ago
- 国内外数据竞赛资讯整理☆18Updated 3 years ago
- ☆27Updated 3 years ago
- lightweighted deep learning inference service framework☆40Updated 3 years ago
- Product1M☆87Updated 2 years ago
- Building a VLM model starts from the basic module.☆11Updated 9 months ago
- CLIP中文encoder☆22Updated 2 years ago
- 500,000 multimodal short video data and baseline models. 50万条多模态短视频数据集和基线模型(TensorFlow2.0)。☆128Updated 5 years ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆36Updated 4 months ago
- ☆62Updated last year
- 基于baichuan-7b的开源多模态大语言模型☆73Updated last year
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆166Updated 2 years ago
- ☆56Updated last year
- A new video text spotting framework with Transformer☆77Updated 2 years ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆27Updated last year
- 图片简易标注工具,标注类似ICDAR数据集,支持多边形标注,文本标注,方便OCR数据集标注。☆54Updated 5 years ago
- Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)☆18Updated last year