paperClub-hub / chinese_clipLinks
中文CLIP:自定义数据集,可根据文图提取向量,实现文图匹配。
☆22Updated 2 years ago
Alternatives and similar repositories for chinese_clip
Users that are interested in chinese_clip are comparing it to the libraries listed below
Sorting:
- 基于ClipCap的看图说话Image Caption模型☆310Updated 3 years ago
- 人工智能实验五:多模态情感分类☆15Updated 3 years ago
- 一个多模态内容理解算法框架,其中包含数据处理、预 训练模型、常见模型以及模型加速等模块。☆320Updated 3 years ago
- VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)☆193Updated 2 years ago
- 基于多模态检索的互联网图文匹配☆14Updated last year
- 多模态视频分类模型☆22Updated 2 years ago
- transformers结构的中文OFA模型☆136Updated 2 years ago
- ☆22Updated 3 years ago
- 中文CLIP预训练模型☆417Updated 2 years ago
- Toward Universal Multimodal Embedding☆55Updated last month
- Building a VLM model starts from the basic module.☆17Updated last year
- 使用pytorch完成的一个多模态分类任务,文本和图像部分分别使用了bert和resnet提取特征(在config里可以组合多种模型),在我的小规模数据集上取得了良好的性能(验证集acc96%)☆80Updated 2 years ago
- DIP & NLP期末大作业 — 课程设计☆19Updated 2 years ago
- Chinese CLIP models with SOTA performance.☆57Updated 2 years ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆169Updated 2 years ago
- Sparse Multilabel Categorical Crossentropy☆11Updated last year
- 可以成功Lora微调的Qwen-VL模型☆17Updated last year
- 500,000 multimodal short video data and baseline models. 50万条多模态短视频数据集和基线模型(TensorFlow2.0)。☆130Updated 6 years ago
- 基于 BERT 模型的中文文本分类工具☆67Updated 3 years ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆100Updated last year
- 补充了一些Visualglm缺少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆13Updated 2 years ago
- ☆14Updated last year
- ATEC2023——赛道一: 大模型的知识引入Rank7方案分享☆24Updated 10 months ago
- 基于Bilstm + CRF的信息抽取模型☆36Updated 4 years ago
- 模式识别课设代码:图文生成(CLIP+DALLE+BriVL)☆21Updated 2 years ago
- 基于BERT-CRF的命名实体识别模型☆14Updated 3 years ago
- ☆16Updated last year
- 该项目旨在通过输入文本描述来检索与之相匹配的图片。☆41Updated 2 years ago
- ☆32Updated 2 years ago
- ☆30Updated last year