paperClub-hub / chinese_clip

Chinese CLIP: supports custom datasets and extracts vectors from text and images to perform text-image matching.

☆22

Alternatives and similar repositories for chinese_clip

Users who are interested in chinese_clip are comparing it to the libraries listed below.

joker-star-l / ai_lab5
Artificial Intelligence Lab 5: multimodal sentiment classification
☆16 Updated 3 years ago
Tencent / Lichee
A multimodal content understanding framework with modules for data processing, pretrained models, common models, and model acceleration.
☆323 Updated 4 years ago
yangjianxin1 / ClipCap-Chinese
An image captioning model based on ClipCap
☆319 Updated 3 years ago
iflytek / VLE
VLE: Vision-Language Encoder (a vision-language multimodal pretrained model)
☆194 Updated 2 years ago
yangjianxin1 / OFA-Chinese
A Chinese OFA model built with the transformers structure
☆136 Updated 2 years ago
MUGE-2021 / image-generation-baseline
☆32 Updated 3 years ago
yuanxiaosc / Multimodal-short-video-dataset-and-baseline-classification-model
A dataset of 500,000 multimodal short videos with baseline classification models (TensorFlow 2.0).
☆134 Updated 6 years ago
soup-L / Multimodal_retrieval
Internet image-text matching based on multimodal retrieval
☆15 Updated last year
billjie1 / Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
☆169 Updated 3 years ago
fuxuelinwudi / 2022-gaiic-track1-itmatch-baseline2022.3.2
☆22 Updated 3 years ago
kitsch231 / pytorch_fake_news_Classification_mml
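A multimodal classification task implemented in PyTorch. Text and image features are extracted with BERT and ResNet respectively (multiple models can be combined in the config). It performs well on the author's small dataset (96% validation accuracy).
☆81 Updated 2 years ago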
applenob / clip_chinese_text_encoder
Chinese encoder for CLIP
☆22 Updated 3 years ago
seanzhang-zhichen / PytorchBilstmCRF-Information-Extraction
An information extraction model based on BiLSTM + CRF
☆36 Updated 4 years ago
Asthestarsfalll / Sparse_MultiLabel_Categorical_CrossEntropy
Sparse Multilabel Categorical Crossentropy
☆11 Updated 2 years ago
TencentARC-QQ / QA-CLIP
Chinese CLIP models with SOTA performance.
☆59 Updated 2 years ago
zhanghaok / BERT-CRF-NER
A named entity recognition model based on BERT-CRF
☆13 Updated 3 years ago
Mingrui-Li / Qwen-VL-Lora-Model
A Qwen-VL model that can be successfully fine-tuned with LoRA
☆16 Updated 2 years ago
thu-ml / zh-clip
☆72 Updated 2 years ago
yangjianxin1 / CLIP-Chinese
Chinese CLIP pretrained model
☆419 Updated 2 years ago
BeatsLeo / ClipCap-Chinese
DIP & NLP final project (course design)
☆19 Updated 2 years ago
chuhaojin / BriVL-BUA-applications
Bling's Object detection tool
☆56 Updated 2 years ago
shawroad / Text-Generation-Chinese-Pytorch
☆14 Updated last year
zzz0627 / DataScraping-LLMs-FineTuning
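This project automates the collection, processing, and visualization of medical Q&A data to help build high-quality medical question-answer datasets. It also provides detailed instructions for fine-tuning Qwen-7B-Chat on the preprocessed dataset.
☆21 Updated 11 months ago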
WatchTower-Liu / VLM-learning
Building a VLM, starting from the basic modules.
☆18 Updated last year
XiPotatonium / LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
☆10 Updated 2 years ago
360CVGroup / SEEChat
Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM
☆101 Updated last year
km1994 / AwesomeMultiModel
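[AIGC hands-on beginner notes: the AIGC Skyscraper] Covers large language models (LLMs), efficient fine-tuning of large models (SFT), retrieval-augmented generation (RAG), agents, automatic PPT generation, role play, text-to-image (Stable Diffusion), optical character recognition (OCR), automatic speech recognition (ASR), …
☆42 Updated 7 months ago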
sjy0727 / CLIP-Text-Image-Retrieval
This project retrieves images that match an input text description.
☆42 Updated 2 years ago
redysky / multimodel
Product image retrieval, multimodal, deep learning
☆31 Updated 4 years ago
Pillars-Creation / Visualglm-image-to-text
Adds some files missing from Visualglm so that Visualglm can be trained; the example performs face reading (physiognomy) recognition on human faces.
☆13 Updated 2 years ago