基于ClipCap的看图说话Image Caption模型
☆321Apr 1, 2022Updated 3 years ago
Alternatives and similar repositories for ClipCap-Chinese
Users that are interested in ClipCap-Chinese are comparing it to the libraries listed below
Sorting:
- Simple image captioning model☆1,412Jun 9, 2024Updated last year
- Cross-lingual image captioning☆91May 9, 2022Updated 3 years ago
- OpenLLMDE: An open source data engineering framework for LLMs☆18Sep 9, 2023Updated 2 years ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆5,812Aug 29, 2025Updated 6 months ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆169Nov 3, 2022Updated 3 years ago
- 中文CLIP预训练模型☆423Dec 5, 2022Updated 3 years ago
- Image Chinese Description Generation Based on Multi-level Selective Visual Semantic Attributes☆16Nov 2, 2021Updated 4 years ago
- 图像中文描述+视觉注意力☆193Jan 9, 2020Updated 6 years ago
- transformers结构的中文OFA模型☆139Feb 13, 2023Updated 3 years ago
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)☆198May 9, 2023Updated 2 years ago
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 3 years ago
- baseline for MGTV competition 2022 PIR☆11Apr 11, 2022Updated 3 years ago
- 非常好用的工具包,可以直接安装并使用☆21Mar 18, 2022Updated 3 years ago
- 中文bigbird预训练模型☆96Jul 5, 2022Updated 3 years ago
- ☆13Jun 19, 2021Updated 4 years ago
- ☆24Apr 4, 2022Updated 3 years ago
- A curated list of image captioning and related area resources. :-)☆1,074Mar 28, 2023Updated 2 years ago
- 中文CLIP:自定义数据集,可根据文图提取向量,实现文图匹配。☆22Sep 14, 2022Updated 3 years ago
- codes for GAIIC-Track1☆15Jun 14, 2022Updated 3 years ago
- ☆14Jan 10, 2024Updated 2 years ago
- Image Captioning using combination of object detection via YOLOv5 and Encoder Decoder LSTM model☆15Oct 13, 2022Updated 3 years ago
- PyTorch implementation of Chinese image captioning on AI_challenger dataset☆34Dec 25, 2019Updated 6 years ago
- Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).☆202Jun 8, 2022Updated 3 years ago
- [ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383☆420Oct 28, 2022Updated 3 years ago
- Kanchil(鼷鹿)是世界上最小的偶蹄目动物,这个开源项目意在探索小模型(6B以下)是否也能具备和人类偏好对齐的能力。☆112Apr 1, 2023Updated 2 years ago
- CoSENT、STS、SentenceBERT☆170Feb 11, 2025Updated last year
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Nov 21, 2021Updated 4 years ago
- 对比学习 虾皮同款商品匹配☆16Jan 29, 2022Updated 4 years ago
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)☆246Jun 10, 2025Updated 8 months ago
- Controllable mage captioning model with unsupervised modes☆21Apr 14, 2023Updated 2 years ago
- PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation☆5,689Updated this week
- Easy-to-use CPM for Chinese text generation(基于CPM的中文文本生成)☆531Apr 10, 2023Updated 2 years ago
- Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)☆19Oct 15, 2022Updated 3 years ago
- 基于词汇信息融合的中文NER模型☆170Apr 2, 2022Updated 3 years ago
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…☆2,554Apr 24, 2024Updated last year
- AI Challenger 2018 阅读理解赛道代码分享☆20Dec 6, 2018Updated 7 years ago
- Meshed-Memory Transformer for Image Captioning. CVPR 2020☆545Dec 21, 2022Updated 3 years ago
- A curated list of Multimodal Captioning related research(including image captioning, video captioning, and text captioning)☆113Jun 6, 2022Updated 3 years ago
- 中文机器阅读理解数据集☆109Mar 29, 2021Updated 4 years ago