yangjianxin1 / ClipCap-ChineseView external linksLinks
基于ClipCap的看图说话Image Caption模型
☆321Apr 1, 2022Updated 3 years ago
Alternatives and similar repositories for ClipCap-Chinese
Users that are interested in ClipCap-Chinese are comparing it to the libraries listed below
Sorting:
- Simple image captioning model☆1,408Jun 9, 2024Updated last year
- Cross-lingual image captioning☆91May 9, 2022Updated 3 years ago
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)☆203Jan 28, 2024Updated 2 years ago
- OpenLLMDE: An open source data engineering framework for LLMs☆18Sep 9, 2023Updated 2 years ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆5,795Aug 29, 2025Updated 5 months ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆169Nov 3, 2022Updated 3 years ago
- 中文CLIP预训练模型☆423Dec 5, 2022Updated 3 years ago
- 图像中文描述+视觉注意力☆192Jan 9, 2020Updated 6 years ago
- transformers结构的中文OFA模型☆139Feb 13, 2023Updated 3 years ago
- CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022☆29Dec 1, 2022Updated 3 years ago
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)☆198May 9, 2023Updated 2 years ago
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 3 years ago
- baseline for MGTV competition 2022 PIR☆11Apr 11, 2022Updated 3 years ago
- 非常好用的工具包,可以直接安装并使用☆21Mar 18, 2022Updated 3 years ago
- 中文bigbird预训练模型☆96Jul 5, 2022Updated 3 years ago
- ☆13Jun 19, 2021Updated 4 years ago
- ☆24Apr 4, 2022Updated 3 years ago
- A curated list of image captioning and related area resources. :-)☆1,074Mar 28, 2023Updated 2 years ago
- Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023☆162Sep 9, 2024Updated last year
- 中文CLIP:自定义数据集,可根据文图提取向量,实现文图匹配。☆22Sep 14, 2022Updated 3 years ago
- codes for GAIIC-Track1☆15Jun 14, 2022Updated 3 years ago
- ☆14Jan 10, 2024Updated 2 years ago
- PyTorch implementation of Chinese image captioning on AI_challenger dataset☆34Dec 25, 2019Updated 6 years ago
- Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).☆202Jun 8, 2022Updated 3 years ago
- [ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383☆421Oct 28, 2022Updated 3 years ago
- 支持中英文双语视觉-文本 对话的开源可商用多模态模型。☆378Sep 23, 2023Updated 2 years ago
- Kanchil(鼷鹿)是世界上最小的偶蹄目动物,这个开源项目意在探索小模型(6B以下)是否也能具备和人类偏好对齐的能力。☆113Apr 1, 2023Updated 2 years ago
- ☆60Nov 17, 2022Updated 3 years ago
- 对比学习 虾皮同款商品匹配☆16Jan 29, 2022Updated 4 years ago
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Nov 21, 2021Updated 4 years ago
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)☆246Jun 10, 2025Updated 8 months ago
- Controllable mage captioning model with unsupervised modes☆21Apr 14, 2023Updated 2 years ago
- DataFountain第五届达观杯第4名方案☆49Oct 7, 2022Updated 3 years ago
- Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。☆4,148Aug 13, 2024Updated last year
- PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation☆5,669Aug 5, 2024Updated last year
- Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)☆19Oct 15, 2022Updated 3 years ago
- 基于词汇信息融合的中文NER模型☆170Apr 2, 2022Updated 3 years ago
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…☆2,555Apr 24, 2024Updated last year
- AI Challenger 2018 阅读理解赛道代码分享☆20Dec 6, 2018Updated 7 years ago