TencentARC-QQ / TagGPT
TagGPT: Large Language Models are Zero-shot Multimodal Taggers
☆63Updated last year
Alternatives and similar repositories for TagGPT:
Users that are interested in TagGPT are comparing it to the libraries listed below
- the world's first large-scale multi-modal short-video encyclopedia, where the primitive units are items, aspects, and short videos.☆60Updated last year
- ☆64Updated last year
- 基于baichuan-7b的开源多模态大语言模型☆73Updated last year
- ☆67Updated last year
- Bling's Object detection tool☆56Updated 2 years ago
- ☆36Updated 6 months ago
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆179Updated last year
- ☆159Updated last year
- WuDaoMM this is a data project☆71Updated 2 years ago
- Product1M☆87Updated 2 years ago
- Our 2nd-gen LMM☆33Updated 9 months ago
- Chinese CLIP models with SOTA performance.☆53Updated last year
- ☆28Updated 6 months ago
- Empirical Study Towards Building An Effective Multi-Modal Large Language Model☆23Updated last year
- ☆32Updated 2 years ago
- ☆22Updated last year
- Touchstone: Evaluating Vision-Language Models by Language Models☆82Updated last year
- ☆17Updated last year
- Bridging Vision and Language Model☆283Updated last year
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆53Updated last year
- Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks☆292Updated last year
- Research Code for Multimodal-Cognition Team in Ant Group☆139Updated 8 months ago
- A curated list of resources about long-context in large-language models and video understanding.☆30Updated last year
- repository for CharacterChat, a personalized social support system☆67Updated 8 months ago
- ☆65Updated last year
- ☆19Updated 2 years ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆73Updated 3 months ago
- code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》☆31Updated last year
- The Document of WenLan API, which was used to obtain image and text feature.☆37Updated 2 years ago
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆102Updated 2 years ago