CLUEbenchmark / SuperCLUE-Video
A Chinese-native, multi-level text-to-video evaluation benchmark
☆17Updated 10 months ago
Alternatives and similar repositories for SuperCLUE-Video
Users who are interested in SuperCLUE-Video are comparing it to the libraries listed below.
- A Chinese-native text-to-image evaluation benchmark☆9Updated 10 months ago
- Chinese CLIP models with SOTA performance.☆55Updated last year
- The world's first large-scale multi-modal short-video encyclopedia, where the primitive units are items, aspects, and short videos.☆61Updated last year
- ☆67Updated last year
- ☆32Updated 2 years ago
- An open-source multimodal large language model based on baichuan-7b☆73Updated last year
- Silk Road will be the dataset zoo for Luotuo (骆驼). Luotuo is an open-source Chinese LLM project founded by 陈启源 @ Central China Normal University & 李鲁鲁 @ SenseTime & 冷子…☆39Updated last year
- WuDaoMM: a data project☆73Updated 3 years ago
- TaiSu (太素): a large-scale Chinese multimodal dataset (a hundred-million-scale Chinese vision-language pre-training dataset)☆181Updated last year
- ☆28Updated last year
- ☆66Updated last year
- ☆19Updated 3 years ago
- SuperCLUE-Math6: exploring a new-generation, Chinese-native, multi-turn, multi-step mathematical reasoning dataset☆55Updated last year
- Code for the paper "RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement"☆31Updated last year
- A simple MLLM, built on a 14B LLM, that surpasses QwenVL-Max using only open-source data.☆37Updated 8 months ago
- ☆59Updated 2 years ago
- This project uses the LLaVA 1.6 multimodal model to implement text-to-image and image-to-image search.☆21Updated last year
- Bling's Object detection tool☆56Updated 2 years ago
- Miscellaneous shared material from breezedeus☆22Updated 2 years ago
- ☆56Updated last year
- A more efficient GLM implementation!☆55Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Updated last year
- ☆69Updated last year
- ☆79Updated last year
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆80Updated 10 months ago
- Our 2nd-gen LMM☆33Updated 11 months ago
- Repository for the NeurIPS 2024 paper "SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up…☆24Updated 5 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆60Updated 6 months ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆167Updated 2 years ago
- ☆38Updated 7 months ago