Chinese CLIP models with SOTA performance.
☆60Aug 28, 2023Updated 2 years ago
Alternatives and similar repositories for QA-CLIP
Users that are interested in QA-CLIP are comparing it to the libraries listed below
- Large Multimodal Model☆15Apr 8, 2024Updated last year
- ☆72Jun 28, 2023Updated 2 years ago
- Adds some files missing from Visualglm so that Visualglm can be trained; the example performs physiognomy (face-reading) recognition on human faces☆13Jun 7, 2023Updated 2 years ago
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆53Dec 30, 2023Updated 2 years ago
- Our 2nd-gen LMM☆34May 22, 2024Updated last year
- [CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.☆91Jun 1, 2023Updated 2 years ago
- ☆88Jul 4, 2024Updated last year
- ☆57Jan 23, 2024Updated 2 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- 🏆 🥇 Winner solution for ICCV VQualA 2025 Face Image Quality Assessment Challenge☆25Jan 4, 2026Updated 2 months ago
- Implementation of "Realtime Multi Person Pose-Estimation" in PyTorch with data from AI Challenger☆13Nov 24, 2017Updated 8 years ago
- Adding a Randeng translation model on top of the instructBLIP model to enable Chinese testing of instructBLIP functionality.☆16May 30, 2023Updated 2 years ago
- DeepVAC-FACE test dataset.☆14May 13, 2021Updated 4 years ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Sep 9, 2024Updated last year
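- [WIP@Oct 13] 质衡 benchmark (Q-Bench in Chinese), including Chinese versions of the [low-level visual question answering] and [low-level visual description] datasets, plus image quality assessment under Chinese prompts. We will release Q-Bench in more languages in the futu…☆24Jan 7, 2024Updated 2 years ago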
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Jan 18, 2024Updated 2 years ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆22Mar 19, 2022Updated 3 years ago
- Xiaohongshu API data collection☆16Nov 1, 2024Updated last year
- This repository is a TensorRT implementation of PINet☆18Nov 1, 2022Updated 3 years ago
- Vision Xformers☆23May 11, 2023Updated 2 years ago
- Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"☆18Mar 21, 2023Updated 2 years ago
- A Dead Simple and Modularized Multi-Modal Training and Finetune Framework. Compatible with any LLaVA/Flamingo/QwenVL/MiniGemini etc series …☆19Apr 24, 2024Updated last year
- ☆23Aug 17, 2024Updated last year
- Zero-label image classification via OpenCLIP knowledge distillation☆142Sep 12, 2023Updated 2 years ago
- A curated list of papers and resources for text-to-image evaluation.☆30Sep 6, 2023Updated 2 years ago
- CycleCenternet based on MMDetection☆22Jun 28, 2023Updated 2 years ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆23Jun 28, 2024Updated last year
- Chinese Vision-Language Understanding Evaluation☆23Dec 26, 2024Updated last year
- An open-source, commercially usable multimodal model that supports bilingual Chinese-English vision-text dialogue.☆378Sep 23, 2023Updated 2 years ago
- Accelerate segment anything model inference using Tensorrt 8.6.1.6☆105Oct 20, 2023Updated 2 years ago
- 🔥[Information Fusion 2024, Official Code] for paper "Prompt-guided image color aesthetics assessment: Models, datasets and benchmarks". …☆68Jul 29, 2025Updated 7 months ago
- Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks☆304Jan 8, 2024Updated 2 years ago
- ☆23May 7, 2024Updated last year
- Building Pytorch Server with Flask☆31Mar 12, 2018Updated 7 years ago
- PyTorch implementation of AdvancedEast+mobilenetv3☆26Dec 25, 2019Updated 6 years ago
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆24Jun 24, 2024Updated last year
- Lion: Kindling Vision Intelligence within Large Language Models☆51Jan 25, 2024Updated 2 years ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101May 17, 2024Updated last year
- A light and fast one class detection framework for edge devices. We provide face detector, head detector, pedestrian detector, vehicle de…☆205Oct 6, 2020Updated 5 years ago