TencentARC-QQ/QA-CLIP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TencentARC-QQ/QA-CLIP)

TencentARC-QQ / QA-CLIP

Chinese CLIP models with SOTA performance.

☆63

Alternatives and similar repositories for QA-CLIP

Users that are interested in QA-CLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thu-ml / zh-clip
View on GitHub
☆73Jun 28, 2023Updated 3 years ago
DeepVAC / SYSZUXface
View on GitHub
DeepVAC-FACE test dataset.
☆14May 13, 2021Updated 5 years ago
OpenGVLab / M3I-Pretraining
View on GitHub
[CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.
☆91Jun 1, 2023Updated 3 years ago
showlab / DemoVLP
View on GitHub
[Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training
☆22Mar 19, 2022Updated 4 years ago
bellchenx / AudioFolder-Dataloader-PyTorch
View on GitHub
This sample includes simeple CNN classifier for music and audio-folder dataloader just like ImageFolder in torchvision.
☆11Oct 30, 2018Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
fitzpchao / Chinese_InstructBLIP
View on GitHub
Adding a Randeng translation model on top of the instructBLIP model to enable Chinese testing of instructBLIP functionality.
☆16May 30, 2023Updated 3 years ago
TencentYoutuResearch / HighlightDetection-CLC
View on GitHub
Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"
☆18Mar 21, 2023Updated 3 years ago
hlchen23 / ADPN-MM
View on GitHub
Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…
☆51Dec 30, 2023Updated 2 years ago
OpenGVLab / InternVL-MMDetSeg
View on GitHub
Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed
☆108Oct 25, 2024Updated last year
Shi-AE / sgmis
View on GitHub
Java EE 课程设计 \ Java web 课程设计基于spring Boot + Mybatis Plus + Vue + Android原生，前后端分离。附带设计报告、UML建模，团队协作赛鸽数据管理系统
☆11Oct 10, 2024Updated last year
zhangjiewu / awesome-t2i-eval
View on GitHub
A curated list of papers and resources for text-to-image evaluation.
☆30Sep 6, 2023Updated 2 years ago
vincenzodentamaro / aucoresnet
View on GitHub
AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breath
☆13Mar 18, 2022Updated 4 years ago
scenarios / WeMM
View on GitHub
☆90Jul 4, 2024Updated 2 years ago
X-PLUG / Youku-mPLUG
View on GitHub
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
☆307Jan 8, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
FreedomIntelligence / FastLLM
View on GitHub
Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];
☆41Jan 4, 2024Updated 2 years ago
hahehi / placepedia
View on GitHub
A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.
☆10Jul 15, 2020Updated 6 years ago
OFA-Sys / Chinese-CLIP
View on GitHub
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
☆5,980Mar 31, 2026Updated 3 months ago
OFA-Sys / TouchStone
View on GitHub
Touchstone: Evaluating Vision-Language Models by Language Models
☆84Jan 18, 2024Updated 2 years ago
lucasjinreal / LLaVA-Magvit2
View on GitHub
LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.
☆38Jun 20, 2024Updated 2 years ago
FlyAndNotDown / nuaa-os-malloc-program
View on GitHub
💻NUAA 2018 操作系统小作业-模拟内存分配程序(BF算法)
☆13Jul 2, 2018Updated 8 years ago
corleonechensiyu / pytorch_AdvancedEast
View on GitHub
pytorch实现AdvancedEast+mobilenetv3
☆26Dec 25, 2019Updated 6 years ago
arxiver / Visual-OS-Scheduler
View on GitHub
Operating systems scheduling algorithms visualization.
☆10Sep 1, 2023Updated 2 years ago
IrvingMeng / LCE
View on GitHub
Learning Compatible Embeddings, ICCV 2021
☆33Aug 18, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
pingponglabs / FaceAnime
View on GitHub
☆10Apr 22, 2021Updated 5 years ago
ArchieAlexArkhipov / Cycle-CenterNet
View on GitHub
CycleCenternet based on MMDetection
☆22Jun 28, 2023Updated 3 years ago
ShengKuangCN / BAST
View on GitHub
☆18May 28, 2025Updated last year
Tencent-QQMM / PureMM
View on GitHub
☆21Feb 29, 2024Updated 2 years ago
CVHub520 / efficientvit
View on GitHub
EfficientViT is a new family of vision models for efficient high-resolution vision.
☆32Sep 20, 2023Updated 2 years ago
Q-Future / Chinese-Q-Bench
View on GitHub
[WIP@Oct 13] 质衡-基准测试 (Q-Bench in Chinese)，包含中文版【底层视觉问答】和【底层视觉描述】数据集，以及中文提示下的图片质量评价。 We will release Q-Bench in more languages in the futu…
☆24Jan 7, 2024Updated 2 years ago
MhLiao / SynthText3D
View on GitHub
Project page of SynthText3D
☆150Dec 10, 2019Updated 6 years ago
lailiting / operate
View on GitHub
操作系统相关实验，包括分页式分段式存储管理，银行家算法，页面置换算法，以及进程调度和作业调度相关算法的c/c++题解
☆12Nov 7, 2022Updated 3 years ago
zhangbo2008 / A_star_algorithm
View on GitHub
牛逼的A*算法,最好的寻路算法
☆11Feb 28, 2020Updated 6 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
360CVGroup / 360VL
View on GitHub
Our 2nd-gen LMM
☆34May 22, 2024Updated 2 years ago
ahmdtaha / distributed_sigmoid_loss
View on GitHub
Unofficial implementation for Sigmoid Loss for Language Image Pre-Training
☆11Sep 26, 2023Updated 2 years ago
BADBADBADBOY / genete_ocr_data
View on GitHub
ocr data ,detect data ,recognize data
☆29Mar 24, 2020Updated 6 years ago
eren23 / sam-clip-diffusion
View on GitHub
SAM + CLIP + DIFFUSION for image to edit objects in images using plain text
☆14Apr 14, 2023Updated 3 years ago
hy0523 / MTNet
View on GitHub
Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation[TNNLS2024]
☆14May 6, 2025Updated last year
yoon28 / SCT4DukeMTMC
View on GitHub
Single camera tracker for DukeMTMC dataset
☆36Sep 5, 2017Updated 8 years ago
TencentARC / TaCA
View on GitHub
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16Jun 20, 2023Updated 3 years ago