thu-ml / zh-clipView external linksLinks
☆72Jun 28, 2023Updated 2 years ago
Alternatives and similar repositories for zh-clip
Users that are interested in zh-clip are comparing it to the libraries listed below
Sorting:
- Chinese CLIP models with SOTA performance.☆60Aug 28, 2023Updated 2 years ago
- MeloTTS demo on Axera☆10Nov 18, 2025Updated 2 months ago
- ☆88Jul 4, 2024Updated last year
- Official codes for ConMIM (ICLR 2023)☆58Feb 8, 2023Updated 3 years ago
- [ICCV 2023] Learning Fine-Grained Features for Pixel-wise Video Correspondences☆18Mar 3, 2024Updated last year
- SDXL API provides a seamless interface for image generation and retrieval using Stable Diffusion XL integrated with Cloudflare AI Workers…☆13Feb 29, 2024Updated last year
- ☆16Jul 29, 2025Updated 6 months ago
- Large Multimodal Model☆15Apr 8, 2024Updated last year
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆213Feb 27, 2024Updated last year
- 一个mmcv 的logger hook, 可以用来把模型结果推送到微信上☆21Oct 11, 2022Updated 3 years ago
- Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion☆49Mar 11, 2024Updated last year
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆5,795Aug 29, 2025Updated 5 months ago
- This repo is the official megengine implementation of the ECCV2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothin…☆28Oct 19, 2022Updated 3 years ago
- DPNet: Dual-Path Network for Real-time Object Detection with Lightweight Attention☆21May 18, 2022Updated 3 years ago
- 基于wav2lip进行虚拟数字人训练,唇形驱动,包括数据处理流程等,模型包括96x96,192x192,192x288,288x288。☆22May 7, 2024Updated last year
- ☆19Dec 6, 2023Updated 2 years ago
- ☆22Dec 11, 2024Updated last year
- 使用OpenCV部署L2CS-Net人脸朝向估计,包含C++和Python两个版本的程序,只依赖opencv库就可以运行☆21Aug 12, 2023Updated 2 years ago
- [ICLR 2023] CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding☆46Jun 9, 2025Updated 8 months ago
- ☆133Dec 22, 2023Updated 2 years ago
- Taiyi-Diffusion-XL训练代码☆23Jun 5, 2024Updated last year
- ☆21Apr 10, 2018Updated 7 years ago
- ☆28Jun 30, 2025Updated 7 months ago
- This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimoda…☆31Mar 9, 2025Updated 11 months ago
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆109Oct 25, 2024Updated last year
- ☆168Nov 9, 2023Updated 2 years ago
- ☆26Jul 7, 2021Updated 4 years ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆27Jul 23, 2024Updated last year
- [ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation☆458Dec 2, 2024Updated last year
- Gstreamer based Edge AI reference application☆32Updated this week
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆33Aug 16, 2023Updated 2 years ago
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆190Nov 17, 2023Updated 2 years ago
- SaccadeNet : mimic how human locate accurate bounding box☆29Jul 10, 2019Updated 6 years ago
- VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)☆194Mar 13, 2023Updated 2 years ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast…☆30May 28, 2023Updated 2 years ago
- ☆14Mar 12, 2023Updated 2 years ago
- Bridging Vision and Language Model☆286Mar 27, 2023Updated 2 years ago
- 使用django+pyecharts+PP-Human开发的动态数据大屏, 有人流数据的采集入库, 打架、摔倒等事件警报,口罩检测等实用功能。边缘端版本使用onnx推理提升效率,服务端版本支持视频流推拉☆33May 3, 2023Updated 2 years ago