☆72Jun 28, 2023Updated 2 years ago
Alternatives and similar repositories for zh-clip
Users that are interested in zh-clip are comparing it to the libraries listed below
Sorting:
- Chinese CLIP models with SOTA performance.☆60Aug 28, 2023Updated 2 years ago
- MeloTTS demo on Axera☆10Nov 18, 2025Updated 3 months ago
- ☆90Jul 4, 2024Updated last year
- [ICCV 2023] Learning Fine-Grained Features for Pixel-wise Video Correspondences☆18Mar 3, 2024Updated 2 years ago
- ☆16Jul 29, 2025Updated 7 months ago
- Large Multimodal Model☆15Apr 8, 2024Updated last year
- Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models☆37Sep 19, 2023Updated 2 years ago
- 一个mmcv 的logger hook, 可以用来把模型结果推送到微信上☆21Oct 11, 2022Updated 3 years ago
- Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion☆49Mar 11, 2024Updated last year
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆5,812Aug 29, 2025Updated 6 months ago
- This repo is the official megengine implementation of the ECCV2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothin…☆28Oct 19, 2022Updated 3 years ago
- DPNet: Dual-Path Network for Real-time Object Detection with Lightweight Attention☆21May 18, 2022Updated 3 years ago
- 基于wav2lip进行虚拟数字人训练,唇形驱动,包括数据处理流程等,模型包括96x96,192x192,192x288,288x288。☆22May 7, 2024Updated last year
- ☆19Dec 6, 2023Updated 2 years ago
- [ICLR 2023] CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding☆46Jun 9, 2025Updated 9 months ago
- 使用OpenCV部署L2CS-Net人脸朝向估计,包含C++和Python两个版本的程序,只依赖opencv库就可以运行☆21Aug 12, 2023Updated 2 years ago
- ☆134Dec 22, 2023Updated 2 years ago
- ☆21Apr 10, 2018Updated 7 years ago
- ☆28Jun 30, 2025Updated 8 months ago
- This repository is the official implementation of FLUX-CustomID. It is capable of generating images based on your face image at a level e…☆25Nov 13, 2024Updated last year
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆109Oct 25, 2024Updated last year
- ☆168Nov 9, 2023Updated 2 years ago
- ☆33Dec 18, 2023Updated 2 years ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆27Jul 23, 2024Updated last year
- ☆26Jul 7, 2021Updated 4 years ago
- [ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation☆459Dec 2, 2024Updated last year
- ChineseOcr Lite Mnn,超轻量级中文OCR PC Demo,使用MNN推理☆27Mar 26, 2021Updated 4 years ago
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆33Aug 16, 2023Updated 2 years ago
- Gstreamer based Edge AI reference application☆32Updated this week
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆191Nov 17, 2023Updated 2 years ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆74May 17, 2024Updated last year
- ☆66Feb 5, 2024Updated 2 years ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)☆194Mar 13, 2023Updated 2 years ago
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast…☆30May 28, 2023Updated 2 years ago
- Bridging Vision and Language Model☆286Mar 27, 2023Updated 2 years ago
- SHN-based (Stacked Hourglass Network) methods for 2D face alignment☆31Dec 17, 2019Updated 6 years ago
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs☆511Jul 21, 2023Updated 2 years ago
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)☆143Apr 16, 2025Updated 10 months ago