[ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets
☆64Aug 6, 2025Updated 7 months ago
Alternatives and similar repositories for hqclip
Users that are interested in hqclip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR2023] Practical Network Acceleration with Tiny Sets☆14Jul 28, 2023Updated 2 years ago
- ☆20Sep 19, 2023Updated 2 years ago
- [WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing☆17Mar 16, 2025Updated last year
- [CVPR 2025] Test-Time Visual In-Context Tuning☆30Dec 31, 2025Updated 2 months ago
- ☆19Apr 1, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆25Nov 30, 2023Updated 2 years ago
- ☆30Mar 30, 2025Updated 11 months ago
- [ECCV 2024] Official project of CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning☆44Jul 10, 2024Updated last year
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆20Oct 31, 2024Updated last year
- Code for paper "Masked Pre-training Enables Universal Zero-shot Denoiser" [NeurIPS 2024].☆35Nov 20, 2024Updated last year
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆29Apr 15, 2025Updated 11 months ago
- ☆16Jul 17, 2025Updated 8 months ago
- [ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆22Oct 28, 2025Updated 4 months ago
- A Spitting Image: Modular Superpixel Tokenization in Vision Transformers☆21Sep 12, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official Pytorch implementation of the AAAI 2025 "Spiking Point Transformer for Point Cloud Classification"☆15Apr 12, 2025Updated 11 months ago
- 🚀 智谱清言ChatGLM-4.7 大模型逆向API【特长:超强智能体】,支持高速流式输出、支持智能体对话、支持多轮对话、支持沉思模型、支持Zero思考推理模型;仅供测试,如需商用请前往官方开放平台。☆41Feb 26, 2026Updated last month
- Code for [Pattern Recognition] Prompt Learning based Source-free Domain Adaptation for Medical Image Segmentation.☆30Apr 22, 2025Updated 11 months ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆23Mar 18, 2025Updated last year
- [CVPR2025] VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding☆24Mar 24, 2025Updated last year
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆150Feb 19, 2025Updated last year
- [AAAI 2024] SVDP: Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction☆32Apr 26, 2024Updated last year
- [ICCV 2025] Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models?☆23Sep 16, 2025Updated 6 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Caffe implementation of grad-CAM visulization technique presented at https://github.com/ramprs/grad-cam☆23Apr 17, 2017Updated 8 years ago
- [AAAI2025] Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient☆44Apr 17, 2025Updated 11 months ago
- Agent-to-Sim Learning Interactive Behavior from Casual Videos.☆47Oct 16, 2024Updated last year
- Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision☆145Feb 6, 2026Updated last month
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆45Mar 11, 2025Updated last year
- The code implementation for UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings (ICLR 2026).☆50Feb 25, 2026Updated last month
- Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks☆14May 12, 2025Updated 10 months ago
- [NeurIPS 2024] Visual Perception by Large Language Model’s Weights☆56Mar 31, 2025Updated 11 months ago
- [TMLR'24] This repository includes the official implementation our paper "Unleashing the Power of Visual Prompting At the Pixel Level"☆42Apr 30, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- using pvanet framework train mobilenet-v2 for objects detection, papaer: https://arxiv.org/abs/1611.08588☆13Feb 13, 2019Updated 7 years ago
- sam-unet☆41Aug 19, 2024Updated last year
- Official repo for [AAAI 2026 Oral] "S5: Scalable Semi-Supervised Semantic Segmentation in Remote Sensing"☆33Dec 4, 2025Updated 3 months ago
- [NeurIPS'23] Binary Classification with Confidence Difference☆10May 13, 2024Updated last year
- Offical implementation of "Efficient 3D Recognition with Event-driven Spike Sparse Convolution" (AAAI2025)☆27Jul 7, 2025Updated 8 months ago
- ☆12Aug 15, 2024Updated last year
- [ECCV2024] XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution☆78Jun 27, 2024Updated last year