w1oves / hqclipView external linksLinks
[ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets
☆62Aug 6, 2025Updated 6 months ago
Alternatives and similar repositories for hqclip
Users that are interested in hqclip are comparing it to the libraries listed below
Sorting:
- A Spitting Image: Modular Superpixel Tokenization in Vision Transformers☆21Sep 12, 2025Updated 5 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆20Oct 31, 2024Updated last year
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆23Mar 18, 2025Updated 10 months ago
- ☆18Sep 20, 2025Updated 4 months ago
- [WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing☆17Mar 16, 2025Updated 10 months ago
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o…☆24Aug 7, 2025Updated 6 months ago
- ☆19Apr 1, 2025Updated 10 months ago
- [ICCV 2025] Official implementation of "AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving"☆34Jul 15, 2025Updated 6 months ago
- ☆29Mar 30, 2025Updated 10 months ago
- Caffe implementation of grad-CAM visulization technique presented at https://github.com/ramprs/grad-cam☆23Apr 17, 2017Updated 8 years ago
- ☆25Nov 30, 2023Updated 2 years ago
- [CVPR 2025] Test-Time Visual In-Context Tuning☆29Dec 31, 2025Updated last month
- [CVPR 2025 Highlight] SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning☆61Jun 26, 2025Updated 7 months ago
- [ICCV 2025] Deeply Supervised Flow-Based Generative Models☆27Jun 26, 2025Updated 7 months ago
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression☆76Jul 30, 2025Updated 6 months ago
- Official implementation of the paper "Watermarking Autoregressive Image Generation" (NeurIPS'25)☆56Sep 19, 2025Updated 4 months ago
- Code for paper "Masked Pre-training Enables Universal Zero-shot Denoiser" [NeurIPS 2024].☆35Nov 20, 2024Updated last year
- Explore how to get a VQ-VAE models efficiently!☆67Jul 24, 2025Updated 6 months ago
- Code for [Pattern Recognition] Prompt Learning based Source-free Domain Adaptation for Medical Image Segmentation.☆29Apr 22, 2025Updated 9 months ago
- [ICCV 2025 Highlight] BridgeDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment☆125Jan 29, 2026Updated 2 weeks ago
- FFNet: MetaMixer-based Efficient Convolutional Mixer Design☆31Mar 11, 2025Updated 11 months ago
- ☆30Jun 14, 2024Updated last year
- Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"☆122Oct 2, 2025Updated 4 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆41Feb 12, 2025Updated last year
- ☆91Jan 18, 2026Updated 3 weeks ago
- Tiny AutoEncoder for Stable Diffusion Videos☆36Oct 5, 2024Updated last year
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆54Apr 9, 2025Updated 10 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆186May 21, 2025Updated 8 months ago
- 🚀 智谱清言ChatGLM-4.7 大模型逆向API【特长:超强智能体】,支持高速流式输出、支持智能体对话、支持多轮对话、支持沉思模型、支持Zero思考推理模型;仅供测试,如需商用请前往官方开放平台。☆28Feb 5, 2026Updated last week
- Towards training VQ-VAE models robustly!☆91Jul 14, 2025Updated 7 months ago
- sam-unet☆40Aug 19, 2024Updated last year
- [ECCV'24] Textual Query-Driven Mask Transformer for Domain Generalized Segmentation☆40Feb 18, 2025Updated 11 months ago
- ☆56May 27, 2025Updated 8 months ago
- Official PyTorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆101Apr 3, 2025Updated 10 months ago
- Embodied Question Answering (EQA) benchmark and method (ICCV 2025)☆46Aug 12, 2025Updated 6 months ago
- [CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention☆39Mar 11, 2025Updated 11 months ago
- [2023 TIP] An Efficient Multiscale Spatial Rearrangement MLP Architecture for Image Restoration.☆11May 7, 2024Updated last year
- [ICCV2025] Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning☆23Nov 13, 2025Updated 3 months ago
- [NeuRIPS, 2024] Multi-Human Dataset for Close Interactions.☆46Nov 14, 2024Updated last year