☆124Dec 26, 2025Updated 2 months ago
Alternatives and similar repositories for SuperCLIP
Users that are interested in SuperCLIP are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆73Jun 26, 2025Updated 8 months ago
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆153Jan 10, 2026Updated last month
- [AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices☆95Nov 30, 2025Updated 3 months ago
- [NeurIPS 2025] RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning☆185Nov 7, 2025Updated 3 months ago
- Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers"☆63Mar 3, 2025Updated last year
- ☆14Jul 1, 2025Updated 8 months ago
- [AAAI'25 Oral] NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark☆10Jun 10, 2025Updated 8 months ago
- [NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning☆27Dec 9, 2025Updated 2 months ago
- P^2HCT: Plug-and-Play Hierarchical C2F Transformer for Multi-Scale Feature Fusion☆19May 19, 2025Updated 9 months ago
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.☆14Jun 7, 2023Updated 2 years ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆123Oct 23, 2025Updated 4 months ago
- ☆27Jan 5, 2026Updated last month
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆16Jun 23, 2024Updated last year
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆27Jan 10, 2026Updated last month
- ☆21Nov 17, 2025Updated 3 months ago
- ☆12Aug 10, 2022Updated 3 years ago
- The Code of SiCL☆18Nov 5, 2024Updated last year
- Proof of concept for a reasoning model that runs locally in your browser with WebGPU acceleration☆17Jan 22, 2025Updated last year
- The first decoder-only multimodal state space model☆100May 19, 2025Updated 9 months ago
- Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition☆42Dec 5, 2024Updated last year
- ☆21Nov 27, 2025Updated 3 months ago
- A code base for the official XS-VID dataset baseline method YOLOFT☆19Dec 24, 2024Updated last year
- 动手训练一个简单的CLIP模型,加深对CLIP的理解。☆22May 20, 2025Updated 9 months ago
- Code for the paper "RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection" (ACL'25).☆33Jul 23, 2025Updated 7 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆116Jun 17, 2024Updated last year
- Featurized Query R-CNN☆45Jun 17, 2022Updated 3 years ago
- ☆77Sep 25, 2025Updated 5 months ago
- ☆17Nov 17, 2023Updated 2 years ago
- [ECCV 2024🔥] The official code for the paper DiffFAS: Face Anti-Spoofing via Generative Diffusion Models.☆42Sep 23, 2024Updated last year
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data☆46Oct 15, 2023Updated 2 years ago
- [ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition☆58Apr 8, 2025Updated 10 months ago
- This repository contains the code accompanying the paper "A Self-Guided Framework for Radiology Report Generation", accepted by MICCAI 20…☆21Mar 11, 2024Updated last year
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆129Nov 14, 2025Updated 3 months ago
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆56Jan 23, 2026Updated last month
- [arXiv '24] Efficient Cell Nuclei Instance Segmentation with Large Convolution Kernels☆47Aug 28, 2024Updated last year
- [IJCV 2024]☆21Nov 11, 2024Updated last year
- codes for Uncovering Hidden Challenges in Query-Based Video Moment Retrieval☆20Sep 7, 2020Updated 5 years ago
- [CVPR 2025] Offical implementation of the paper "Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters The…☆31Feb 27, 2025Updated last year
- Pytorch code of AdAGeo - WACV2021☆19Apr 26, 2023Updated 2 years ago