☆131 · Dec 26, 2025 · Updated 4 months ago
Alternatives and similar repositories for SuperCLIP
Users that are interested in SuperCLIP are comparing it to the libraries listed below.
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding ☆76 · Jun 26, 2025 · Updated 10 months ago
- ☆61 · May 13, 2025 · Updated 11 months ago
- Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers" ☆63 · Mar 3, 2025 · Updated last year
- [AAAI'25 Oral] NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark ☆11 · Jun 10, 2025 · Updated 10 months ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation" ☆130 · Oct 23, 2025 · Updated 6 months ago
- OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models ☆123 · Apr 25, 2025 · Updated last year
- [NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning ☆33 · Apr 19, 2026 · Updated 2 weeks ago
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception ☆160 · Jan 10, 2026 · Updated 3 months ago
- A code base for the official XS-VID dataset baseline method YOLOFT ☆20 · Dec 24, 2024 · Updated last year
- [CVPR 2026 Highlight] A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens ☆98 · Apr 21, 2026 · Updated last week
- The first decoder-only multimodal state space model ☆104 · May 19, 2025 · Updated 11 months ago
- [NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training ☆224 · Mar 20, 2025 · Updated last year
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding ☆212 · Jan 5, 2026 · Updated 3 months ago
- mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations ☆80 · Jan 12, 2026 · Updated 3 months ago
- Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition ☆41 · Dec 5, 2024 · Updated last year
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders. ☆14 · Jun 7, 2023 · Updated 2 years ago
- ☆25 · Nov 17, 2025 · Updated 5 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention ☆118 · Jun 17, 2024 · Updated last year
- Visual Generation Tuning ☆100 · Apr 16, 2026 · Updated 2 weeks ago
- [AAAI 2026 Oral] LENS: Learning to Segment Anything with Unified Reinforced Reasoning ☆122 · Dec 3, 2025 · Updated 5 months ago
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data ☆45 · Oct 15, 2023 · Updated 2 years ago
- P^2HCT: Plug-and-Play Hierarchical C2F Transformer for Multi-Scale Feature Fusion [ICME2026] ☆24 · May 19, 2025 · Updated 11 months ago
- This repository contains the **official implementation** of the paper: "VL2Lite: Task-Specific Knowledge Distillation from Large Vision-… ☆18 · Mar 23, 2025 · Updated last year
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,… ☆29 · Mar 18, 2026 · Updated last month
- [ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition ☆58 · Apr 8, 2025 · Updated last year
- EAFT (Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting) official repo ☆97 · Jan 15, 2026 · Updated 3 months ago
- [arXiv '24] Efficient Cell Nuclei Instance Segmentation with Large Convolution Kernels ☆47 · Aug 28, 2024 · Updated last year
- ☆12 · Aug 10, 2022 · Updated 3 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning ☆15 · Jun 23, 2024 · Updated last year
- Introducing OWLv2: Google's Breakthrough in Zero-Shot Object Detection ☆28 · Oct 20, 2023 · Updated 2 years ago
- 360M model running in the browser on WebGPU ☆23 · Aug 20, 2024 · Updated last year
- [ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection ☆34 · Jul 23, 2025 · Updated 9 months ago
- [CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention ☆179 · Mar 1, 2025 · Updated last year
- ☆19 · Apr 11, 2024 · Updated 2 years ago
- [IJCV 2024] ☆21 · Nov 11, 2024 · Updated last year
- Proof of concept for a reasoning model that runs locally in your browser with WebGPU acceleration ☆19 · Jan 22, 2025 · Updated last year
- DeepPerf is an end-to-end deep learning based solution that can train a software performance prediction model from a limited number of sa… ☆17 · Mar 16, 2021 · Updated 5 years ago
- PyTorch code of AdAGeo - WACV2021 ☆19 · Apr 26, 2023 · Updated 3 years ago
- Official code for "To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition" CVPR IMW 2025 ☆39 · Oct 4, 2025 · Updated 6 months ago