☆131Dec 26, 2025Updated 3 months ago
Alternatives and similar repositories for SuperCLIP
Users that are interested in SuperCLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆60May 13, 2025Updated 11 months ago
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆76Jun 26, 2025Updated 9 months ago
- [AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices☆115Nov 30, 2025Updated 4 months ago
- [NeurIPS 2025] RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning☆198Nov 7, 2025Updated 5 months ago
- Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers"☆63Mar 3, 2025Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- [AAAI'25 Oral] NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark☆11Jun 10, 2025Updated 10 months ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆128Oct 23, 2025Updated 5 months ago
- OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models☆132Apr 25, 2025Updated 11 months ago
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆155Jan 10, 2026Updated 3 months ago
- A code base for the official XS-VID dataset baseline method YOLOFT☆20Dec 24, 2024Updated last year
- The first decoder-only multimodal state space model☆101May 19, 2025Updated 10 months ago
- Project that regroup the state-of-the-art knowledge distillation approaches for unsupervised anomaly detection☆15Oct 10, 2025Updated 6 months ago
- Featurized Query R-CNN☆45Jun 17, 2022Updated 3 years ago
- ☆17Nov 17, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training☆224Mar 20, 2025Updated last year
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆209Jan 5, 2026Updated 3 months ago
- Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition☆41Dec 5, 2024Updated last year
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆117Jun 17, 2024Updated last year
- Visual Generation Tuning☆99Apr 2, 2026Updated last week
- [AAAI 2026 Oral] LENS: Learning to Segment Anything with Unified Reinforced Reasoning☆114Dec 3, 2025Updated 4 months ago
- P^2HCT: Plug-and-Play Hierarchical C2F Transformer for Multi-Scale Feature Fusion[ICME2026]☆24May 19, 2025Updated 10 months ago
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆29Mar 18, 2026Updated 3 weeks ago
- [ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition☆58Apr 8, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- [ACL 2023] PyTorch Implementation of Zero-and Few-Shot Event Detection via Prompt-Based Meta Learning☆16Jun 6, 2023Updated 2 years ago
- Towards Scalable Pre-training of Visual Tokenizers for Generation☆468Mar 9, 2026Updated last month
- [ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection☆34Jul 23, 2025Updated 8 months ago
- ☆20Apr 11, 2024Updated 2 years ago
- [CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆179Mar 1, 2025Updated last year
- ☆14Dec 11, 2024Updated last year
- Official code for "To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition" CVPR IMW 2025☆39Oct 4, 2025Updated 6 months ago
- [CVPR 2025] Offical implementation of the paper "Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters The…☆32Mar 12, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ECCV 2024🔥] The official code for the paper DiffFAS: Face Anti-Spoofing via Generative Diffusion Models.☆42Sep 23, 2024Updated last year
- Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning☆322Mar 26, 2025Updated last year
- The codes for Bit-mask Robust Contrastive Knowledge Distillation for Unsupervised Semantic Hashing (WWW2024)☆31Mar 3, 2025Updated last year
- VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning☆332Feb 9, 2026Updated 2 months ago
- [NeurIPS 2024] Code, Dataset, Samples for the VATT paper “ Tell What You Hear From What You See - Video to Audio Generation Through Text”☆36Jul 24, 2025Updated 8 months ago
- A simple codebase for image-based person re-id☆54Jul 7, 2021Updated 4 years ago
- ☆83Sep 25, 2025Updated 6 months ago