Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)
β37Jan 1, 2024Updated 2 years ago
Alternatives and similar repositories for clippy
Users that are interested in clippy are comparing it to the libraries listed below
Sorting:
- [WIP] Code for LangToMoβ20Jun 25, 2025Updated 8 months ago
- π€ [ICLR'25] Multimodal Video Understanding Framework (MVU)β55Jan 31, 2025Updated last year
- This is a python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires pythonβ₯3.5β13Feb 16, 2026Updated 2 weeks ago
- β14Jun 25, 2022Updated 3 years ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)β108Jun 26, 2024Updated last year
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)β20Aug 24, 2023Updated 2 years ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"β20Apr 20, 2023Updated 2 years ago
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodimentβ24Jan 9, 2025Updated last year
- [β CVPR 2025 Highlight β] Official Implementation of the paper STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing froβ¦β29Apr 22, 2025Updated 10 months ago
- [ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Modelsβ28Oct 20, 2025Updated 4 months ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddingsβ11Feb 24, 2025Updated last year
- [MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"β14Nov 1, 2024Updated last year
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathologyβ12Jun 17, 2025Updated 8 months ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalizationβ110Feb 11, 2024Updated 2 years ago
- Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transformeβ¦β17Nov 24, 2024Updated last year
- (BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" β¦β34Jan 8, 2023Updated 3 years ago
- WACV 2024: "PathLDM: Text conditioned Latent Diffusion Model for Histopathology"β48Jul 7, 2024Updated last year
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentorsβ31Jun 2, 2024Updated last year
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]β22Oct 27, 2024Updated last year
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMsβ35Feb 26, 2026Updated last week
- The official implementation of "Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency".β13Jul 16, 2024Updated last year
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Modelsβ15Nov 1, 2024Updated last year
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)β15Jul 4, 2022Updated 3 years ago
- [ CVPR 2025 π₯] STING-BEE, the first domain-aware visual AI assistant for X-ray baggage security screening.β24Jun 27, 2025Updated 8 months ago
- β35Feb 5, 2024Updated 2 years ago
- [ACCV 2024] ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes πππβ37Jan 21, 2025Updated last year
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".β12Oct 11, 2024Updated last year
- β18Sep 23, 2024Updated last year
- Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformersβ21Aug 2, 2024Updated last year
- β18Dec 17, 2022Updated 3 years ago
- Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"β17Oct 6, 2025Updated 4 months ago
- [ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policyβ227Mar 29, 2025Updated 11 months ago
- [NAACL'25] Contains code and documentation for our VANE-Bench paper.β17Aug 19, 2025Updated 6 months ago
- Official implementation of the paper "PromptSmooth: Certifying Robustness of Medical Vision-Language Models via Prompt Learning"β24Apr 17, 2025Updated 10 months ago
- Visual Speech Recongnitionβ19Dec 24, 2024Updated last year
- [ICRA'24] Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learningβ70Aug 4, 2024Updated last year
- This is an official implementation of GRIT-VLPβ20Aug 8, 2022Updated 3 years ago
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)β25May 16, 2024Updated last year
- DOFA-CLIP: Multimodal VisionβLanguage Foundation Models for Earth Observationβ37Jul 30, 2025Updated 7 months ago