Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)
β37Jan 1, 2024Updated 2 years ago
Alternatives and similar repositories for clippy
Users that are interested in clippy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π€ [ICLR'25] Multimodal Video Understanding Framework (MVU)β56Jan 31, 2025Updated last year
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)β109Jun 26, 2024Updated last year
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"β36Jun 17, 2024Updated last year
- This is a python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires pythonβ₯3.5β13Jun 3, 2026Updated last week
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)β20Aug 24, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- β38Jun 2, 2026Updated last week
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Evaβ¦β17Sep 23, 2024Updated last year
- [Main Conference @ EACL'26] [Workshop @ NeurIPS'24] ποΈ LVNet.β43Feb 10, 2026Updated 4 months ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalizationβ107Feb 11, 2024Updated 2 years ago
- [MICCAI 2024 π₯] HLSS, the first study to explore hierarchical information inherent in histopathology images and their language descriptiβ¦β27Aug 5, 2024Updated last year
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodimentβ25Jan 9, 2025Updated last year
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddingsβ11Feb 24, 2025Updated last year
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learningβ12Aug 23, 2025Updated 9 months ago
- WACV 2024: "PathLDM: Text conditioned Latent Diffusion Model for Histopathology"β51Jul 7, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Magnification Prior: A Self-Supervised Method for Learning Representations on Breast Cancer Histopathological Images (WACV 2023)β15Mar 13, 2023Updated 3 years ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challengesβ30Sep 24, 2023Updated 2 years ago
- [ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Modelsβ29Oct 20, 2025Updated 7 months ago
- [ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policyβ229Mar 29, 2025Updated last year
- [MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"β14Nov 1, 2024Updated last year
- [ICRA'24] Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learningβ72Aug 4, 2024Updated last year
- (BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" β¦β35Jan 8, 2023Updated 3 years ago
- β14Aug 12, 2022Updated 3 years ago
- The official implementation of "Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency".β13Jul 16, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentorsβ31Jun 2, 2024Updated 2 years ago
- Official Implementation of Curriculum of Data Augmentation for Long-tailed Recognition (CUDA) (ICLR'23 Spotlight)β23May 26, 2023Updated 3 years ago
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathologyβ12Jun 17, 2025Updated 11 months ago
- PyTorch code for AWRaCLe: All-Weather Image Restoration using Visual In-Context Learningβ26Mar 22, 2025Updated last year
- [ICLR 2022] Official implementation of "It Takes Two to Tango: Mixup for Deep Metric Learning".β36May 15, 2024Updated 2 years ago
- Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networksβ24Sep 6, 2022Updated 3 years ago
- β17Sep 23, 2024Updated last year
- Official implementation of the paper "PromptSmooth: Certifying Robustness of Medical Vision-Language Models via Prompt Learning"β24Apr 17, 2025Updated last year
- Code for NeurIPS 2023 paper "Active Vision Reinforcement Learning with Limited Visual Observability"β56Oct 10, 2024Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official repository for "On Generating Transferable Targeted Perturbations" (ICCV 2021)β63Mar 25, 2023Updated 3 years ago
- Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"β61Jun 12, 2023Updated 3 years ago
- β18Dec 17, 2022Updated 3 years ago
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".β12Oct 11, 2024Updated last year
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"β16Nov 20, 2024Updated last year
- Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformersβ21Aug 2, 2024Updated last year
- Theia: Distilling Diverse Vision Foundation Models for Robot Learningβ276Nov 6, 2025Updated 7 months ago