[MICCAI 2024 🔥] HLSS, the first study to explore hierarchical information inherent in histopathology images and their language descriptions for strong multi-modal representation learning
★27 | Aug 5, 2024 | Updated last year
Alternatives and similar repositories for HLSS
Users who are interested in HLSS are comparing it to the libraries listed below.
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology | ★12 | Jun 17, 2025 | Updated last year
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues" (ICLR 2023) | ★20 | Aug 24, 2023 | Updated 2 years ago
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models | ★15 | Nov 1, 2024 | Updated last year
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024] | ★22 | Oct 27, 2024 | Updated last year
- [NAACL'25] Contains code and documentation for our VANE-Bench paper. | ★24 | Aug 19, 2025 | Updated 10 months ago
- A new multi-task learning framework using Vision Transformers | ★11 | Jun 19, 2024 | Updated 2 years ago
- [MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation" | ★14 | Nov 1, 2024 | Updated last year
- [CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models" | ★26 | Jun 8, 2025 | Updated last year
- [ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models | ★29 | Oct 20, 2025 | Updated 8 months ago
- Official repository for "Stylized Adversarial Training" (TPAMI 2022) | ★11 | Dec 30, 2022 | Updated 3 years ago
- A code | ★29 | Jan 23, 2025 | Updated last year
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors". | ★12 | Oct 11, 2024 | Updated last year
- Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024] | ★33 | Oct 27, 2024 | Updated last year
- A Large Multimodal Model for Remote Sensing Change Description (IGARSS 2025) | ★22 | Dec 17, 2025 | Updated 6 months ago
- Repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral) | ★25 | May 16, 2024 | Updated 2 years ago
- [CVPR 2025 🔥] STING-BEE, the first domain-aware visual AI assistant for X-ray baggage security screening. | ★29 | Jun 27, 2025 | Updated last year
- Composed Video Retrieval | ★62 | May 2, 2024 | Updated 2 years ago
- Official repository of paper titled "D3Former: Debiased Dual Distilled Transformer for Incremental Learning". | ★25 | Jul 10, 2023 | Updated 2 years ago
- A Multitask Conversational Vision-Language Model for Radiology | ★17 | Jul 3, 2025 | Updated 11 months ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges | ★30 | Sep 24, 2023 | Updated 2 years ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization | ★107 | Feb 11, 2024 | Updated 2 years ago
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videos | ★24 | May 7, 2026 | Updated last month
- Official PyTorch implementation of "Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?" (ICLR 2024) | ★13 | Mar 8, 2024 | Updated 2 years ago
- ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark | ★17 | May 25, 2025 | Updated last year
- (BMVC 2022, Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" … | ★35 | Jan 8, 2023 | Updated 3 years ago
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study | ★16 | Nov 22, 2024 | Updated last year
- Implementation of the paper LIMITR: Leveraging Local Information for Medical Image-Text Representation | ★17 | Feb 8, 2024 | Updated 2 years ago
- This is a Python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires python≥3.5 | ★13 | Jun 3, 2026 | Updated 3 weeks ago
- [CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the… | ★46 | May 26, 2025 | Updated last year
- [NAACL 2025 🔥] CAMEL-Bench is an Arabic benchmark for evaluating multimodal models across eight domains with 29,000 questions. | ★38 | Apr 17, 2025 | Updated last year
- We introduce a new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their… | ★22 | Jan 11, 2026 | Updated 5 months ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22) | ★109 | Jun 26, 2024 | Updated 2 years ago
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation) | ★16 | Jan 18, 2024 | Updated 2 years ago