[NeurIPS24] VisMin: Visual Minimal-Change Understanding
☆19Mar 3, 2025Updated last year
Alternatives and similar repositories for vismin
Users that are interested in vismin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆22Oct 8, 2024Updated last year
- [WACV 2025-Oral Presentation] Test-Time Adaptation in Point Clouds: Leveraging Sampling Variation with Weight Averaging☆12Mar 31, 2025Updated last year
- [CVPR 2025] Spectral Informed Mamba for Robust Point Cloud Processing☆27Jun 22, 2025Updated 11 months ago
- (Best Paper Awar-MedAGI) Boosting Vision Language Models for Histopathology Classification☆18May 26, 2025Updated last year
- [PR 2024] TFS-ViT: Token-Level Feature Stylization for Domain Generalization☆26Mar 29, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆33Oct 6, 2024Updated last year
- ☆44Apr 8, 2024Updated 2 years ago
- ☆20Nov 10, 2022Updated 3 years ago
- ☆10Jul 5, 2024Updated last year
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆56Apr 7, 2025Updated last year
- CAGNet: Content-Aware Guidance for Salient Object Detection☆33Dec 28, 2020Updated 5 years ago
- [NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding☆10Jul 15, 2023Updated 2 years ago
- Structure-Aware Feature Stylization for Domain Generalization☆12Oct 7, 2023Updated 2 years ago
- Project Page for GaussianFormer☆24May 30, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is the official code implementation of Bongard-OpenWorld (ICLR 2024).☆14Jan 6, 2025Updated last year
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"☆33Jul 8, 2025Updated 10 months ago
- Computer Systems Lab☆13Oct 16, 2025Updated 7 months ago
- Reversal Curse Experiment☆15Sep 24, 2023Updated 2 years ago
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da…☆15Mar 28, 2025Updated last year
- CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning☆15Jun 24, 2024Updated last year
- GRPO Training Script for Qwen Model on GSM8K Dataset. This script trains a Qwen model using the GRPO (Generalized Reinforcement Policy Op…☆31Dec 11, 2025Updated 5 months ago
- NegCLIP.☆41Feb 6, 2023Updated 3 years ago
- Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation☆30Sep 20, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆24Jul 8, 2023Updated 2 years ago
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆38Aug 18, 2024Updated last year
- larc solving with gpt4☆20May 25, 2023Updated 3 years ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆39Feb 13, 2025Updated last year
- The source code for "MG-BERT: Multi-Graph Augmented BERT for Masked Language Modeling" paper (NAACL 2021, TextGraphs-15).☆12Jun 11, 2021Updated 4 years ago
- The Pix2Code framework: generalizable, interpretable and revisable visual concept learning☆14Oct 7, 2025Updated 7 months ago
- [3DV 2025] CoE: Deep Coupled Embedding for Non-Rigid Point Cloud Correspondences☆20Jan 5, 2026Updated 4 months ago
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- A curated lists of self-taught materials including research blogs☆16Dec 12, 2016Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models☆28Nov 29, 2023Updated 2 years ago
- Experiments with representation engineering☆14Feb 28, 2024Updated 2 years ago
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations☆141Mar 12, 2026Updated 2 months ago
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆50Jan 14, 2025Updated last year
- ☆14Jul 17, 2024Updated last year
- Official PyTorch implementation of Agglomerative Token Clustering presented at ECCV 2024☆20Sep 19, 2024Updated last year
- ☆16May 11, 2022Updated 4 years ago