Mind the Modality Gap: Towards a Remote Sensing Vision-Language Model via Cross-modal Alignment
☆20Oct 16, 2024Updated last year
Alternatives and similar repositories for MindTheModalityGap
Users that are interested in MindTheModalityGap are comparing it to the libraries listed below
Sorting:
- Hephaestus: A large scale multitask dataset towards InSAR understanding☆74Jul 3, 2023Updated 2 years ago
- ☆16Mar 13, 2025Updated 11 months ago
- Code and data for Kuro Siwo flood mapping dataset☆73Sep 24, 2025Updated 5 months ago
- GAIA: A global, multimodal, multiscale vision–language dataset for remote sensing image analysis☆31Feb 11, 2026Updated 3 weeks ago
- ☆55Mar 14, 2025Updated 11 months ago
- Code and models for efficient training on the BigEarthNet dataset for Land Use Land Cover classification☆74Dec 7, 2022Updated 3 years ago
- Sen4AgriNet: A Sentinel-2 multi-year, multi-country benchmark dataset for crop classification and segmentation with deep learning☆110Nov 6, 2024Updated last year
- IGARSS 2024 Tutorial "A Practical Session on Deep Learning Advances for Monitoring and forecasting Natural Hazards"☆31Jul 7, 2024Updated last year
- An open and accessible datacube for environmental and earth system monitoring☆38Sep 27, 2024Updated last year
- Teleconnection-driven vision transformers for improved long-term forecasting☆34Nov 1, 2023Updated 2 years ago
- ☆10Nov 2, 2023Updated 2 years ago
- ☆13May 13, 2024Updated last year
- A method to generate counterfactuals☆12Feb 24, 2026Updated last week
- Deep Learning for monitoring and forecasting natural hazards with earth observation data☆14Jul 17, 2023Updated 2 years ago
- A user-friendly introduction to the multi-modal BigEarthNet-MM dataset.☆54Oct 27, 2024Updated last year
- Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch☆14Feb 23, 2026Updated last week
- [ICCAD 2025] Squant☆15Jul 3, 2025Updated 8 months ago
- ☆11Oct 16, 2020Updated 5 years ago
- Bencharking pipeline for evaluating Transcriptomic representations for perturbation tasks☆12Nov 5, 2024Updated last year
- This is the official code repository for the "Gradient-Guided Annealing for Domain Generalization" (CVPR 2025) paper.☆18Jul 22, 2025Updated 7 months ago
- Scalable Generation of Spatial Transcriptomics from Histology Images via Whole-Slide Flow Matching, ICML2025 (Spotlight)☆28Aug 11, 2025Updated 6 months ago
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025☆16Nov 24, 2024Updated last year
- Official implementation and checkpoints of GeoLink remote sensing foundation model in NeurIPS2025.☆53Oct 6, 2025Updated 5 months ago
- ESDL Cube Generation and Access API☆16Oct 16, 2020Updated 5 years ago
- [ECCV 2024] Code for the paper "Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network"☆17Jul 27, 2024Updated last year
- ☆15Jun 2, 2022Updated 3 years ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- Several baseline models and PJFNN on Job Recommendation Challenge☆23Aug 14, 2023Updated 2 years ago
- [ICCV 2025] Towards Foundational Models for Single-Chip Radar☆29Oct 25, 2025Updated 4 months ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆18Feb 25, 2025Updated last year
- Sound field estimation based on physics-constrained neural kernel☆21Jun 9, 2025Updated 8 months ago
- A dataset with Space (Sentinel-1/2) and Ground (street-level images) components, annotated with crop-type labels for agriculture monitori…☆22Aug 2, 2022Updated 3 years ago
- ☆26Apr 30, 2025Updated 10 months ago
- ☆19Sep 26, 2024Updated last year
- ☆17May 14, 2020Updated 5 years ago
- body25 + hand pose3d to bvh☆17Jul 19, 2022Updated 3 years ago
- [ICML'25] MedTok: Multimodal Medical Code Tokenizer☆38Jul 20, 2025Updated 7 months ago
- Official Pytorch implementation of Dynamic-Token-Pruning (ICCV2023)☆22Sep 28, 2023Updated 2 years ago
- Official code implementation for our paper -- Direct Inversion: Optimization-Free Text-Driven Real Image Editing with Diffusion Models.☆27Nov 18, 2022Updated 3 years ago