JayParanjape / F-ViTALinks
Code for F-ViTA: Foundation Model Guided Visible to Thermal Translation
☆27Updated 5 months ago
Alternatives and similar repositories for F-ViTA
Users that are interested in F-ViTA are comparing it to the libraries listed below
Sorting:
- [Arxiv 2025] DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding☆59Updated last month
- Official implementation of the paper "Complementary Random Masking for RGB-T Semantic Segmentation."☆63Updated last year
- CVPR 2025 | Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond☆89Updated last month
- Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance☆11Updated 3 weeks ago
- Official repo for UniRGB-IR.☆45Updated 3 weeks ago
- Repository for synthetic RGB to Thermal Infrared translation module from "Edge-guided multidomain RGB to TIR translation", ICRA 2023 subm…☆133Updated last year
- [TCSVT 2025] CFMW: Cross-modality Fusion Mamba for Robust Object Detection under Adverse Weather☆79Updated 4 months ago
- ☆22Updated 6 months ago
- Code for PID: Physics-Informed Diffusion Model for Infrared Image Generation☆141Updated 3 months ago
- ☆73Updated 11 months ago
- ☆17Updated 8 months ago
- A foundation model in the infrared modality☆54Updated last year
- RGB-T Fusion, RGB-T SOD, RGB-T Vehicle Detection, RGB-T Crowd Counting, RGB-T Pedestrian Detection, RGB-T Semantic Segmeantaion, RGB-T Tr…☆162Updated 3 weeks ago
- Dataset & Code for ACM Multimedia 2023 paper. "SemanticRT: A Large-Scale Dataset and Method for Robust Semantic Segmentation in Multispec…☆14Updated 8 months ago
- [WACV2025] MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection☆26Updated last year
- [IJCNN 2024] Implicit Multi-Spectral Transformer: An Lightweight and Effective Visible to Infrared Image Translation Model☆39Updated last year
- [TCSVT2025] EI2Det: Edge-Guided Illumination-Aware Interactive Learning for Visible-Infrared Object Detection☆25Updated 8 months ago
- This is the offical repository for "Multi-modal Gated Mixture of Local-to-Global Experts for Dynamic Image Fusion" (ICCV 2023).☆68Updated last year
- official implementation of EME☆37Updated 6 months ago
- [CVPR2025] Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection☆24Updated last year
- [ICCV 2023 oral] Official repository of the paper "Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation"☆46Updated last year
- ☆91Updated last year
- Official Implementation of Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling☆182Updated 3 weeks ago
- M3SVD:Multi-Modal Multi-Scene Video Dataset☆48Updated 3 months ago
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆79Updated last year
- ☆16Updated last year
- Calibrated and Complementary Transformer for RGB-Infrared Object Detection☆96Updated last year
- [ICCV23] Official Implementation of CMDA: Cross-Modality Domain Adaptation for Nighttime Semantic Segmentation☆36Updated last year
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆186Updated 9 months ago
- ☆20Updated 7 months ago