JayParanjape / F-ViTA
Code for F-ViTA: Foundation Model Guided Visible to Thermal Translation
☆29 · Jun 29, 2025 · Updated 7 months ago
Alternatives and similar repositories for F-ViTA
Users who are interested in F-ViTA are comparing it to the repositories listed below.
- Official implementation of the paper "Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Analysis" ☆25 · Jan 29, 2025 · Updated last year
- Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance ☆13 · Nov 27, 2025 · Updated 2 months ago
- An unofficial PyTorch implementation of SuperThermal: Matching Thermal as Visible Through Thermal Feature Exploration ☆16 · Jun 9, 2023 · Updated 2 years ago
- DEAL: Data-Efficient Adversarial Learning for High-Quality Infrared Imaging (CVPR 25) ☆19 · Aug 20, 2025 · Updated 5 months ago
- ☆17 · Apr 9, 2025 · Updated 10 months ago
- ☆16 · Oct 29, 2024 · Updated last year
- ☆37 · May 4, 2022 · Updated 3 years ago
- Official repository for the paper "Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor" by Bhattacharya, et al. (2024) f… ☆35 · Feb 5, 2026 · Updated last week
- Open-Vocabulary Panoptic Segmentation ☆27 · Jun 15, 2025 · Updated 7 months ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc… ☆12 · Oct 14, 2024 · Updated last year
- Code for PID: Physics-Informed Diffusion Model for Infrared Image Generation ☆152 · Sep 16, 2025 · Updated 4 months ago
- ☆77 · Jan 10, 2025 · Updated last year
- This is the official GitHub page of the Multi-Spectral Stereo (MS2) dataset described in the CVPR 2023 paper. ☆81 · Jan 17, 2024 · Updated 2 years ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking ☆13 · Apr 12, 2023 · Updated 2 years ago
- DisTime: Distribution-based Time Representation for Video Large Language Models. ☆18 · Jul 10, 2025 · Updated 7 months ago
- The repository of VG-Refiner paper ☆17 · Dec 9, 2025 · Updated 2 months ago
- Official repo for UniRGB-IR. ☆49 · Nov 28, 2025 · Updated 2 months ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation ☆13 · Jun 17, 2024 · Updated last year
- ☆16 · Jul 29, 2025 · Updated 6 months ago
- ☆11 · Jun 30, 2025 · Updated 7 months ago
- ☆10 · Apr 7, 2025 · Updated 10 months ago
- Progressive Language-guided Visual Learning for Multi-Task Visual Grounding ☆13 · May 9, 2025 · Updated 9 months ago
- ☆10 · Jan 9, 2025 · Updated last year
- ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation. AAAI, 2025 ☆13 · Aug 25, 2025 · Updated 5 months ago
- Official repository for "Thermal Chameleon Net: Task-Adaptive Tone-mapping for Thermal-Infrared images" ☆12 · Nov 18, 2025 · Updated 2 months ago
- Agentic Keyframe Search for Video Question Answering ☆15 · Apr 7, 2025 · Updated 10 months ago
- ☆13 · Jan 21, 2025 · Updated last year
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition" ☆12 · Feb 27, 2024 · Updated last year
- ☆11 · Feb 4, 2024 · Updated 2 years ago
- The official repository of UVOSAM ☆13 · Jun 5, 2024 · Updated last year
- ☆10 · Mar 30, 2023 · Updated 2 years ago
- ROS thermal camera driver for Gige-V FLIR Thermal cameras supported by Spinnaker SDK ☆12 · Sep 6, 2023 · Updated 2 years ago
- Aggregate and Discriminate: Pseudo Clips-Guided Boundary Perception for Video Moment Retrieval ☆12 · Nov 25, 2024 · Updated last year
- This repository is an official implementation of the paper "A Simple Baseline for Open-World Tracking via Self-training". ☆10 · Jan 26, 2024 · Updated 2 years ago
- Source code of "LIVENet: A novel network for real-world low-light image denoising and enhancement", published in WACV 2024 ☆13 · Dec 20, 2023 · Updated 2 years ago
- Qwen-SAM is a reasoning-based segmentation model that integrates Qwen 2.5 VL 7B with the Segment Anything Model (SAM), enabling fine-grai… ☆24 · Jun 4, 2025 · Updated 8 months ago
- This is the model and the inference code provided with the paper "Narrowing the Synthetic-to-Real Gap for Thermal Infrared Semantic Image… ☆12 · Jun 14, 2024 · Updated last year
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting" ☆37 · Oct 9, 2025 · Updated 4 months ago
- ☆13 · Jun 21, 2022 · Updated 3 years ago