Official repository of our work: MS-DETR: Multispectral Pedestrian Detection Transformer with Loosely Coupled Fusion and Modality-Balanced Optimization
☆24Sep 8, 2024Updated last year
Alternatives and similar repositories for MS-DETR
Users who are interested in MS-DETR are comparing it to the repositories listed below
- GM-DETR: Generalized Multispectral DEtection TRansformer with Efficient Fusion Encoder for Visible-Infrared Detection (Paddle&Torch)☆43Aug 27, 2024Updated last year
- [WACV2025] MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection☆27Dec 9, 2024Updated last year
- Code for the paper OmniFuse: Composite Degradation-Robust Image Fusion with Language-Driven Semantics.☆28Sep 16, 2025Updated 5 months ago
- Improving Multispectral Pedestrian Detection by Addressing Modality Imbalance Problems (ECCV 2020)☆124Sep 18, 2020Updated 5 years ago
- ☆62Jul 23, 2024Updated last year
- An enhanced version of the YOLOv4 detector, which is a middle fusion method and combines RGB and thermal images for pedestrian detection.☆22Jan 24, 2022Updated 4 years ago
- This is an official repository of our TFDet.☆68Dec 8, 2023Updated 2 years ago
- Uses visible and infrared images to train the network. This method copes better with dark environments.☆119May 18, 2023Updated 2 years ago
- Weakly Aligned Cross-Modal Learning for Multispectral Pedestrian Detection, ICCV, 2019☆75Aug 18, 2020Updated 5 years ago
- ICAFusion: Iterative Cross-Attention Guided Feature Fusion for Multispectral Object Detection, Pattern Recognition☆241Oct 26, 2025Updated 4 months ago
- ☆14Jan 9, 2025Updated last year
- ☆11Oct 18, 2022Updated 3 years ago
- A personal network disk based on cloudflare worker and R2 object storage, used to save and manage some files online☆10Dec 29, 2023Updated 2 years ago
- ☆11Jan 12, 2023Updated 3 years ago
- Repo for "Synergy of Sight and Semantics: Visual Intention Understanding with CLIP"☆12Mar 12, 2025Updated 11 months ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆12Oct 9, 2024Updated last year
- ☆12Nov 11, 2024Updated last year
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆47Dec 6, 2024Updated last year
- Cross-Modality Fusion Mechanism for Multispectral Object Detection☆13Oct 11, 2022Updated 3 years ago
- ☆13Oct 25, 2024Updated last year
- (2025' IJCV) This is the official implementation for the paper titled "FusionBooster: A Unified Image Fusion Boosting Paradigm".☆14Jul 23, 2025Updated 7 months ago
- ☆12Dec 5, 2022Updated 3 years ago
- ☆14Jun 21, 2023Updated 2 years ago
- Dual convolutional neural network with attention for image blind denoising (Multimedia Systems, 2024)☆15Oct 25, 2024Updated last year
- ☆12Sep 3, 2021Updated 4 years ago
- ☆11Apr 28, 2024Updated last year
- This is the official TensorFlow implementation of “PIAFusion: A Progressive Infrared and Visible Image Fusion Network Based on Illumination A…☆122Jun 12, 2024Updated last year
- ☆13Dec 6, 2024Updated last year
- ☆15May 21, 2024Updated last year
- ☆11Apr 27, 2022Updated 3 years ago
- PyTorch implementation of TSE attention☆16Jul 9, 2021Updated 4 years ago
- Package which implements the Natural Image Quality Evaluator (NIQE)☆16Oct 16, 2020Updated 5 years ago
- ☆17Dec 11, 2023Updated 2 years ago
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training☆16Jul 1, 2025Updated 8 months ago
- [ICIP 2020]"Multispectral Fusion for Object Detection with Cyclic Fuse-and-Refine Blocks"☆13Oct 6, 2020Updated 5 years ago
- CSANet: Cross-Temporal Interaction Symmetric Attention Network for Hyperspectral Image Change Detection☆12Sep 13, 2022Updated 3 years ago
- ☆16May 10, 2024Updated last year
- Source code of our paper titled "Multi-level Textual-Visual Alignment and Fusion Network for Multimodal Aspect-based Sentiment Analy…☆15Apr 23, 2024Updated last year