YoucanBaby / MH-DETR
MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
☆18 Updated last year
Alternatives and similar repositories for MH-DETR
Users that are interested in MH-DETR are comparing it to the libraries listed below
- [CVPR 2025 Highlight] Official Implementation of SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity ☆110 Updated 3 months ago
- ☆206 Updated 4 months ago
- ☆66 Updated last month
- **Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos. ☆182 Updated last week
- Official repository of MMGenBench ☆120 Updated 6 months ago
- Practical New Tasks and Inspiring Modeling Solutions for Diverse Open Vision Problems ☆56 Updated 3 weeks ago
- The summary of code and paper for unified model towards context-dependent (CD) concept segmentation. ☆116 Updated last month
- (IJCV 2024 & ACM MM 2021 Oral) Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation ☆119 Updated 3 years ago
- PySegMetrics (PSM): A Python-based Simple yet Efficient Evaluation Toolbox for Segmentation-like tasks ☆123 Updated last year
- A curated collection of AI+X papers published in Nature / Science / Cell / Lancet / Radiology and their flagship sub-journals ☆55 Updated last month
- The code for TPAMI paper "Text-Guided Human Image Manipulation via Image-Text Shared Space" ☆86 Updated 3 years ago
- Extended Agriculture-Vision Dataset: A continuous work of Agriculture-Vision, with great collaborators to bring Agriculture and Computer … ☆244 Updated 7 months ago
- EmoBench-M: A benchmark for evaluating Emotional Intelligence in Multimodal Large Language Models. ☆114 Updated last month
- (CVPR 2024 & arXiv 2025) Power Battery Detection ☆306 Updated this week
- Butter is a novel 2D object detection framework designed to enhance hierarchical feature representations for improved detection robustnes… ☆83 Updated last month
- PyTorch implementation for "Unlearning the Noisy Correspondence Makes CLIP More Robust (ICCV 2025)" ☆40 Updated last week
- [ACM MM'2024] Official repository for "Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval" ☆40 Updated 9 months ago
- CVPR2025 ☆43 Updated 6 months ago
- (ICML 2024) Spider: A Unified Framework for Context-dependent Concept Segmentation ☆348 Updated 6 months ago
- Official Code of MICCAI'24 early accepted paper "LGRNet: Local-Global Reciprocal Network for Uterine Fibroid Segmentation in Ultrasound Vi… ☆169 Updated last year
- Code for paper 'Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity… ☆92 Updated last year
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection ☆484 Updated 2 months ago
- ☆191 Updated 2 weeks ago
- CAASR: A Real-World Animation Super-Resolution Benchmark with Color Degradation and Multi-Scale Multi-Frequency Alignment ☆82 Updated last month
- [Accepted by Information Fusion] Official code of the paper "Relational Representation Learning Network for Cross-Spectral Image Patch Ma… ☆34 Updated 2 weeks ago
- Residual Kolmogorov-Arnold Network (RKAN) is designed to enhance the performance of classic deep learning models. ☆271 Updated last month
- A collection of diffusion inversion methods. ☆95 Updated last month
- ☆209 Updated 2 months ago
- Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dep… ☆566 Updated last month
- A PyTorch implementation of diffusion models built from scratch ☆38 Updated 5 months ago