YoucanBaby / MH-DETR
MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
☆16Updated 9 months ago
Alternatives and similar repositories for MH-DETR
Users that are interested in MH-DETR are comparing it to the libraries listed below
Sorting:
- ☆36Updated 10 months ago
- The simplest tyron pipeline!最简单的aigc换装算法!☆49Updated 3 months ago
- [ACM MM'2024] Official repository for "Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval"☆38Updated 4 months ago
- CoS: Chain-of-Shot Prompting for Long Video Understanding☆47Updated 3 months ago
- Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering"☆110Updated 7 months ago
- ☆318Updated last month
- ☆43Updated 3 weeks ago
- ☆152Updated 3 months ago
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval☆124Updated 8 months ago
- ☆208Updated last month
- https://www.kaggle.com/competitions/sorghum-id-fgvc-9☆19Updated 2 years ago
- Official repository of MMGenBench☆120Updated 2 months ago
- ☆91Updated last year
- [CVPR 2025] The code and model for our paper "Shadow Generation Using Diffusion Model with Geometry Prior", CVPR, 2025.☆109Updated 3 weeks ago
- linkedin, seek job information crawler☆103Updated 3 weeks ago
- Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer☆95Updated last week
- ☆50Updated last month
- [ICRA 2024] Official code for BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection☆2Updated 10 months ago
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆157Updated 6 months ago
- Domain Prompt Learning with Quaternion Networks (CVPR2024 Highlight)☆77Updated 4 months ago
- This is a useful development tool that supports mocking for both GraphQL and RESTful APIs.☆22Updated 10 months ago
- Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition☆311Updated 8 months ago
- ☆160Updated 7 months ago
- [IEEE Transactions on Multimedia 2024.] Lightweight-Adaptive-Feature-De-drifting-for-Compressed-Image-Classification.☆89Updated 5 months ago
- ☆132Updated last week
- Official implemetation of "Enhancing Close-up Novel View Synthesis via Pseudo-labeling" [AAAI 2025]☆13Updated last month
- Example project using universal links as deeplinks to switch iOS apps.☆13Updated 9 months ago
- 第五届字节跳动青训营后端进阶班-大项目极简版抖音-基于Kitex + Hertz + Gorm 的分布式视频APP服务端☆43Updated last year
- A PyTorch implementation of diffusion models built from scratch☆38Updated last month
- sisuolv / 2021--CCF-Big-Data-Computing-Intelligence-Contest--System-authentication-risk-prediction--1sthttps://www.datafountain.cn/competitions/537☆19Updated 2 years ago