dbstjswo505 / SQuiDNet
☆30Updated 2 years ago
Alternatives and similar repositories for SQuiDNet:
Users that are interested in SQuiDNet are comparing it to the libraries listed below
- Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval☆20Updated 3 years ago
- Dual-scale Doppler Attention for Human Identification☆10Updated 2 years ago
- Winning SubNetwork (WSN), Soft-SubNetwork (SoftNet)☆21Updated 10 months ago
- [ICLR'23] ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure☆16Updated 7 months ago
- ☆13Updated 3 weeks ago
- ☆14Updated 3 years ago
- HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue, EMNLP 2023 (long, findings) [STARLAB] Audio Enhancement for video-dial…☆11Updated last year
- Winning SubNetwork (WSN)☆27Updated last year
- [STARLAB] This repositery is a system to estimate scene complexity in video☆12Updated 2 months ago
- 비디오 기반 인공지능 대화시스템☆14Updated last year
- FRAG: Frequency Adaptive Group for Diffusion Video Editing (ICML 2024)☆29Updated 4 months ago
- Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (…☆24Updated 2 months ago
- ☆31Updated 2 years ago
- CCFDM reinforcement learning☆13Updated 3 years ago
- [ECCV'24] Official code for "BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation"☆15Updated 2 months ago
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'☆18Updated 11 months ago
- [IEEE Access 2022] AI for detecting BPPV disorders specified by beatings, torsional movements of the eyes☆11Updated 2 years ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆33Updated 4 months ago
- Code for ECCV 2022 paper "Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding"☆29Updated last year
- "Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022☆68Updated 2 years ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆63Updated 6 months ago
- Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding …☆42Updated 2 years ago
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Updated 2 years ago
- Causal Localization Network for Radar Human Localization with micro-Doppler signature☆13Updated 3 months ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆27Updated last year
- ICML 2024, Official Implementation of "Cross-view Masked Diffusion Transformers for Person Image Synthesis."☆27Updated 2 months ago
- ☆15Updated last month
- An official implementation for MS-DETR in ACL'23☆16Updated last year
- [ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval☆12Updated last year
- ☆16Updated 2 months ago