dbstjswo505 / SQuiDNet
☆30Updated 2 years ago
Alternatives and similar repositories for SQuiDNet:
Users who are interested in SQuiDNet are comparing it to the repositories listed below
- Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval☆20Updated 3 years ago
- Dual-scale Doppler Attention for Human Identification☆10Updated 2 years ago
- ☆13Updated 3 months ago
- Winning SubNetwork (WSN), Soft-SubNetwork (SoftNet)☆21Updated last year
- [ICLR'23] ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure☆16Updated 9 months ago
- Video-based AI dialogue system☆14Updated last year
- [STARLAB] This repository is a system to estimate scene complexity in video☆12Updated 4 months ago
- HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue, EMNLP 2023 (long, findings) [STARLAB] Audio Enhancement for video-dial…☆11Updated last year
- ☆14Updated 3 years ago
- FRAG: Frequency Adaptive Group for Diffusion Video Editing (ICML 2024)☆31Updated 6 months ago
- ICML 2024, Official Implementation of "Cross-view Masked Diffusion Transformers for Person Image Synthesis."☆28Updated 4 months ago
- [ECCV'24] Official code for "BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation"☆16Updated 4 months ago
- [IEEE Access 2022] AI for detecting BPPV disorders specified by beatings, torsional movements of the eyes☆11Updated 2 years ago
- Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (…☆24Updated 4 months ago
- ☆31Updated 3 years ago
- Causal Localization Network for Radar Human Localization with micro-Doppler signature☆13Updated 6 months ago
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)☆50Updated last year
- Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).☆47Updated 2 years ago
- ☆16Updated 4 months ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆66Updated 9 months ago
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Updated 2 years ago
- Winning SubNetwork (WSN)☆28Updated last year
- Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.☆10Updated 2 years ago
- ☆26Updated 4 years ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆35Updated 7 months ago
- [NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding☆47Updated last year
- "Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022☆69Updated 2 years ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆18Updated 2 months ago
- An official implementation for MS-DETR in ACL'23☆16Updated last year
- ☆25Updated 2 weeks ago