dbstjswo505 / SQuiDNet
☆30 Updated 2 years ago
Alternatives and similar repositories for SQuiDNet:
Users who are interested in SQuiDNet are comparing it to the repositories listed below
- Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval ☆20 Updated 3 years ago
- Dual-scale Doppler Attention for Human Identification ☆10 Updated 2 years ago
- Winning SubNetwork (WSN), Soft-SubNetwork (SoftNet) ☆21 Updated 11 months ago
- [ICLR'23] ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure ☆16 Updated 8 months ago
- ☆13 Updated 2 months ago
- HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue, EMNLP 2023 (long, findings) [STARLAB] Audio Enhancement for video-dial… ☆11 Updated last year
- ☆14 Updated 3 years ago
- ICML 2024, Official Implementation of "Cross-view Masked Diffusion Transformers for Person Image Synthesis." ☆28 Updated 3 months ago
- [ECCV'24] Official code for "BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation" ☆15 Updated 3 months ago
- Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (… ☆24 Updated 3 months ago
- FRAG: Frequency Adaptive Group for Diffusion Video Editing (ICML 2024) ☆29 Updated 5 months ago
- Video-based AI dialogue system ☆14 Updated last year
- [STARLAB] This repository is a system to estimate scene complexity in video ☆12 Updated 3 months ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval ☆35 Updated 5 months ago
- ☆24 Updated last month
- Causal Localization Network for Radar Human Localization with micro-Doppler signature ☆13 Updated 4 months ago
- [IEEE Access 2022] AI for detecting BPPV disorders specified by beatings, torsional movements of the eyes ☆11 Updated 2 years ago
- ☆31 Updated 2 years ago
- ☆16 Updated 3 months ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos' ☆27 Updated last year
- code for downloading videos from HowTo100M dataset ☆14 Updated 3 years ago
- [ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval ☆12 Updated last year
- Fast and Efficient MMD-based Fair PCA via Optimization over Stiefel Manifold (AAAI 2022) ☆12 Updated 2 years ago
- ☆15 Updated 2 months ago
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023) ☆49 Updated last year
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight) ☆63 Updated 7 months ago
- ☆24 Updated last month
- ☆26 Updated 5 months ago
- A reading list of papers about Visual Grounding. ☆31 Updated 2 years ago
- ☆16 Updated 3 years ago