dbstjswo505 / HEAR
HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue, EMNLP 2023 (long, findings) [STARLAB] Audio Enhancement for video-dialogue system
☆11Updated 8 months ago
Related projects:
- Video-based AI dialogue system☆14Updated 8 months ago
- [STARLAB] This repository is a system to estimate scene complexity in video☆12Updated 8 months ago
- ☆14Updated 2 years ago
- [IEEE Access 2022] AI for detecting BPPV disorders specified by beatings, torsional movements of the eyes☆9Updated last year
- ☆13Updated last year
- [ICLR'23] ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure☆16Updated 3 months ago
- Winning SubNetwork (WSN), Soft-SubNetwork (SoftNet)☆21Updated 6 months ago
- CCFDM reinforcement learning☆12Updated 2 years ago
- Dual-scale Doppler Attention for Human Identification☆11Updated last year
- Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval☆20Updated 2 years ago
- FRAG: Frequency Adaptive Group for Diffusion Video Editing (ICML 2024)☆19Updated 2 weeks ago
- Fast and Efficient MMD-based Fair PCA via Optimization over Stiefel Manifold (AAAI 2022)☆11Updated last year
- ☆28Updated last year
- Winning SubNetwork (WSN)☆26Updated 8 months ago
- Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (…☆10Updated 2 months ago
- This is the official repository for Dual Temperature Helps Contrastive Learning without Many Negative Samples (CVPR2022)☆29Updated last year
- ☆13Updated 9 months ago
- Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos☆13Updated 4 months ago
- ☆61Updated 9 months ago
- ☆10Updated 2 months ago
- Research code for NeurIPS 2023 paper "Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser"☆15Updated 11 months ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆16Updated 3 weeks ago
- ☆17Updated last year
- Official PyTorch Implementation for CVPR'23 Paper, "The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training"☆18Updated 9 months ago
- ☆17Updated last year
- ☆17Updated last year
- Archive for AI grand challenge☆21Updated last year
- cross modal background suppression for audio-visual event localization☆34Updated 2 years ago
- Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)☆13Updated 2 years ago
- [WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"☆11Updated 3 weeks ago