dbstjswo505 / HEARLinks
HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue, EMNLP 2023 (long, findings) [STARLAB] Audio Enhancement for video-dialogue system
☆19Updated last year
Alternatives and similar repositories for HEAR
Users that are interested in HEAR are comparing it to the libraries listed below
Sorting:
- SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval (ICCV'2023), [STARLAB] This repositery is a system to…☆21Updated 3 months ago
- 비디오 기반 인공지능 대화시스템☆14Updated last year
- ☆14Updated 3 years ago
- Winning SubNetwork (WSN), Soft-SubNetwork (SoftNet)☆21Updated last year
- ☆13Updated 6 months ago
- Dual-scale Doppler Attention for Human Identification☆18Updated 2 years ago
- [ICLR'23] ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure☆16Updated last year
- FRAG: Frequency Adaptive Group for Diffusion Video Editing (ICML 2024)☆34Updated 10 months ago
- [IEEE Access 2022] AI for detecting BPPV disorders specified by beatings, torsional movements of the eyes☆11Updated 2 years ago
- Causal Localization Network for Radar Human Localization with micro-Doppler signature☆23Updated 9 months ago
- Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval☆28Updated 3 years ago
- Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (…☆25Updated 8 months ago
- [ECCV'24] Official code for "BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation"☆16Updated 8 months ago
- ☆25Updated 4 months ago
- CCFDM reinforcement learning☆13Updated 3 years ago
- ☆38Updated 2 years ago
- ☆17Updated 8 months ago
- ☆25Updated 4 months ago
- MSIT AI Fair(MAF)☆39Updated 2 months ago
- ☆38Updated 6 months ago
- AI Development in Evolving Policy [AI DEP]☆46Updated last week
- Fast and Efficient MMD-based Fair PCA via Optimization over Stiefel Manifold (AAAI 2022)☆12Updated 2 years ago
- This repository is the official implementation of the paper: Physics Informed Distillation for Diffusion Models, accepted by Transactions…☆27Updated 6 months ago
- Winning SubNetwork (WSN)☆31Updated last year
- ICML 2024, Official Implementation of "Cross-view Masked Diffusion Transformers for Person Image Synthesis."☆29Updated 8 months ago
- [ECCV 2024] FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing☆35Updated last week
- ☆17Updated 2 years ago
- This is an official implementation of our work, Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on V…☆13Updated 6 months ago
- ☆31Updated last week
- LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos. (CVPR 2025))☆40Updated last month