Hyejin-Koo / EE4178
Introduction to Artificial Intelligence(Deep Learning)
☆7Updated 3 years ago
Related projects: ⓘ
- Look Who’s Talking: Active Speaker Detection in the Wild☆70Updated last year
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆19Updated 2 years ago
- Sound Source Localization for AI Grand Challenge 2021☆21Updated 2 years ago
- ☆17Updated 4 months ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆9Updated 11 months ago
- ☆21Updated 3 years ago
- Cross attentive pooling for speaker verification (IEEE SLT, 2021)☆12Updated 3 years ago
- The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.☆32Updated 7 months ago
- PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin…☆12Updated 5 months ago
- Sound Source Localization for AI Grand Challenge 2021☆21Updated 2 years ago
- Audio event detection model based on YOLOX☆84Updated last year
- Official code for Metric learning for user-defined keyword spotting☆19Updated 7 months ago
- Augmentation adversarial training for self-supervised speaker recognition☆77Updated 3 years ago
- Official implementation of Transpotter, published in BMVC 2021☆12Updated 2 years ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)☆81Updated last year
- A pytorch implementation of MFCC.☆34Updated 2 years ago
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆73Updated 4 years ago
- 오디오 전처리 작업을 위한 연습☆25Updated 5 years ago
- ☆12Updated 3 months ago
- PyTorch implementation of "Distinguishing Homophenes using Multi-Head Visual-Audio Memory" (AAAI2022)☆24Updated 6 months ago
- Korean ASR using PyTorch / Listen, Attend and Spell (LAS) / Seq2seq with Attention / Naver-A.I-Hackathon-Speech / A.I Hub Dataset / 한국…☆10Updated 4 years ago
- Code for the Active Speakers in Context Paper (CVPR2020)☆53Updated 3 years ago
- ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)☆218Updated 2 years ago
- Grand Challenge 4 track 2 sourcecode developed by GIST☆13Updated 3 years ago
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)☆51Updated 2 months ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆63Updated 2 years ago
- cross-modal model between audio(MFCC) and text(KoBERT)☆12Updated 3 years ago
- ☆82Updated last year
- Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"☆112Updated 3 years ago
- ☆61Updated last week