Hyejin-Koo / EE4178
Introduction to Artificial Intelligence (Deep Learning)
☆7Updated 3 years ago
Related projects
Alternatives and complementary repositories for EE4178
- Look Who’s Talking: Active Speaker Detection in the Wild☆72Updated last year
- ☆19Updated last week
- Sound Source Localization for AI Grand Challenge 2021☆21Updated 2 years ago
- Grand Challenge 4 track 2 source code developed by GIST☆13Updated 3 years ago
- Audio Only Speech Enhancement using Unet☆9Updated 3 years ago
- ☆21Updated 3 years ago
- Voice-based speaker recognition technology for rescue drones☆34Updated last year
- Audio event detection model based on YOLOX☆85Updated last year
- ☆82Updated last year
- Practice exercises for audio preprocessing☆25Updated 5 years ago
- Cross attentive pooling for speaker verification (IEEE SLT, 2021)☆12Updated 3 years ago
- Cross-modal model between audio (MFCC) and text (KoBERT)☆12Updated 3 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆77Updated 3 years ago
- Korean ASR using PyTorch / Listen, Attend and Spell (LAS) / Seq2seq with Attention / Naver-A.I-Hackathon-Speech / A.I Hub Dataset / 한국…☆10Updated 4 years ago
- ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)☆218Updated 2 years ago
- Sound Source Localization for PCM-A10 Microphone☆36Updated last year
- 3rd Grand Challenge track 3 DB developed by GIST☆36Updated 3 years ago
- Transformer implementation specialized in speech recognition tasks using PyTorch.☆64Updated 2 years ago
- A PyTorch implementation of MFCC.☆33Updated 2 years ago
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆73Updated 4 years ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆19Updated 2 years ago
- RNN-Transducer for Korean☆39Updated 4 years ago
- Korean speech recognition tutorial☆64Updated 4 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Updated last year
- 2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib☆23Updated 4 years ago
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆35Updated 4 years ago
- All code implemented for Korean voice phishing detection papers☆8Updated 4 months ago
- ☆62Updated 2 months ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆24Updated 2 years ago