Hyejin-Koo / EE4178
Introduction to Artificial Intelligence(Deep Learning)
☆7Updated 3 years ago
Alternatives and similar repositories for EE4178:
Users that are interested in EE4178 are comparing it to the libraries listed below
- For Korean speech emotion detect, this model is trained by Korean dataset. There is no enough Korean dataset, so i tried to make this rep…☆9Updated 2 years ago
- 한국어 음성인식 튜토리얼☆65Updated 4 years ago
- Korean ASR using PyTorch / Listen, Attend and Spell (LAS) / Seq2seq with Attention / Naver-A.I-Hackathon-Speech / A.I Hub Dataset / 한국…☆11Updated 5 years ago
- ☆13Updated 4 years ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Updated 2 years ago
- ☆83Updated 2 years ago
- 2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib☆22Updated 4 years ago
- Sound Source Localization for AI Grand Challenge 2021☆21Updated 3 years ago
- Look Who’s Talking: Active Speaker Detection in the Wild☆71Updated last year
- ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)☆220Updated 2 years ago
- 한국어 음성 인식을 위한 deep speech 2☆28Updated 4 years ago
- Sound Source Localization for AI Grand Challenge 2021☆21Updated 3 years ago
- 인명 구조용 드론을 위한 음성 화자 인지 기술☆33Updated 2 years ago
- Official implementation of Transpotter, published in BMVC 2021☆16Updated 2 years ago
- RNN-Transducer for korean☆41Updated 4 years ago
- 오디오 전처리 작업을 위한 연습☆25Updated 5 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆64Updated 3 years ago
- The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.☆32Updated last year
- Tacotron2 for Korean (taKotron2)☆34Updated 2 years ago
- A pytorch implementation of MFCC.☆33Updated 2 years ago
- Sound Source Localization for PCM-A10 Microphone☆35Updated 2 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆25Updated 2 years ago
- cross-modal model between audio(MFCC) and text(KoBERT)☆12Updated 4 years ago
- Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.☆29Updated last year
- STT 한글 문장 인식기 출력 스크립트의 외자 오류율(CER), 단어 오류율(WER)을 계산하는 Python 함수 패키지☆62Updated last year
- Tensorflow Implementation of "Slowing Down the Weight Norm Increase in Momentum-based Optimizers"☆47Updated 3 years ago
- Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.☆90Updated 3 years ago
- Audio Only Speech Enhancement using Unet☆9Updated 4 years ago
- Audio Signal Processing & Speech Recognition☆26Updated 4 years ago
- Faster R-CNN paper review and code implementation from chenyuntc☆12Updated 3 years ago