HanSeokhyeon / Deep_learning_for_Phoneme_recognition
Phoneme recognition using a variety of features and deep learning.
☆13 Updated 5 years ago
Alternatives and similar repositories for Deep_learning_for_Phoneme_recognition:
Users who are interested in Deep_learning_for_Phoneme_recognition are comparing it to the repositories listed below.
- End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra. ☆10 Updated 3 years ago
- Transformer implementation specialized in speech recognition tasks using PyTorch. ☆66 Updated 3 years ago
- A short tutorial on Keras for the co-utilization of audio and text data (multi-modal analysis) ☆16 Updated 2 years ago
- PyTorch implementation of automatic speech recognition models. ☆38 Updated 4 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean ☆23 Updated 7 years ago
- Korean speech recognition based on the Transformer ☆30 Updated 4 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019) ☆32 Updated 4 years ago
- PyTorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018. ☆19 Updated 4 years ago
- RNN-Transducer for Korean ☆40 Updated 4 years ago
- Refactored version of https://github.com/ming024/FastSpeech2 ☆13 Updated 3 years ago
- ☆16 Updated 4 months ago
- Convert Numerical Representations to Korean Pronunciation ☆14 Updated 4 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra. ☆45 Updated 3 years ago
- TensorFlow 2 Speech Recognition Code (Transformer) ☆25 Updated 4 years ago
- PyTorch based speaker embedding model ☆15 Updated 10 months ago
- ☆83 Updated 2 years ago
- ☆9 Updated last month
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining ☆12 Updated 3 years ago
- 2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib ☆22 Updated 4 years ago
- Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA) ☆13 Updated 4 years ago
- Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra. ☆35 Updated 3 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 Updated 4 years ago
- Repository for speech paper reading ☆33 Updated 3 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech" ☆64 Updated last year
- An evaluation toolkit for voice conversion models. ☆40 Updated 3 years ago
- Paper Review about Speech Recognition · NLP ☆9 Updated 3 years ago
- Korean ASR Corpus generated from TEDx talks ☆27 Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec… ☆43 Updated last year
- A package for crawling audio from YouTube ☆41 Updated last year
- Download and create a tfreader for the AudioSet dataset ☆16 Updated 4 years ago