warnikchow / coaudiotextLinks
A short tutorial on Keras for the co-utilization of audio and text data (multi-modal analysis)
☆16Updated 2 years ago
Alternatives and similar repositories for coaudiotext
Users that are interested in coaudiotext are comparing it to the libraries listed below
Sorting:
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆13Updated 5 years ago
- End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.☆10Updated 3 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆64Updated 3 years ago
- Multimodal Transformer for Korean Sentiment Analysis with Audio and Text Features☆28Updated 3 years ago
- PyTorch implementation of automatic speech recognition models.☆38Updated 4 years ago
- ☆87Updated 2 years ago
- Prosody-semantics Interface in Seoul Korean☆12Updated 4 years ago
- Convert Numerical Representations to Korean Pronunciation☆14Updated 5 years ago
- Jasper 기반 양자화된 모델인 Quartznet 한국어 음성인식☆21Updated 4 years ago
- Korean text normalization and language preparation package for LM in Kaldi-based ASR system☆62Updated 5 years ago
- PyTorch based speaker embedding model☆16Updated last year
- Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)☆30Updated 4 years ago
- 2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib☆22Updated 5 years ago
- 🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed☆31Updated 3 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Updated 4 years ago
- ☆37Updated 4 years ago
- Korean Speech to English Translation Corpus☆44Updated 4 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated 2 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆46Updated 4 years ago
- Review of papers I read☆14Updated 4 years ago
- A pakage for crawling audio from Youtube☆42Updated 2 years ago
- 발화자 지정 모듈☆21Updated 6 months ago
- An implementation of RNN-Transducer loss in TF-2.0.☆45Updated 2 years ago
- TTS for pitch-accented language. Korean dialect DB.☆157Updated 2 years ago
- Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.☆19Updated 4 years ago
- cross-modal model between audio(MFCC) and text(KoBERT)☆12Updated 4 years ago
- Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA)☆13Updated 4 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Updated 4 years ago
- ☆51Updated 4 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Updated 4 years ago