warnikchow / coaudiotext
A short tutorial on Keras for the co-utilization of audio and text data (multi-modal analysis)
☆17Updated last year
Related projects ⓘ
Alternatives and complementary repositories for coaudiotext
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆13Updated 4 years ago
- Multimodal Transformer for Korean Sentiment Analysis with Audio and Text Features☆27Updated 3 years ago
- End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.☆10Updated 2 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆63Updated 2 years ago
- Prosody-semantics Interface in Seoul Korean☆12Updated 4 years ago
- PyTorch implementation of automatic speech recognition models.☆38Updated 3 years ago
- Refactored version of https://github.com/ming024/FastSpeech2☆13Updated 3 years ago
- Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)☆30Updated 3 years ago
- PyTorch based speaker embedding model☆15Updated 6 months ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Tensorflow 2 Speech Recognition Code (Transformer)☆25Updated 4 years ago
- A pakage for crawling audio from Youtube☆41Updated last year
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆35Updated 3 years ago
- ☆82Updated last year
- Jasper 기반 양자화된 모델인 Quartznet 한국어 음성인식☆21Updated 3 years ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆30Updated 3 years ago
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).☆42Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 3 years ago
- ☆9Updated last year
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Updated 3 years ago
- The python implementation for paper "Towards Discriminative Representation Learning for Speech Emotion Recognition" in IJCAI-2019☆22Updated 5 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- Using speaker embedding for diarization in PyTorch☆18Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Updated 6 years ago
- Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.☆19Updated 3 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Updated 3 years ago
- Accompany code to reproduce the baselines of the International Multimodal Sentiment Analysis Challenge (MuSe 2020).☆16Updated last year
- ☆16Updated 3 weeks ago