kakaobrain / jejueo
Jejueo Datasets for Machine Translation and Speech Synthesis
β76Updated 4 years ago
Related projects β
Alternatives and complementary repositories for jejueo
- Korean text normalization and language preparation package for LM in Kaldi-based ASR systemβ59Updated 4 years ago
- π Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeedβ31Updated 2 years ago
- Korean Speech to English Translation Corpusβ42Updated 3 years ago
- 5-class Korean speech emotion classifierβ30Updated last year
- β9Updated last week
- ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)β218Updated 2 years ago
- β70Updated 3 years ago
- Korean grapheme-to-phone conversion in Pythonβ127Updated 4 years ago
- PyTorch v1.2μμ μκΈ΄ Transformer API λ₯Ό μ΄μ©ν κ°λ¨ν Chitchat μ±λ΄β49Updated 5 years ago
- Structured argument extraction for Koreanβ22Updated 2 years ago
- β82Updated last year
- β18Updated 3 years ago
- Korean ALBERTβ47Updated 5 years ago
- Real-time automatic word segmentation (for user-generated texts)β21Updated last year
- Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.β89Updated 2 years ago
- Adversarially Trained End-to-end Korean SInging Voice Synthesis Systemβ54Updated 4 years ago
- g2pK: g2p module for Koreanβ236Updated 2 years ago
- Korean-English Bilingual Electra Modelsβ109Updated 2 years ago
- xor activationβ26Updated 4 years ago
- Intonation-aided intention identification for Koreanβ85Updated 2 years ago
- λ€μ΄λ² λ΄μ€ μ€ IT/κ³Όν λΆμΌμμ 50κ°λ₯Ό μ μ ν΄μ μμ½μ ν΄λΉνλ λ¬Έμ₯μ νκΉ ν΄λ λ°μ΄ν°μ μ λλ€.β39Updated 7 years ago
- Korean ASR Corpus generated from TEDx talksβ27Updated 5 years ago
- Recurrent Neural Network based Hate Speech Language Model for Korean Hate Speech Detectionβ24Updated 4 years ago
- Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.β19Updated 3 years ago
- π¦ νμ΄μ¬ νκΈ μ²λ¦¬ λΌμ΄λΈλ¬λ¦¬. Python Korean Morphological Analyzerβ20Updated last month
- Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Languageβ43Updated 6 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)β116Updated 4 years ago