indonesian-nlp / wav2vec2-indonesianLinks
☆20Updated 4 years ago
Alternatives and similar repositories for wav2vec2-indonesian
Users that are interested in wav2vec2-indonesian are comparing it to the libraries listed below
Sorting:
- Multilingual Speech Recognition for Indonesian Languages☆67Updated 3 years ago
- Indonesian Grapheme-to-Phoneme (IPA notation)☆40Updated 2 years ago
- Automatic Speech Recognition for Indonesian☆18Updated 4 years ago
- Automatic speech recognition (ASR) for Indonesian language built by using HTK and Julius. Web interface is built using Node.js.☆21Updated 9 years ago
- Welcome to our repository! This repository hosts the data on "IndoCollex: A Testbed for Morphological Transformation of Indonesian Word …☆23Updated 4 years ago
- A curated list of research papers and resources on Indonesian languages☆40Updated last year
- g2p ID: Indonesian Grapheme-to-Phoneme Converter☆27Updated last year
- NLP Datasets for Indonesian☆124Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Updated last year
- Indonesian TTS (text-to-speech) using Coqui TTS☆84Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).☆15Updated 2 years ago
- Quora Paraphrasing Dataset Bahasa Indonesia Version☆11Updated 4 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- The first large-scale summarization corpus for the Indonesian language. AACL 2020.☆38Updated 4 years ago
- ☆11Updated 4 years ago
- scipts for working with open.bible data☆26Updated 3 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 5 years ago
- Repository ini berisikan kumpulan data mentah berupa artikel dari berbagai media online di Indonesia. (Raw dataset of Indonesian news art…☆41Updated 6 years ago
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Updated 2 years ago
- Indonesian law dataset containing section annotation of court decision documents☆16Updated 3 years ago
- Word Error Rate Estimation☆15Updated 5 years ago
- A handy dataset of noises for ASR☆22Updated 6 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 4 years ago
- ☆19Updated 3 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆17Updated 2 years ago
- Toolkit for Indobenchmark☆22Updated last year
- asr2k☆52Updated last year
- Scripts to create speech corpora from open.bible☆13Updated 3 years ago