indonesian-nlp / wav2vec2-indonesian
☆18 Updated 4 years ago
Alternatives and similar repositories for wav2vec2-indonesian
Users that are interested in wav2vec2-indonesian are comparing it to the libraries listed below
- Multilingual Speech Recognition for Indonesian Languages ☆66 Updated 3 years ago
- Indonesian Grapheme-to-Phoneme (IPA notation) ☆38 Updated last year
- Automatic speech recognition (ASR) for Indonesian language built by using HTK and Julius. Web interface is built using Node.js. ☆21 Updated 8 years ago
- g2p ID: Indonesian Grapheme-to-Phoneme Converter ☆27 Updated 10 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- Hosts text-to-speech corpus and speech synthesizers for African languages. ☆17 Updated 2 years ago
- A curated list of research papers and resources on Indonesian languages ☆39 Updated last year
- Automatic Speech Recognition for Indonesian ☆18 Updated 4 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆24 Updated 4 years ago
- Word Error Rate Estimation ☆15 Updated 5 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 Updated last year
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings. ☆10 Updated 2 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models. ☆88 Updated 3 years ago
- Welcome to our repository! This repository hosts the data on "IndoCollex: A Testbed for Morphological Transformation of Indonesian Word … ☆23 Updated 4 years ago
- Scripts to create speech corpora from open.bible ☆13 Updated 3 years ago
- Scripts for working with open.bible data ☆25 Updated 3 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances ☆50 Updated last year
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆76 Updated 4 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆29 Updated 2 weeks ago
- ☆48 Updated 2 years ago
- NLP Datasets for Indonesian ☆122 Updated 2 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. ☆12 Updated 2 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 4 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆53 Updated 2 years ago
- ☆39 Updated 3 years ago
- Workflow for forced alignment between languages ☆21 Updated last year
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs… ☆32 Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆81 Updated 2 years ago
- Whisper fine-tuning event script to use multiple hf datasets ☆32 Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ☆24 Updated last year