indonesian-nlp / wav2vec2-indonesian
☆18Updated 3 years ago
Alternatives and similar repositories for wav2vec2-indonesian:
Users that are interested in wav2vec2-indonesian are comparing it to the libraries listed below
- Multilingual Speech Recognition for Indonesian Languages☆61Updated 2 years ago
- Indonesian Grapheme-to-Phoneme (IPA notation)☆31Updated last year
- A curated list of research papers and resources on Indonesian languages☆39Updated 11 months ago
- Welcome to our repository! This repository hosts the data on "IndoCollex: A Testbed for Morphological Transformation of Indonesian Word …☆20Updated 3 years ago
- Repository ini berisikan kumpulan data mentah berupa artikel dari berbagai media online di Indonesia. (Raw dataset of Indonesian news art…☆41Updated 5 years ago
- Indonesian TTS (text-to-speech) using Coqui TTS☆67Updated 2 years ago
- NLP Datasets for Indonesian☆112Updated 2 years ago
- Automatic Speech Recognition for Indonesian☆16Updated 3 years ago
- CLIP (Contrastive Language–Image Pre-training) trained on Indonesian data☆19Updated 3 years ago
- A dataset for Indonesian Named Entity Recognizer☆29Updated 4 years ago
- Indonesian Language Models and its Usage☆157Updated last year
- g2p ID: Indonesian Grapheme-to-Phoneme Converter☆17Updated 2 months ago
- A benchmark dataset for Indonesian text summarization.☆77Updated 5 years ago
- Automatic speech recognition (ASR) for Indonesian language built by using HTK and Julius. Web interface is built using Node.js.☆20Updated 8 years ago
- Quora Paraphrasing Dataset Bahasa Indonesia Version☆11Updated 3 years ago
- NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented an…☆25Updated 4 months ago
- High-quality parallel resource on sentiment analysis for 10 low-resource Indonesian languages, English, and Indonesian (Outstanding Paper…☆97Updated last year
- ☆97Updated 6 years ago
- Toolkit for Indobenchmark☆15Updated last year
- The first large-scale summarization corpus for the Indonesian language. AACL 2020.☆35Updated 3 years ago
- IndoLEM is a comprehensive Indonesian NLU benchmark, comprising three pillars NLP task: morpho-syntax, semantic, and discourse. Presented…☆96Updated 4 years ago
- English conversation corpus for conversational TTS.☆20Updated last year
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆14Updated 2 weeks ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- A pipeline framework for data science projects☆11Updated 2 years ago
- ☆12Updated 3 weeks ago
- Benchmarking Multidomain English-Indonesian Machine Translation☆16Updated 4 years ago
- Dependency Parser and NER model for Bahasa Indonesia Spacy 2.1☆20Updated 4 years ago
- A list of Indonesian NLP resources.☆279Updated 3 years ago
- IndoBERTweet is the first large-scale pretrained model for Indonesian Twitter. Published at EMNLP 2021 (main conference)☆61Updated 3 years ago