coqui-ai / open-bible-scripts
Scripts for working with open.bible data
☆24 Updated 3 years ago
Alternatives and similar repositories for open-bible-scripts:
Users that are interested in open-bible-scripts are comparing it to the libraries listed below
- Scripts to create speech corpora from open.bible ☆13 Updated 3 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages. ☆13 Updated last year
- check your data, before you wreck your model ☆16 Updated 2 years ago
- Linguistic processing for Common Voice ☆53 Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆20 Updated 11 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- ☆17 Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 3 years ago
- ☆11 Updated 3 years ago
- phone inventory library ☆16 Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆17 Updated 2 years ago
- African accented clinical and general domain TTS ☆10 Updated 8 months ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres… ☆21 Updated 11 months ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 Updated 4 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/… ☆35 Updated 2 months ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court ☆22 Updated 2 years ago
- Simple Telegram bot to annotate and verify automatic speech recognition datasets ☆12 Updated 3 years ago
- asr2k ☆49 Updated 9 months ago
- ☆34 Updated this week
- Forced Alignments for Common Voice ☆31 Updated 4 years ago
- Phonetically-Oriented Word Error Rate ☆33 Updated 5 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION ☆40 Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆44 Updated 3 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge ☆14 Updated 2 years ago
- Collection of scripts from mHuBERT-147. ☆24 Updated 3 months ago
- Forced alignment decoder for Whisper. ☆14 Updated 11 months ago
- Speech Recognition Challenge by Speech Lab - IIT Madras ☆11 Updated 4 years ago
- Word Error Rate Estimation ☆11 Updated 4 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices ☆16 Updated 11 months ago
- ☆33 Updated 8 months ago