Niger-Volta-LTI / yoruba-voice
Repo & Project for the Imminent Research Grant code & tasks
☆12Updated last year
Alternatives and similar repositories for yoruba-voice
Users interested in yoruba-voice are comparing it to the libraries listed below.
- phone inventory library☆16Updated 2 years ago
- ☆17Updated 4 years ago
- Transfer learning approach to pronunciation scoring☆10Updated last year
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆15Updated last year
- IPA tokeniser☆17Updated last month
- Voice activity detection and speaker gender segmentation audiovisual corpus☆15Updated 7 months ago
- This is the experimental description of MnTTS2.☆11Updated last year
- Simple Kaldi recipe for forced alignment☆10Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 2 years ago
- ☆11Updated 3 years ago
- A Weakly Supervised Forced Alignment for disfluent speech☆14Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- Forced alignment decoder for Whisper.☆14Updated last year
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆35Updated last month
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆28Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆18Updated 9 months ago
- DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently☆11Updated last year
- Repository for multilingual speech data resources for native languages of Zambia.☆18Updated 10 months ago
- Text-to-Speech Latency Benchmark☆18Updated 2 months ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Updated 3 years ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆12Updated 5 years ago
- scripts for working with open.bible data☆25Updated 3 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆12Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆25Updated 5 months ago
- Mason-Alberta Phonetic Segmenter☆12Updated 7 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated 6 months ago
- ☆16Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆15Updated 2 years ago
- ☆25Updated 3 years ago
- Pybind11 bindings for Kaldi☆14Updated last month