Niger-Volta-LTI / yoruba-voiceLinks
Repo & Project for the Imminent Research Grant code & tasks
☆12Updated last year
Alternatives and similar repositories for yoruba-voice
Users that are interested in yoruba-voice are comparing it to the libraries listed below
Sorting:
- ☆17Updated 4 years ago
- phone inventory library☆16Updated 2 years ago
- Transfer learning approach to pronunciation scoring☆10Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- ☆18Updated 3 years ago
- ☆11Updated last week
- IPA tokeniser☆18Updated 2 weeks ago
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆15Updated 6 months ago
- ☆11Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 2 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆25Updated 4 months ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆21Updated last year
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆15Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆27Updated last year
- ☆22Updated 11 months ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Updated 3 years ago
- Repository for Accent Recognition (Hackathon @SLT2022)☆33Updated last year
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated 6 months ago
- Text-to-Speech Latency Benchmark☆17Updated last month
- Repository for multilingual speech data resources for native languages of Zambia.☆18Updated 10 months ago
- Forced alignment decoder for Whisper.☆14Updated last year
- Simple Kaldi recipe for forced alignment☆10Updated 2 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆20Updated 2 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆12Updated last month
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆38Updated 2 years ago
- ☆16Updated last year
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Updated 4 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year