turinaf/Sagalee

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/turinaf/Sagalee)

turinaf / Sagalee

Automatic Speech Recognition Dataset for Oromo Language

☆30

Alternatives and similar repositories for Sagalee

Users that are interested in Sagalee are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
dadelani / menyo-20k_MT
View on GitHub
☆11Jul 12, 2021Updated 5 years ago
changelinglab / prism
View on GitHub
A toolkit and benchmark for evaluating phonetic capabilities of speech models.
☆18Apr 10, 2026Updated 3 months ago
unza-speech-lab / zambezi-voice
View on GitHub
Repository for multilingual speech data resources for native languages of Zambia.
☆22Oct 9, 2024Updated last year
hlt-mt / Speech-MASSIVE
View on GitHub
Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…
☆25Oct 8, 2025Updated 9 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
neulab / AfricanVoices
View on GitHub
Hosts text-to-speech corpus and speech synthesizers for African languages.
☆19May 31, 2023Updated 3 years ago
Picovoice / text-to-speech-benchmark
View on GitHub
Text-to-Speech Benchmark
☆26Apr 2, 2026Updated 3 months ago
ajesujoba / UNIQORN
View on GitHub
☆13Jul 30, 2024Updated last year
yfyeung / DS-WED
View on GitHub
[ICASSP 2026] Official code for "Measuring Prosody Diversity in Zero-Shot TTS: A New Metric, Benchmark, and Exploration"
☆17Apr 16, 2026Updated 3 months ago
ahaliassos / usr
View on GitHub
Official implementation of USR (NeurIPS 2024)
☆40Dec 21, 2024Updated last year
lourson1091 / audiobertscore
View on GitHub
☆15Nov 10, 2025Updated 8 months ago
pier-maker92 / ADT_STR
View on GitHub
Automatic Drum Transcription with CLAP-based unsupervised sample curation for synthetic training data generation.
☆18May 5, 2026Updated 2 months ago
taf2 / pocket-tts.c
View on GitHub
Pocket TTS but pure C implementation inspired by Flux2.c
☆22Feb 14, 2026Updated 5 months ago
ozspeech / OZSpeech
View on GitHub
[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching
☆45Feb 9, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
uds-lsv / afro-maft
View on GitHub
☆17Jan 12, 2023Updated 3 years ago
changelinglab / PhoneticXeus
View on GitHub
A universal phone recognizer that can transcribe speech in 70+ languages into IPA
☆29Jun 9, 2026Updated last month
gfdb / wav2aug
View on GitHub
A general purpose task-agnostic speech augmentation policy
☆16Mar 13, 2026Updated 4 months ago
huangruizhe / audio
View on GitHub
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆10Sep 30, 2024Updated last year
X-LANCE / LSCodec-Inference
View on GitHub
Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"
☆36Oct 23, 2025Updated 9 months ago
BetaMasaheft / Manuscripts
View on GitHub
Manuscripts descriptions
☆17Updated this week
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
llm-lab-org / CLASP
View on GitHub
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval
☆13Jun 27, 2025Updated last year
liu12366262626 / AlignVSR
View on GitHub
Visual Speech Recongnition
☆21Dec 24, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
csikasote / BembaSpeech
View on GitHub
This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…
☆41Jul 31, 2025Updated 11 months ago
qiuqiao / DDSP-HiFiGAN
View on GitHub
基于PC-DDSP和nsf-HiFiGAN的声码器
☆19Jul 17, 2023Updated 3 years ago
TartuNLP / tts_preprocess_et
View on GitHub
Estonian text-to-speech text normalization pipeline
☆14Dec 17, 2025Updated 7 months ago
bethelmelesse / UnifiedCrawl
View on GitHub
☆17Nov 26, 2024Updated last year
PyThaiNLP / thai-g2p-wiktionary-corpus
View on GitHub
Thai Grapheme to Phoneme (G2P) Wiktionary Corpus
☆13Jul 25, 2022Updated 4 years ago
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆16Mar 13, 2024Updated 2 years ago
AAUThematic4LT / Parallel-Corpora-for-Ethiopian-Languages
View on GitHub
☆16Dec 11, 2019Updated 6 years ago
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
hitz-zentroa / whisper-lm
View on GitHub
Add n-gram and large language model (LLM) support to Whisper models.
☆43May 6, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
kerodkibatu / ML-DesktopOrganizer
View on GitHub
☆13Oct 23, 2024Updated last year
coryshain / dnnseg
View on GitHub
☆11Mar 20, 2021Updated 5 years ago
hi-paris / CosyVoice2-EU
View on GitHub
Europeanized CosyVoice2 for French & German
☆17Mar 30, 2026Updated 3 months ago
mipuc / hts-engine-world
View on GitHub
☆17Nov 17, 2020Updated 5 years ago
slp-rl / WhiStress
View on GitHub
The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)
☆39Jul 24, 2025Updated last year
Bekacru / better-fetch-docs
View on GitHub
☆11Jul 19, 2024Updated 2 years ago
tongshuangwu / llm-crowdsourcing-pipeline
View on GitHub
☆11Jul 6, 2023Updated 3 years ago