egorsmkv/ukrainian-tts-datasets

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/egorsmkv/ukrainian-tts-datasets)

egorsmkv / ukrainian-tts-datasets

🇺🇦 Open Source Ukrainian Text-to-Speech datasets

☆30

Alternatives and similar repositories for ukrainian-tts-datasets

Users that are interested in ukrainian-tts-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆16Mar 13, 2024Updated 2 years ago
cpuimage / Tacotron-2
View on GitHub
Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)
☆11Jul 12, 2019Updated 6 years ago
Mddct / transformer-vocos
View on GitHub
☆36Sep 6, 2025Updated 10 months ago
pengzhendong / wavesurfer
View on GitHub
For audio visualization and playback in Jupyter notebooks.
☆18Nov 25, 2025Updated 7 months ago
pengzhendong / audiolab
View on GitHub
A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)
☆39Mar 31, 2026Updated 3 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
xicri / genshin-langdata
View on GitHub
English-Chinese-Japanese translation dataset of the terms in Genshin Impact
☆42Updated this week
bigcash / spleeter-pytorch-mnn
View on GitHub
convert spleeter pretrained model to pytorch and onnx, then convert to mnn
☆21Dec 17, 2020Updated 5 years ago
pengzhendong / audio-pipeline
View on GitHub
☆23Oct 17, 2024Updated last year
shigabeev / russian_tts_normalization
View on GitHub
Fast Russian Text normalization for TTS using only RegEx.
☆34Jun 27, 2026Updated 2 weeks ago
patriotyk / narizaka
View on GitHub
Tool to make high quality text to speech (tts) corpus from audio + text books.
☆28Jul 31, 2025Updated 11 months ago
magicse / ncnn-hifi-GAN
View on GitHub
ncnn HiFi-GAN
☆30Sep 29, 2024Updated last year
P2Oileen / CitationHelper
View on GitHub
Google Scholar自搜小脚本，每次开启命令行即显示当前citation。Small Script displaying current citation count each time the shell is opened.
☆21Mar 3, 2025Updated last year
PaulAnnekov / uasiren
View on GitHub
Implements siren.pp.ua API - public wrapper for api.ukrainealarm.com API that returns info about Ukraine air-raid alarms.
☆22May 25, 2022Updated 4 years ago
multimodal-art-projection / Open-Suno
View on GitHub
trying to reproduce suno v3
☆34Jan 29, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
lifeiteng / Aligner-SUPERB
View on GitHub
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
☆38May 7, 2025Updated last year
Ereboas / MagiCodec
View on GitHub
A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.
☆118Jun 4, 2025Updated last year
lang-uk / ukrainian-word-stress
View on GitHub
Adds word stress to Ukrainian texts
☆62Sep 29, 2024Updated last year
pengzhendong / torchfa
View on GitHub
Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.
☆61Sep 5, 2025Updated 10 months ago
zhai-lw / SQCodec
View on GitHub
A lightweight audio codec based on a single quantizer
☆71Aug 15, 2025Updated 10 months ago
magicse / GFPGANv1.3-to-ncnn
View on GitHub
The GFPGAN network consists of two networks. Actually GFPGAN and StyleGAN2
☆41Sep 22, 2022Updated 3 years ago
nguyenvulebinh / spoken-norm
View on GitHub
Transformation spoken text to written text
☆31May 14, 2024Updated 2 years ago
bamboolife / SoundTouch
View on GitHub
Android使用SoundTouch实现音频的变调变速
☆32Dec 21, 2019Updated 6 years ago
ex3ndr / supervoice-gpt
View on GitHub
GPT-style network for phonemization with durations of text
☆69Mar 21, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mcf330 / efts2code
View on GitHub
source code of EfficientTTS 2
☆21Feb 18, 2024Updated 2 years ago
Soul-AILab / SoulX-Singer-Eval
View on GitHub
A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis
☆32Feb 11, 2026Updated 5 months ago
facebookresearch / WavFlow
View on GitHub
MultiModal Audio Generation in Raw Waveform Space.
☆154May 26, 2026Updated last month
weishengying / tiny-flash-attention
View on GitHub
使用 cutlass 实现 flash-attention 精简版，具有教学意义
☆59Aug 12, 2024Updated last year
yataoz / face_reenact_GDPW
View on GitHub
Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation
☆12Jan 6, 2023Updated 3 years ago
lorniu / bilibili.el
View on GitHub
Emacs 中看 B 站
☆10Jul 27, 2025Updated 11 months ago
7Xin / DPI-TTS
View on GitHub
☆13Sep 12, 2024Updated last year
haoheliu / SemantiCodec
View on GitHub
☆45Jun 11, 2024Updated 2 years ago
sony / bigvsan_eval
View on GitHub
Evaluation tool used in the BigVSAN paper
☆14Mar 22, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
luojie1024 / MossQA-mnbvc
View on GitHub
本项目主要对开源的MOSS SFT数据进行整理，转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面，共353w样本，MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数，共630w样本，
☆13Dec 3, 2023Updated 2 years ago
magicse / ncnn-colorization-siggraph17
View on GitHub
☆45Jun 23, 2023Updated 3 years ago
zjumml / DiffSinger
View on GitHub
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
☆10Mar 8, 2022Updated 4 years ago
mubtasimahasan / DM-Codec
View on GitHub
Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”
☆57Jun 1, 2025Updated last year
fishaudio / vocoder
View on GitHub
☆131Updated this week
kyegomez / MobileVLM
View on GitHub
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆15Mar 11, 2024Updated 2 years ago
dense-analysis / ranges
View on GitHub
Range-based algorithms in Go
☆14Sep 10, 2023Updated 2 years ago