🇺🇦 Open Source Ukrainian Text-to-Speech datasets
☆30Feb 24, 2025Updated last year
Alternatives and similar repositories for ukrainian-tts-datasets
Users that are interested in ukrainian-tts-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Forced alignment decoder for Whisper.☆16Mar 13, 2024Updated 2 years ago
- Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)☆11Jul 12, 2019Updated 6 years ago
- ☆36Sep 6, 2025Updated 9 months ago
- For audio visualization and playback in Jupyter notebooks.☆18Nov 25, 2025Updated 6 months ago
- A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)☆39Mar 31, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- English-Chinese-Japanese translation dataset of the terms in Genshin Impact☆41Jun 2, 2026Updated 2 weeks ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆21Dec 17, 2020Updated 5 years ago
- ☆23Oct 17, 2024Updated last year
- Fast Russian Text normalization for TTS using only RegEx.☆30Updated this week
- Tool to make high quality text to speech (tts) corpus from audio + text books.☆27Jul 31, 2025Updated 10 months ago
- ncnn HiFi-GAN☆30Sep 29, 2024Updated last year
- Google Scholar自搜小脚本,每 次开启命令行即显示当前citation。Small Script displaying current citation count each time the shell is opened.☆21Mar 3, 2025Updated last year
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- Exploring techniques for code refactoring with formal verification☆11Oct 27, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆38May 7, 2025Updated last year
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆118Jun 4, 2025Updated last year
- Adds word stress to Ukrainian texts☆62Sep 29, 2024Updated last year
- A template for using Elm Land to build desktop apps with Tauri!☆12Jun 20, 2023Updated 3 years ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆61Sep 5, 2025Updated 9 months ago
- A lightweight audio codec based on a single quantizer☆71Aug 15, 2025Updated 10 months ago
- The GFPGAN network consists of two networks. Actually GFPGAN and StyleGAN2☆41Sep 22, 2022Updated 3 years ago
- Transformation spoken text to written text☆31May 14, 2024Updated 2 years ago
- Android使用SoundTouch实现音频的变调变速☆32Dec 21, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- That Uno game we built in Elm live on Twitch!☆16Sep 9, 2022Updated 3 years ago
- GPT-style network for phonemization with durations of text☆69Mar 21, 2024Updated 2 years ago
- source code of EfficientTTS 2☆21Feb 18, 2024Updated 2 years ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆29Feb 11, 2026Updated 4 months ago
- MultiModal Audio Generation in Raw Waveform Space.☆153May 26, 2026Updated 3 weeks ago
- PiHub a WebSockets hub to control your Raspberry Pi microcomputer☆13Dec 8, 2022Updated 3 years ago
- Emacs 中看 B 站☆10Jul 27, 2025Updated 10 months ago
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Jan 6, 2023Updated 3 years ago
- ☆13Sep 12, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆45Jun 11, 2024Updated 2 years ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- 本项目主要对开源的MOSS SFT数据进行整理 ,转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面,共353w样本,MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数,共630w样本,☆13Dec 3, 2023Updated 2 years ago
- ☆45Jun 23, 2023Updated 2 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 4 years ago
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆57Jun 1, 2025Updated last year
- Content's best friend.☆22Apr 21, 2025Updated last year