🇺🇦 Open Source Ukrainian Text-to-Speech datasets
☆26Feb 24, 2025Updated last year
Alternatives and similar repositories for ukrainian-tts-datasets
Users that are interested in ukrainian-tts-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Forced alignment decoder for Whisper.☆15Mar 13, 2024Updated 2 years ago
- Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)☆11Jul 12, 2019Updated 6 years ago
- ☆36Sep 6, 2025Updated 8 months ago
- For audio visualization and playback in Jupyter notebooks.☆17Nov 25, 2025Updated 5 months ago
- A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)☆38Mar 31, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- English-Chinese-Japanese translation dataset of the terms in Genshin Impact☆41Apr 30, 2026Updated last week
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆21Dec 17, 2020Updated 5 years ago
- ☆23Oct 17, 2024Updated last year
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- Tool to make high quality text to speech (tts) corpus from audio + text books.☆27Jul 31, 2025Updated 9 months ago
- ncnn HiFi-GAN☆30Sep 29, 2024Updated last year
- Google Scholar自搜小脚本,每次开启命令行即显示当前citation。Small Script displaying current citation count each time the shell is opened.☆21Mar 3, 2025Updated last year
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆36May 7, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆114Jun 4, 2025Updated 11 months ago
- Adds word stress to Ukrainian texts☆61Sep 29, 2024Updated last year
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆61Sep 5, 2025Updated 8 months ago
- A lightweight audio codec based on a single quantizer☆69Aug 15, 2025Updated 8 months ago
- The GFPGAN network consists of two networks. Actually GFPGAN and StyleGAN2☆41Sep 22, 2022Updated 3 years ago
- Transformation spoken text to written text☆31May 14, 2024Updated last year
- Android使用SoundTouch实现音频的变调变速☆31Dec 21, 2019Updated 6 years ago
- GPT-style network for phonemization with durations of text☆69Mar 21, 2024Updated 2 years ago
- source code of EfficientTTS 2☆20Feb 18, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆26Feb 11, 2026Updated 3 months ago
- 使用 cutlass 实现 flash-attention 精简版, 具有教学意义☆59Aug 12, 2024Updated last year
- Emacs 中看 B 站☆11Jul 27, 2025Updated 9 months ago
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Jan 6, 2023Updated 3 years ago
- ☆13Sep 12, 2024Updated last year
- ☆45Jun 11, 2024Updated last year
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- 本项目主要对开源的MOSS SFT数据进行整理 ,转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面,共353w样本,MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数,共630w样本,☆13Dec 3, 2023Updated 2 years ago
- ☆45Jun 23, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 4 years ago
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆57Jun 1, 2025Updated 11 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- ☆132Apr 6, 2026Updated last month
- A breakdown of NCNN☆43Dec 28, 2020Updated 5 years ago
- ☆25Mar 6, 2024Updated 2 years ago
- ☆150Apr 25, 2025Updated last year