🇺🇦 Open Source Ukrainian Text-to-Speech datasets
☆29Feb 24, 2025Updated last year
Alternatives and similar repositories for ukrainian-tts-datasets
Users that are interested in ukrainian-tts-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Forced alignment decoder for Whisper.☆16Mar 13, 2024Updated 2 years ago
- Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)☆11Jul 12, 2019Updated 6 years ago
- ☆36Sep 6, 2025Updated 8 months ago
- For audio visualization and playback in Jupyter notebooks.☆17Nov 25, 2025Updated 6 months ago
- A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)☆38Mar 31, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- English-Chinese-Japanese translation dataset of the terms in Genshin Impact☆41Updated this week
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆21Dec 17, 2020Updated 5 years ago
- ☆23Oct 17, 2024Updated last year
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- Tool to make high quality text to speech (tts) corpus from audio + text books.☆27Jul 31, 2025Updated 10 months ago
- ncnn HiFi-GAN☆30Sep 29, 2024Updated last year
- Google Scholar自搜小脚本,每次开启命令行即显示当前citation。Small Script displaying current citation count each time the shell is opened.☆21Mar 3, 2025Updated last year
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆36May 7, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆115Jun 4, 2025Updated 11 months ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆61Sep 5, 2025Updated 8 months ago
- A lightweight audio codec based on a single quantizer☆70Aug 15, 2025Updated 9 months ago
- The GFPGAN network consists of two networks. Actually GFPGAN and StyleGAN2☆41Sep 22, 2022Updated 3 years ago
- Transformation spoken text to written text☆31May 14, 2024Updated 2 years ago
- Android使用SoundTouch实现音频的变调变速☆31Dec 21, 2019Updated 6 years ago
- GPT-style network for phonemization with durations of text☆69Mar 21, 2024Updated 2 years ago
- source code of EfficientTTS 2☆21Feb 18, 2024Updated 2 years ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆27Feb 11, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 使用 cutlass 实现 flash-attention 精简版,具有教学意义☆59Aug 12, 2024Updated last year
- Emacs 中看 B 站☆10Jul 27, 2025Updated 10 months ago
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Jan 6, 2023Updated 3 years ago
- ☆13Sep 12, 2024Updated last year
- ☆45Jun 11, 2024Updated last year
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- 本项目主要对开源的MOSS SFT数据进行整理 ,转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面,共353w样本,MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数,共630w样本,☆13Dec 3, 2023Updated 2 years ago
- ☆45Jun 23, 2023Updated 2 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆57Jun 1, 2025Updated 11 months ago
- ☆132May 4, 2026Updated 3 weeks ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Range-based algorithms in Go☆14Sep 10, 2023Updated 2 years ago
- ☆25Mar 6, 2024Updated 2 years ago
- A breakdown of NCNN☆43Dec 28, 2020Updated 5 years ago
- ☆151Apr 25, 2025Updated last year