🇺🇦 Open Source Ukrainian Text-to-Speech datasets
☆23Feb 24, 2025Updated last year
Alternatives and similar repositories for ukrainian-tts-datasets
Users that are interested in ukrainian-tts-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Forced alignment decoder for Whisper.☆15Mar 13, 2024Updated 2 years ago
- Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)☆11Jul 12, 2019Updated 6 years ago
- ☆36Sep 6, 2025Updated 7 months ago
- For audio visualization and playback in Jupyter notebooks.☆17Nov 25, 2025Updated 4 months ago
- A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)☆38Mar 31, 2026Updated 2 weeks ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- English-Chinese-Japanese translation dataset of the terms in Genshin Impact☆40Apr 8, 2026Updated last week
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆21Dec 17, 2020Updated 5 years ago
- ☆23Oct 17, 2024Updated last year
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- Tool to make high quality text to speech (tts) corpus from audio + text books.☆27Jul 31, 2025Updated 8 months ago
- ncnn HiFi-GAN☆29Sep 29, 2024Updated last year
- Google Scholar自搜小脚本,每次开启命令行即显示当前citation。Small Script displaying current citation count each time the shell is opened.☆21Mar 3, 2025Updated last year
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- Exploring techniques for code refactoring with formal verification☆11Oct 27, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆36May 7, 2025Updated 11 months ago
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆114Jun 4, 2025Updated 10 months ago
- A template for using Elm Land to build desktop apps with Tauri!☆12Jun 20, 2023Updated 2 years ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆61Sep 5, 2025Updated 7 months ago
- A lightweight audio codec based on a single quantizer☆69Aug 15, 2025Updated 8 months ago
- The GFPGAN network consists of two networks. Actually GFPGAN and StyleGAN2☆41Sep 22, 2022Updated 3 years ago
- Transformation spoken text to written text☆31May 14, 2024Updated last year
- Android使用SoundTouch实现音频的变调变速☆31Dec 21, 2019Updated 6 years ago
- That Uno game we built in Elm live on Twitch!☆15Sep 9, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- GPT-style network for phonemization with durations of text☆68Mar 21, 2024Updated 2 years ago
- source code of EfficientTTS 2☆20Feb 18, 2024Updated 2 years ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆24Feb 11, 2026Updated 2 months ago
- PiHub a WebSockets hub to control your Raspberry Pi microcomputer☆13Dec 8, 2022Updated 3 years ago
- ☆13Sep 12, 2024Updated last year
- Emacs 中看 B 站☆11Jul 27, 2025Updated 8 months ago
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Jan 6, 2023Updated 3 years ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- ☆45Jun 11, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 本项目主要对开源的MOSS SFT数据进行整理 ,转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面,共353w样本,MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数,共630w样本,☆12Dec 3, 2023Updated 2 years ago
- ☆45Jun 23, 2023Updated 2 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 4 years ago
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆57Jun 1, 2025Updated 10 months ago
- ☆132Apr 6, 2026Updated 2 weeks ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Content's best friend.☆21Apr 21, 2025Updated 11 months ago