Open-source text-to-speech for European languages with voice cloning
☆231Feb 6, 2026Updated last month
Alternatives and similar repositories for kugelaudio-open
Users that are interested in kugelaudio-open are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Echo-TTS inference codebase☆145Dec 5, 2025Updated 3 months ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆18Nov 19, 2025Updated 4 months ago
- ☆179Nov 8, 2025Updated 4 months ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆18Jan 15, 2026Updated 2 months ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Jan 29, 2022Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆36Sep 9, 2025Updated 6 months ago
- feature-rich web interface designed to interact with a local ComfyUI☆76Dec 10, 2025Updated 3 months ago
- MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection☆26May 29, 2025Updated 9 months ago
- Soprano-Factory: Train your own 2000x realtime text-to-speech model☆213Jan 13, 2026Updated 2 months ago
- LEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10 languages: Chinese English Spanish Russian French German Ital…☆94Jan 14, 2026Updated 2 months ago
- whitebox is a Prometheus exporter that provides availability monitoring of external VPN services powered by VMESS, VLESS, TROJAN, WireGua…☆67Feb 22, 2026Updated last month
- python package for calculating famous measures in computational linguistics☆15Nov 5, 2024Updated last year
- A QGIS plugin for tree monitoring using AI.☆20Sep 19, 2025Updated 6 months ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆76Jan 25, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆35Oct 23, 2025Updated 5 months ago
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 9 months ago
- 🏠 SSH, but each user gets their own microVM☆104Oct 5, 2025Updated 5 months ago
- ☆40Jul 15, 2025Updated 8 months ago
- Train text generation model with JavaScript.☆15Jul 14, 2024Updated last year
- ☆12Jan 5, 2024Updated 2 years ago
- Unified Repository for all Browning Trail Camera Reversing☆14May 1, 2025Updated 10 months ago
- ☆16Jan 26, 2023Updated 3 years ago
- MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…☆166Mar 6, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Reflection Removal through Efficient Adaptation of Diffusion Transformers☆122Dec 5, 2025Updated 3 months ago
- The official code for paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressions…☆22Dec 12, 2024Updated last year
- NeMo: a toolkit for conversational AI☆12Dec 23, 2022Updated 3 years ago
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆51Sep 20, 2025Updated 6 months ago
- ☆16Aug 1, 2024Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- Add n-gram and large language model (LLM) support to Whisper models.☆41May 6, 2025Updated 10 months ago
- UUGear Web Interface is an extremely lightweight web server that allows you to access your Raspberry Pi and UUGear devices in web browser…☆12Apr 26, 2024Updated last year
- ggCorpIdent: A package for ggplot2 graphics in corporate design with custom fonts, colors and logo.☆13Apr 4, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.☆65Dec 9, 2025Updated 3 months ago
- MOSS-Speech is a true speech-to-speech large language model without text guidance.☆127Feb 13, 2026Updated last month
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆28Apr 23, 2024Updated last year
- ☆100Jan 19, 2026Updated 2 months ago
- A real-time and multilingual speech translation model☆219Feb 13, 2026Updated last month
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Mar 21, 2025Updated last year
- Combine MPEG DASH MPD manifest files☆11Apr 7, 2023Updated 2 years ago