Clone a voice in a few seconds to generate arbitrary speech in real time in multiple languages
☆55Mar 21, 2023Updated 3 years ago
Alternatives and similar repositories for TTS-With-Voice-Cloning-Multilang
Users who are interested in TTS-With-Voice-Cloning-Multilang are comparing it to the libraries listed below.
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference☆14Mar 17, 2022Updated 4 years ago
- ☆10Mar 22, 2024Updated 2 years ago
- ☆13Jun 10, 2021Updated 4 years ago
- Can Neural Networks reconstruct missing audio data? What about GANs?☆18Nov 6, 2019Updated 6 years ago
- Crawling and creating a German language model resource☆18Aug 23, 2022Updated 3 years ago
- Scripts for training Kaldi for German speech recognition (ASR).☆27Feb 11, 2021Updated 5 years ago
- ☆13Sep 12, 2024Updated last year
- ☆13Dec 7, 2022Updated 3 years ago
- Normalize Text in Russian☆28Nov 7, 2023Updated 2 years ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆32Mar 3, 2026Updated 3 weeks ago
- speech to text gui for different (mostly Whisper, also Voxtral) models and backends, including whisper.cpp, mlx-whisper, faster-whisper, …☆11Dec 7, 2025Updated 3 months ago
- Generation of musical phrases that receive maximum score according to configurable evaluational criteria.☆13Oct 17, 2023Updated 2 years ago
- ☆10Mar 20, 2021Updated 5 years ago
- ☆21Jul 15, 2024Updated last year
- Infrastructure useful to create natural language processing systems based on transformer networks☆12Sep 26, 2019Updated 6 years ago
- Using image caption models to extract prompts in ComfyUI☆11May 21, 2025Updated 10 months ago
- GPT Prompt Trainer for gpt-3.5-turbo language model☆12Apr 5, 2023Updated 2 years ago
- A video highlights creator☆12Jun 1, 2024Updated last year
- Least-squares Reverse Time Migration using 1D scalar wave equation. Very simple and for demonstration purposes only.☆10Sep 4, 2017Updated 8 years ago
- An advanced AI-powered tool that automatically translates and dubs YouTube videos into different languages while dynamically adjusting vi…☆16Nov 9, 2024Updated last year
- Survey on speech generation work.☆21Nov 26, 2023Updated 2 years ago
- Extract dominant or complementary color palettes from images. Convert colors to English names suitable for txt2img prompts.☆16Jan 5, 2025Updated last year
- Extract individual frames from a video as png images (android)☆13Dec 30, 2022Updated 3 years ago
- Algorithmic composition of modern classical music in the twelve-tone technique.☆14May 10, 2025Updated 10 months ago
- An easy-to-use GUI addon for whisper-standalone-win. Designed for those who prefer a simple interface over typing commands and file paths…☆13Dec 26, 2023Updated 2 years ago
- Skribify is a powerful transcription and summarization tool that leverages the power of OpenAI's GPT-4 and WhisperAI to generate concise …☆12Apr 29, 2025Updated 11 months ago
- ☆18Mar 13, 2024Updated 2 years ago
- ☆30Apr 12, 2022Updated 3 years ago
- T5-based (russian) text normalization☆26Jan 25, 2024Updated 2 years ago
- A powerful extension for ComfyUI that enables adding notes to any node in your workflow.☆13Apr 20, 2025Updated 11 months ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- A real time offline transcriber with gui, based on OpenAI whisper☆16Dec 25, 2025Updated 3 months ago
- Amlogic update tool☆11Jul 21, 2024Updated last year
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- Spiking neural networks (SNNs) for speech classification☆12Mar 14, 2022Updated 4 years ago
- An advanced midi router for MacOS☆12Dec 26, 2024Updated last year
- ☆11Oct 24, 2021Updated 4 years ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆16Feb 21, 2025Updated last year
- ☆13Oct 27, 2025Updated 5 months ago