rhasspy / espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
☆17Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for espeak-ng
- C++ library for converting text to phonemes for Piper☆89Updated 8 months ago
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆31Updated 3 months ago
- A fast MP3 decoder for python, using minimp3☆26Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆24Updated last year
- ☆18Updated 2 years ago
- WaveRNN Vocoder + TTS☆11Updated 3 years ago
- TTS Client for Coqui TTS server☆13Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆24Updated last year
- Interact with GPT-3 through speech☆13Updated last year
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆33Updated last year
- On-device noise suppression powered by deep learning☆63Updated last month
- Open models for Coqui STT☆122Updated last year
- 🐸Coqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video ga…☆39Updated last year
- An even smaller speech recognizer / force aligner☆32Updated this week
- Port of Suno AI's Bark in C/C++ for fast inference☆54Updated 7 months ago
- Real-time end-to-end singing voice convertion☆18Updated 3 weeks ago
- Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files☆18Updated 2 months ago
- The demo page of AudioGPT☆20Updated 3 months ago
- Automatic background removal from an input video and a single user subject selection☆10Updated 2 months ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Heteronym to Phoneme Parser☆15Updated last year
- Port of Meta's Encodec in C/C++☆203Updated last week
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese, German and Ea…☆13Updated 3 years ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆95Updated this week
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆65Updated 2 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated 2 years ago
- A ggml (C++) re-implementation of tortoise-tts☆160Updated 3 months ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos☆45Updated 4 months ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆13Updated 11 months ago