louis-she / gradio-logLinks
A Gradio component designed to continuously show any logs.
☆54Updated last year
Alternatives and similar repositories for gradio-log
Users that are interested in gradio-log are comparing it to the libraries listed below
Sorting:
- ONNX implementation of Whisper. PyTorch free.☆102Updated last year
- Running the F5-TTS by ONNX Runtime☆186Updated last month
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated last year
- audiolm-pytorch training code☆15Updated 2 years ago
- A lightweight end-to-end text-to-speech model☆125Updated 10 months ago
- ONNX and TensorRT implementation of Whisper☆66Updated 2 years ago
- RTVC: Real-Time Voice Conversion GUI☆59Updated 2 years ago
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆39Updated 9 months ago
- a Frontier Japanese Speech Generation net☆59Updated 7 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆31Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 10 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆131Updated 4 months ago
- RVC Onnx Infer- Upgraded and simplified-ish☆25Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated last year
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆91Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆104Updated last year
- F5-TTS 推理加速,速度提升约4倍!☆121Updated 11 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Updated 4 years ago
- ☆261Updated last year
- ☆58Updated last year
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Updated last year
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…☆25Updated last year
- ☆62Updated last year
- 🌻 VITS ONNX TTS server designed for fast inference 🔥☆129Updated 10 months ago
- StyleTTS 2 Optimized Training Fork☆33Updated 10 months ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆128Updated 2 years ago
- An unofficial PyTorch implementation of VALL-E☆88Updated 4 months ago
- ☆51Updated this week
- Misc. tools/scripts that I made to use for tortoise☆21Updated last year