louis-she / gradio-logLinks
A Gradio component designed to continuously show any logs.
☆51Updated 10 months ago
Alternatives and similar repositories for gradio-log
Users that are interested in gradio-log are comparing it to the libraries listed below
Sorting:
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…☆20Updated 10 months ago
- audiolm-pytorch training code☆15Updated 2 years ago
- 🌻 VITS ONNX TTS server designed for fast inference 🔥☆128Updated 9 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆88Updated 3 weeks ago
- Running the F5-TTS by ONNX Runtime☆179Updated last month
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 8 months ago
- Nue-ASR inference code by rinna Co., Ltd.☆35Updated last month
- A lightweight end-to-end text-to-speech model☆123Updated 8 months ago
- ☆57Updated last year
- ONNX and TensorRT implementation of Whisper☆64Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆106Updated 3 weeks ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆128Updated 2 months ago
- StyleTTS 2 Optimized Training Fork☆34Updated 8 months ago
- ☆262Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated last year
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆52Updated 10 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆45Updated 5 months ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆39Updated 7 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated last week
- ☆50Updated 2 weeks ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆31Updated last year
- whisper.cpp bindings for python☆106Updated 2 years ago
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Updated last year
- Collection of Open Source Speech Data☆161Updated 3 weeks ago
- ☆62Updated last year
- a Frontier Japanese Speech Generation net☆56Updated 5 months ago
- An unofficial PyTorch implementation of VALL-E☆88Updated 2 months ago