louis-she / gradio-log
A Gradio component designed to continuously show any logs.
☆49 Updated 8 months ago
Alternatives and similar repositories for gradio-log
Users who are interested in gradio-log are comparing it to the libraries listed below.
- Running the F5-TTS by ONNX Runtime ☆171 Updated this week
- ☆62 Updated last year
- ONNX implementation of Whisper. PyTorch free. ☆101 Updated 8 months ago
- Speech AI training and inference tools ☆36 Updated 2 years ago
- A lightweight end-to-end text-to-speech model ☆117 Updated 5 months ago
- stable-diffusion.cpp bindings for python ☆58 Updated last month
- 8-bit CUDA functions for PyTorch ☆43 Updated 2 years ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆159 Updated last year
- ☆39 Updated last year
- ONNX and TensorRT implementation of Whisper ☆64 Updated 2 years ago
- Let's try and finetune the OpenAI consistency decoder to work for SDXL ☆24 Updated last year
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the … ☆53 Updated 7 months ago
- audiolm-pytorch training code ☆15 Updated 2 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech ☆22 Updated last year
- ImageSlider custom component for gradio. ☆42 Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4o ☆22 Updated last year
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai… ☆38 Updated 4 months ago
- Voxtral: Convert Mistral into an end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP… ☆31 Updated 5 months ago
- Nue-ASR inference code by rinna Co., Ltd. ☆35 Updated last year
- RVC Onnx Infer- Upgraded and simplified-ish ☆21 Updated last year
- FRP Fork ☆175 Updated 4 months ago
- ☆83 Updated last year
- F5-TTS inference acceleration, roughly 4x faster! ☆104 Updated 7 months ago
- ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64) ☆219 Updated last year
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers. ☆69 Updated 2 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆99 Updated last week
- whisper.cpp bindings for python ☆101 Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions ☆51 Updated 4 years ago
- Open TTS models, built for streaming on the edge ☆43 Updated 5 months ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces. ☆120 Updated this week