xinliu9451 / awesome-denoiser
This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very friendly for beginners.
☆34Updated 3 months ago
Alternatives and similar repositories for awesome-denoiser:
Users that are interested in awesome-denoiser are comparing it to the libraries listed below
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆37Updated last week
- ☆39Updated last month
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆61Updated 2 weeks ago
- Open TTS models, built for streaming on the edge☆39Updated 2 weeks ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆68Updated 5 months ago
- ☆25Updated 5 months ago
- StyleTTS 2 Optimized Training Fork☆26Updated last month
- a Frontier Japanese Speech Generation net☆28Updated 3 weeks ago
- F5-TTS 推理加速,速度提升约4倍!☆64Updated 2 months ago
- VoiceBox neural network implementation☆105Updated 8 months ago
- Official Code for ParrotTTS☆48Updated 5 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆71Updated 7 months ago
- ☆56Updated 9 months ago
- Zero-Shot Emotion Style Transfer☆43Updated 11 months ago
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆238Updated 2 weeks ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆64Updated 5 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆85Updated last year
- Running the F5-TTS by ONNX Runtime☆135Updated this week
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 7 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆84Updated 3 months ago
- A collection of all our phonemeizers for dataset construction and inference☆22Updated last month
- CTC decoder with hotwords for ASR.☆17Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 5 months ago
- ☆103Updated last month
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆24Updated 9 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆17Updated 4 months ago
- The demo page of UniAudio☆33Updated last year
- The official Implementation of PeriodWave and PeriodWave-Turbo☆181Updated last month
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆25Updated this week
- The official implementation of EmoSphere++☆80Updated 2 weeks ago