resemble-ai / resemble-enhance
AI powered speech denoising and enhancement
☆1,641Updated 2 months ago
Alternatives and similar repositories for resemble-enhance:
Users that are interested in resemble-enhance are comparing it to the libraries listed below
- General Speech Restoration☆1,085Updated this week
- Official implementation of "Separate Anything You Describe"☆1,686Updated 2 months ago
- Versatile audio super resolution (any -> 48kHz) with AudioSR.☆1,330Updated last week
- ☆1,111Updated last week
- Controllable and fast Text-to-Speech for over 7000 languages!☆1,545Updated 3 months ago
- TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5,…☆2,016Updated this week
- A nearly-live implementation of OpenAI's Whisper.☆2,448Updated 2 weeks ago
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis☆882Updated 6 months ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone☆939Updated 3 months ago
- A python package to build AI-powered real-time audio applications☆1,186Updated last week
- Multilingual Automatic Speech Recognition with word-level timestamps and confidence☆2,244Updated last week
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆3,741Updated last month
- [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching☆871Updated this week
- The code for the bark-voicecloning model. Training and inference.☆684Updated last year
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion☆649Updated last month
- Noise supression using deep filtering☆2,825Updated 4 months ago
- An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…☆2,251Updated last week
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch☆638Updated 4 months ago
- An Open Source text-to-speech system built by inverting Whisper.☆4,120Updated 2 months ago
- Generative models for conditional audio generation☆2,896Updated last month
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆584Updated 2 months ago
- Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (pr…☆641Updated last month
- Interface for OuteTTS models.☆923Updated this week
- VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design☆536Updated last year
- ☆719Updated 3 months ago
- WavJourney: Compositional Audio Creation with LLMs☆531Updated last year
- Text-to-Audio/Music Generation☆2,370Updated 4 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆5,465Updated 6 months ago
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆575Updated last year
- Converts text to speech in realtime☆2,533Updated this week