amirldn / rtx-voice-scriptLinks
A python script that takes an input MP3/FLAC and outputs an acapella/background noise stripped WAV using the power of NVIDIA's RTX Voice
☆93Updated 2 weeks ago
Alternatives and similar repositories for rtx-voice-script
Users that are interested in rtx-voice-script are comparing it to the libraries listed below
Sorting:
- A Gradio setup for Tortoise TTS.☆45Updated 2 years ago
- real time japanese speech recognition translator using wav2vec2☆39Updated 3 years ago
- AI Video Processing/Upscaling With VapourSynth in Google Colab☆113Updated 5 years ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆72Updated 8 months ago
- RIFE interpolation script for google colab, and GUI for Windows or Linux☆51Updated 5 months ago
- ☆63Updated 5 years ago
- Open Source Text-to-Speech GUI Tool running on TalkNet☆11Updated 3 years ago
- Colaboratory Notebook for Ultimate Vocal Remover☆102Updated 3 weeks ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆29Updated 2 years ago
- Synchronize Whisper's timestamps over an existing accurate transcription☆160Updated last year
- openvino version of openai/whisper☆182Updated 2 years ago
- Desktop application for neural speech synthesis written in C++☆212Updated this week
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project☆34Updated 2 years ago
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆87Updated 2 years ago
- Auto transcribe tool based on whisper☆226Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- A web app that lets you play around with TalkNet models☆124Updated 2 years ago
- ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.☆69Updated 3 years ago
- Text To Speech (TTS) GUI wrapper for NVIDIA Tacotron 2+Waveglow. For custom Twitch TTS.☆37Updated 5 years ago
- Modified version of the PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆22Updated last month
- Uses machine learning to denoise audio containing speech☆49Updated last year
- ☆33Updated 9 months ago
- ☆76Updated 3 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆173Updated 3 years ago
- A quick experiment to achieve almost realtime transcription using Whisper.☆186Updated 3 years ago
- Clone a voice in a few seconds to generate arbitrary speech in real-time in multiple languages☆54Updated 2 years ago
- The BEST music separation model with help of A.I. ... to my ears ! 👂👂☆147Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆348Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- PyTorch-based Super-Resolution and Restoration Image Processing Module for VapourSynth☆199Updated last year