amirldn / rtx-voice-script
A Python script that takes an input MP3/FLAC and outputs an acapella/background-noise-stripped WAV using the power of NVIDIA's RTX Voice
☆91 Updated 6 months ago
Alternatives and similar repositories for rtx-voice-script
Users who are interested in rtx-voice-script are comparing it to the repositories listed below
- A Gradio setup for Tortoise TTS. ☆45 Updated 2 years ago
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project ☆33 Updated 2 years ago
- Desktop application for neural speech synthesis written in C++ ☆215 Updated 2 years ago
- A web app that lets you play around with TalkNet models ☆123 Updated 2 years ago
- real time japanese speech recognition translator using wav2vec2 ☆39 Updated 3 years ago
- ☆63 Updated 4 years ago
- ☆29 Updated 4 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions ☆52 Updated 2 years ago
- AI Video Processing/Upscaling With VapourSynth in Google Colab ☆113 Updated 5 years ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers. ☆68 Updated 3 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆119 Updated 2 years ago
- Package for aligning audio files through audio fingerprinting ☆126 Updated 5 months ago
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 Updated 2 years ago
- Uses machine learning to denoise audio containing speech ☆39 Updated last year
- NNSVSのモデルをUTAUで使えるようにするツール (UTAU plugin software powered by NNSVS) ☆95 Updated last week
- Synchronize Whisper's timestamps over an existing accurate transcription ☆155 Updated last year
- Google Colab for testing SoftVC VITS Singing Voice Conversion, an AI capable of changing the singer within music files. ☆12 Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision ☆29 Updated last year
- Text To Speech (TTS) GUI wrapper for NVIDIA Tacotron 2+Waveglow. For custom Twitch TTS. ☆37 Updated 5 years ago
- This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference. ☆45 Updated 2 years ago
- [WIP] VoiceSmith makes training text to speech models easy. ☆225 Updated 2 years ago
- Extract hardcoded subtitles from videos using machine learning ☆199 Updated 2 weeks ago
- An easy way to use anime4k in python ☆118 Updated 3 months ago
- ☆15 Updated 2 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆337 Updated 9 months ago
- ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI. ☆69 Updated 2 years ago
- RVC Inference with multiple model and huggingface support ☆106 Updated last year
- Ultimate Vocal Remover CLI type for Google Colab ☆58 Updated 2 weeks ago
- Chainer implementation of waifu2x ☆165 Updated 2 years ago
- GUI for a Vocal Remover that uses Deep Neural Networks. ☆17 Updated last year