dimasikson / audio-cleaning-deep-learning
Deep Learning model that cleans audio of empty audio, miscellaneous sounds, etc. Suitable for podcast editing.
☆37Updated 3 years ago
Alternatives and similar repositories for audio-cleaning-deep-learning:
Users that are interested in audio-cleaning-deep-learning are comparing it to the libraries listed below
- Zero-shot Audio Classification using Whisper☆78Updated 2 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆32Updated last year
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.☆35Updated last year
- Speaker diarization service☆21Updated this week
- Clone a voice in a few seconds to generate arbitrary speech in real-time in multiple languages☆53Updated last year
- DEPRECATED version of SoundFile☆14Updated 4 years ago
- An AI model to remove noise from the input audio using deep learning model which predicts the type of noise present and filter it out fro…☆31Updated 4 years ago
- Tools to create your own voice dataset for TTS training☆66Updated 4 years ago
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for…☆153Updated 8 months ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- Simple PyTorch Denoisers for Waveform Audio☆34Updated 2 months ago
- Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEE…☆185Updated last year
- Fork of AudioLDM as a TuneFlow plugin☆39Updated last year
- AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 file…☆39Updated 5 months ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆46Updated 2 years ago
- Python library for audio augmentation☆83Updated last year
- Easily turn large sets of audio urls to an audio dataset.☆20Updated 2 years ago
- Python Audio Separator in Real Time using MDX-NET model☆18Updated last year
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.☆51Updated last year
- On-device noise suppression powered by deep learning☆66Updated last week
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆193Updated 2 years ago
- A Python library that can apply: darth vader, echo, radio, robotic, and ghost effects to audio samples.☆57Updated last year
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆45Updated 7 months ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated last year
- List of repositories relevant to VITS.☆36Updated last year
- Create an LJSpeech structured voice dataset on wave input☆26Updated 4 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆57Updated last week