shahules786 / mayavoz
PyTorch-based speech enhancement toolkit.
☆336Updated last year
Alternatives and similar repositories for mayavoz:
Users that are interested in mayavoz are comparing it to the libraries listed below
- Desktop application for neural speech synthesis written in C++☆214Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆247Updated last year
- This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.☆346Updated last year
- General Speech Restoration☆275Updated last year
- Performant and accurate speech recognition built on Pytorch☆252Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆201Updated last week
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆361Updated last year
- [WIP] VoiceSmith makes training text to speech models easy.☆224Updated 2 years ago
- Conformer-based Metric GAN for speech enhancement☆346Updated 10 months ago
- ☆351Updated 11 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022☆284Updated last year
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech☆331Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆145Updated 10 months ago
- A vocal pitch correction web application (like Autotune)☆310Updated 2 years ago
- Faster Tortoise inference than Tortoise Fast Fork☆128Updated 10 months ago
- Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech☆338Updated 2 years ago
- General Speech Restoration☆1,102Updated 3 weeks ago
- This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆219Updated 7 months ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆240Updated 7 months ago
- Pytorch implementation of the CREPE pitch tracker☆429Updated 8 months ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆212Updated last year
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆576Updated last year
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆291Updated 3 years ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆195Updated 2 years ago
- On-device speech-to-text engine powered by deep learning☆448Updated 3 weeks ago
- DLAS - A configuration-driven trainer for generative models☆138Updated 2 years ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆339Updated last year
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…☆313Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.☆306Updated 3 months ago