shahules786 / mayavoz
Pytorch based speech enhancement toolkit.
☆329Updated 6 months ago
Related projects: ⓘ
- Performant and accurate speech recognition built on Pytorch☆247Updated 2 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆234Updated last year
- A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)☆464Updated 7 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆132Updated last year
- This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.☆323Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.☆272Updated 2 months ago
- Desktop application for neural speech synthesis written in C++☆210Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆421Updated 10 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆132Updated 4 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆312Updated 6 months ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone☆878Updated last year
- [WIP] VoiceSmith makes training text to speech models easy.☆217Updated last year
- A real-time transcription project using React and socketio☆144Updated last year
- On-device speech-to-text engine powered by deep learning☆427Updated this week
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech☆329Updated 2 years ago
- Voice models for Mimic 3 text to speech system☆121Updated 2 months ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆315Updated 7 months ago
- ☆208Updated this week
- Official Implementation of StyleTTS☆387Updated 9 months ago
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice …☆496Updated last year
- Gecko - A Tool for Effective Annotation of Human Conversations☆274Updated last year
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆335Updated last year
- General Speech Restoration☆995Updated 3 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆234Updated last year
- Audio waveform visualisation, converts any audio to a nice video☆210Updated 8 months ago
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆551Updated last year
- A vocal pitch correction web application (like Autotune)☆296Updated last year
- Faster Tortoise inference then Tortoise Fast Fork☆122Updated 4 months ago
- Noise supression using deep filtering☆2,354Updated last month
- Tutorial covering Open Source tools for Source Separation.☆351Updated 3 months ago