harmlessman / PAFTS
PAFTS: A Library for Preprocessing Audio for TTS.
☆24 Updated last year
Alternatives and similar repositories for PAFTS
Users who are interested in PAFTS are comparing it to the libraries listed below.
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram ☆265 Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch ☆131 Updated 2 weeks ago
- ONNX Inference of Pyannote Segmentation ☆97 Updated last year
- Python Wrapper for RnNoise v0.2 ☆73 Updated 3 weeks ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021 ☆157 Updated 4 years ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E ☆135 Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆56 Updated 7 months ago
- Predicts the level of noise and reverberation on your audiofiles ☆174 Updated 6 months ago
- Official repository of SepReformer for speech separation ☆236 Updated 11 months ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech ☆113 Updated 3 years ago
- Unofficial implementation of NVIDIA P-Flow TTS paper ☆231 Updated last year
- Tacotron2 + LPCNET for complete End-to-End TTS System ☆93 Updated 2 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,… ☆315 Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆119 Updated 2 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ☆264 Updated 11 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. ☆178 Updated last year
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH) ☆76 Updated last year
- General Speech Restoration ☆283 Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO ☆67 Updated 3 years ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations ☆183 Updated last year
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes. ☆90 Updated 9 months ago
- An LLM-based TTS engine ☆114 Updated 2 weeks ago
- Ultimate Vocal Remover Inference CLI ☆107 Updated 11 months ago
- A curated list of speaker-embedding, speaker-verification, and speaker-identification resources. ☆52 Updated 4 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆124 Updated 3 years ago
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for… ☆180 Updated last year
- Target Speaker Extraction Toolkit ☆238 Updated 3 months ago
- ☆94 Updated 2 months ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆151 Updated 7 months ago
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions ☆94 Updated 2 years ago