harmlessman / PAFTSLinks
PAFTS : Library That Preprocessing Audio For TTS.
☆25Updated last year
Alternatives and similar repositories for PAFTS
Users that are interested in PAFTS are comparing it to the libraries listed below
Sorting:
- A curated list of awesome voice activity detection☆71Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆265Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆184Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch☆132Updated last month
- Predicts the level of noise and reverberation on your audiofiles☆177Updated 7 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 3 years ago
- Official repository of SepReformer for speech separation☆243Updated last year
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆316Updated 4 years ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆179Updated last year
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆266Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆258Updated last year
- ☆98Updated last week
- ☆94Updated last year
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)☆76Updated last year
- General Speech Restoration☆283Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆97Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆124Updated 3 years ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆175Updated 9 months ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆160Updated 4 years ago
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆235Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆69Updated 2 years ago
- Python Wrapper for RnNoise v0.2☆74Updated 3 weeks ago
- Unofficial implementation of NVIDIA P-Flow TTS paper☆231Updated last year
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆105Updated last year
- Target Speaker Extraction Toolkit☆244Updated 4 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆112Updated 2 months ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆109Updated 3 years ago
- Mirror of hf.co/pyannote/speaker-diarization-3.1☆29Updated 2 years ago