harmlessman / PAFTSLinks
PAFTS : Library That Preprocessing Audio For TTS.
☆25Updated last year
Alternatives and similar repositories for PAFTS
Users that are interested in PAFTS are comparing it to the libraries listed below
Sorting:
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆113Updated 3 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆316Updated 4 years ago
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)☆76Updated last year
- A curated list of awesome voice activity detection☆71Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch☆132Updated last month
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Updated last year
- High quality text-to-speech based on StyleTTS 2.☆71Updated last month
- Tacotron2 + LPCNET for complete End-to-End TTS System☆93Updated 2 years ago
- ☆36Updated last month
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆184Updated last year
- Predicts the level of noise and reverberation on your audiofiles☆177Updated 7 months ago
- 로봇의 감정 및 개성을 표현할 수 있는 대화형 음성합성 오픈소스 플랫폼☆108Updated last year
- ONNX Inference of Pyannote Segmentation☆97Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆258Updated last year
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆121Updated 3 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆265Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Updated 2 years ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆146Updated 3 years ago
- A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.☆45Updated 2 years ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆179Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆69Updated 2 years ago
- A sequence-to-sequence voice conversion toolkit.☆108Updated last year
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆104Updated 10 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆113Updated 2 months ago
- ☆80Updated 6 months ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆266Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference☆30Updated 5 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Updated 4 years ago
- ☆15Updated last year