harmlessman / PAFTSLinks
PAFTS : Library That Preprocessing Audio For TTS.
☆21Updated 8 months ago
Alternatives and similar repositories for PAFTS
Users that are interested in PAFTS are comparing it to the libraries listed below
Sorting:
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- ☆25Updated 3 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆27Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated last month
- High quality text-to-speech based on StyleTTS 2.☆52Updated this week
- g2p ID: Indonesian Grapheme-to-Phoneme Converter☆24Updated 7 months ago
- Phoneme alignment representation compatible with multiple forced aligners☆21Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆27Updated 2 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆22Updated 8 months ago
- ☆57Updated last year
- Bilingual-TTS (Japanese and Korean)☆30Updated 2 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- Convert English text from written expressions into spoken forms☆25Updated 3 years ago
- Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion☆20Updated 6 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 3 months ago
- A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.☆45Updated 2 years ago
- ☆17Updated 2 years ago
- ☆29Updated last year
- Colab notebooks for Next-gen Kaldi☆28Updated 3 months ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆116Updated 2 years ago
- A handy dataset of noises for ASR☆21Updated 6 years ago
- This is a legacy repo. Dev occurs now on GitHub.☆11Updated 4 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆18Updated 2 months ago
- ☆80Updated last year
- ☆40Updated 10 months ago
- Audio Diarization Annotation tool☆29Updated 5 years ago