JJWRoeloffs / transcribe_align_textgridLinks
A small wrapper package around whisper-timestamped. Create force-aligned transcription TextGrids from raw audio!
☆16Updated 2 months ago
Alternatives and similar repositories for transcribe_align_textgrid
Users that are interested in transcribe_align_textgrid are comparing it to the libraries listed below
Sorting:
- ☆40Updated 3 years ago
- ☆80Updated last year
- SelfRemaster: SSL Speech Restoration☆88Updated last year
- Official Code for ParrotTTS☆51Updated 7 months ago
- Just another FastSpeech 2 but cleaner code :)☆26Updated 11 months ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆43Updated 2 years ago
- A Chinese version of A Neural Parametric Singing Synthesizer☆12Updated 3 years ago
- ☆56Updated 2 years ago
- ☆13Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆76Updated last year
- multilingual speech aligner☆74Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆54Updated last year
- ☆29Updated last year
- This code is to run the WARP-Q speech quality metric.☆35Updated 7 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- ☆26Updated 4 months ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆28Updated 3 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- MFA acoustic model training based on Opencpop☆15Updated 2 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆51Updated 9 months ago
- ☆24Updated last month
- ☆100Updated 9 months ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆33Updated 4 years ago
- ☆57Updated 2 years ago
- How to use our public wav2vec2 age and gender model☆41Updated last year
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- Phoneme alignment representation compatible with multiple forced aligners☆21Updated last year
- ☆29Updated last year