Charsiu: A neural phonetic aligner.
☆332Sep 19, 2022Updated 3 years ago
Alternatives and similar repositories for charsiu
Users that are interested in charsiu are comparing it to the libraries listed below
Sorting:
- Multilingual G2P in 100 languages☆378May 26, 2023Updated 2 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆361Dec 24, 2021Updated 4 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- Official implementation of the source-filter HiFiGAN vocoder☆268Jul 29, 2023Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆345May 15, 2024Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams☆122Feb 23, 2025Updated last year
- Command line utility for forced alignment using Kaldi☆1,757Feb 24, 2026Updated last week
- A differentiable version of SPTK☆193Feb 26, 2026Updated last week
- ☆197May 3, 2024Updated last year
- PyTorch Implementation of Multi-Singer (ACM-MM'21)☆139May 8, 2022Updated 3 years ago
- g2p: English Grapheme To Phoneme Conversion☆911Jan 5, 2023Updated 3 years ago
- Simple text to phones converter for multiple languages☆1,515Sep 26, 2024Updated last year
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆415Aug 29, 2023Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆148Apr 5, 2024Updated last year
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆83Nov 13, 2021Updated 4 years ago
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆151Feb 11, 2023Updated 3 years ago
- ☆259May 15, 2023Updated 2 years ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆344Jan 18, 2026Updated last month
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆328Sep 24, 2022Updated 3 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆268Jan 13, 2025Updated last year
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆223Oct 20, 2023Updated 2 years ago
- Official implementation of SawSing (ISMIR'22)☆272Aug 28, 2022Updated 3 years ago
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆368Sep 3, 2024Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 2 years ago
- A Python wrapper for the high-quality vocoder "World"☆779Jan 21, 2025Updated last year
- ☆171Jul 25, 2022Updated 3 years ago
- multilingual speech aligner☆76Nov 19, 2023Updated 2 years ago
- Pytorch implementation of the CREPE pitch tracker☆508May 16, 2025Updated 9 months ago
- Phonetisaurus G2P☆507Jun 1, 2024Updated last year
- A suite of speech signal processing tools☆243Feb 3, 2026Updated last month
- A repository for benchmarking neural vocoders by their quality and speed.☆211May 30, 2025Updated 9 months ago
- Grapheme to phoneme conversion with deep learning.☆421Dec 8, 2023Updated 2 years ago
- Massively multilingual pronunciation mining☆363Updated this week
- Official Implementation of StyleTTS☆462Jan 13, 2025Updated last year
- PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.☆330Feb 9, 2024Updated 2 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆62Jun 8, 2021Updated 4 years ago