Charsiu: A neural phonetic aligner.
☆340Sep 19, 2022Updated 3 years ago
Alternatives and similar repositories for charsiu
Users that are interested in charsiu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multilingual G2P in 100 languages☆382May 26, 2023Updated 2 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆362Dec 24, 2021Updated 4 years ago
- High-Fidelity Neural Phonetic Posteriorgrams☆122Feb 23, 2025Updated last year
- Segment an audio file and obtain utterance alignments. (Python package)☆346May 15, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Command line utility for forced alignment using Kaldi☆1,796Mar 31, 2026Updated 2 weeks ago
- Official implementation of the source-filter HiFiGAN vocoder☆271Jul 29, 2023Updated 2 years ago
- Simple text to phones converter for multiple languages☆1,531Sep 26, 2024Updated last year
- g2p: English Grapheme To Phoneme Conversion☆917Jan 5, 2023Updated 3 years ago
- ☆198May 3, 2024Updated last year
- A differentiable version of SPTK☆197Mar 26, 2026Updated 3 weeks ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆347Jan 18, 2026Updated 3 months ago
- Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner☆44Jun 22, 2021Updated 4 years ago
- PyTorch Implementation of Multi-Singer (ACM-MM'21)☆139May 8, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆415Aug 29, 2023Updated 2 years ago
- Phonetisaurus G2P☆516Jun 1, 2024Updated last year
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆62Jun 8, 2021Updated 4 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆329Sep 24, 2022Updated 3 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Nov 18, 2021Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆83Nov 13, 2021Updated 4 years ago
- ☆260May 15, 2023Updated 2 years ago
- multilingual speech aligner☆76Nov 19, 2023Updated 2 years ago
- pytorch implementation of DNN-HSMM for TTS☆71Mar 14, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- a compact audio-to-phoneme aligner for singing voice☆12Jan 17, 2024Updated 2 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆268Jan 13, 2025Updated last year
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year
- A Python wrapper for the high-quality vocoder "World"☆784Jan 21, 2025Updated last year
- Official Implementation of StyleTTS☆462Jan 13, 2025Updated last year
- Official implementation of SawSing (ISMIR'22)☆275Aug 28, 2022Updated 3 years ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- Massively multilingual pronunciation mining☆365Updated this week
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 3 years ago
- ☆171Jul 25, 2022Updated 3 years ago
- An opensource music processing toolkit☆320Jun 25, 2023Updated 2 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆707Jul 12, 2022Updated 3 years ago
- Pytorch implementation of the CREPE pitch tracker☆513May 16, 2025Updated 11 months ago
- Modified Python3 P2FA for Mandarin☆10Sep 21, 2020Updated 5 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆211May 30, 2025Updated 10 months ago