Charsiu: A neural phonetic aligner.
☆345Sep 19, 2022Updated 3 years ago
Alternatives and similar repositories for charsiu
Users that are interested in charsiu are comparing it to the libraries listed below.
- Multilingual G2P in 100 languages☆386May 26, 2023Updated 3 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆365Dec 24, 2021Updated 4 years ago
- High-Fidelity Neural Phonetic Posteriorgrams☆123Feb 23, 2025Updated last year
- Segment an audio file and obtain utterance alignments. (Python package)☆347May 15, 2024Updated 2 years ago
- Command line utility for forced alignment using Kaldi☆1,835Updated this week
- Official implementation of the source-filter HiFiGAN vocoder☆272Jul 29, 2023Updated 2 years ago
- Simple text to phones converter for multiple languages☆1,555Sep 26, 2024Updated last year
- A differentiable version of SPTK☆201Jun 2, 2026Updated 2 weeks ago
- g2p: English Grapheme To Phoneme Conversion☆924Jan 5, 2023Updated 3 years ago
- ☆200May 3, 2024Updated 2 years ago
- A Python library for working with Praat, TextGrids, time-aligned audio transcripts, and audio files. It is primarily used for extracting …☆348Jan 18, 2026Updated 5 months ago
- Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner☆43Jun 22, 2021Updated 4 years ago
- PyTorch Implementation of Multi-Singer (ACM-MM'21)☆139May 8, 2022Updated 4 years ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆415Aug 29, 2023Updated 2 years ago
- Phonetisaurus G2P☆517Jun 1, 2024Updated 2 years ago
- Audio LPC (linear prediction code) using mel spectrogram, compatible with LPCNet☆62Jun 8, 2021Updated 5 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆328Sep 24, 2022Updated 3 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆190Nov 18, 2021Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆83Nov 13, 2021Updated 4 years ago
- ☆260May 15, 2023Updated 3 years ago
- multilingual speech aligner☆78Nov 19, 2023Updated 2 years ago
- PyTorch implementation of DNN-HSMM for TTS☆71Mar 14, 2021Updated 5 years ago
- a compact audio-to-phoneme aligner for singing voice☆12Jan 17, 2024Updated 2 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆269Jan 13, 2025Updated last year
- ICASSP 2023 Accepted☆191May 6, 2024Updated 2 years ago
- A Python wrapper for the high-quality vocoder "World"☆788Jan 21, 2025Updated last year
- Official Implementation of StyleTTS☆464Jan 13, 2025Updated last year
- Yin pitch estimator in PyTorch☆119Nov 7, 2022Updated 3 years ago
- Official implementation of SawSing (ISMIR'22)☆275Aug 28, 2022Updated 3 years ago
- An open-source music processing toolkit☆320Jun 25, 2023Updated 2 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 3 years ago
- Massively multilingual pronunciation mining☆368May 23, 2026Updated 3 weeks ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆46Apr 18, 2023Updated 3 years ago
- ☆171Jul 25, 2022Updated 3 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆714Jul 12, 2022Updated 3 years ago
- PyTorch implementation of the CREPE pitch tracker☆516May 16, 2025Updated last year
- A repository for benchmarking neural vocoders by their quality and speed.☆212May 30, 2025Updated last year
- Modified Python3 P2FA for Mandarin☆10Sep 21, 2020Updated 5 years ago