yuboona/some-script-to-help-using-Montreal-Forced-Aligner

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yuboona/some-script-to-help-using-Montreal-Forced-Aligner)

yuboona / some-script-to-help-using-Montreal-Forced-Aligner

Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textgrid files.

☆14

Alternatives and similar repositories for some-script-to-help-using-Montreal-Forced-Aligner

Users that are interested in some-script-to-help-using-Montreal-Forced-Aligner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
timedomain-tech / ACE_phonemes
View on GitHub
a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine
☆44Jan 17, 2025Updated last year
rishikksh20 / MiniMax-TTS-pytorch
View on GitHub
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆47Sep 2, 2025Updated 10 months ago
yangdongchao / ALMTokenizer
View on GitHub
The demo page for ALMTokenizer
☆59Apr 14, 2025Updated last year
IU-SAIGE / pse
View on GitHub
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23Mar 12, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zizyzhang / DNN-Based-Singing-Voice-Synthesis
View on GitHub
DNN based singing voice synthesis
☆17Oct 15, 2018Updated 7 years ago
bshall / urhythmic
View on GitHub
Unsupervised Rhythm Modeling for Voice Conversion
☆85Aug 3, 2023Updated 2 years ago
MiniXC / LightningFastSpeech2
View on GitHub
☆55Jan 13, 2023Updated 3 years ago
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
channel-io / ch-tts-llasa-rl-grpo
View on GitHub
☆51Apr 20, 2026Updated 3 months ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
cyhuang-tw / robust-vc
View on GitHub
☆11May 7, 2022Updated 4 years ago
LAION-AI / scaled-echo-tts
View on GitHub
Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024…
☆24Mar 29, 2026Updated 3 months ago
errolyan / text_normalization_CH
View on GitHub
TTS前，文本标准化，将数字字母处理转化为汉字
☆12Apr 27, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lmxue / ICASSP2022_TTS_VC_Summary
View on GitHub
ICASSP2022 TTS&VC Summary
☆13Jun 9, 2022Updated 4 years ago
erogol / TTS_tf
View on GitHub
WIP Tensorflow implementation of https://github.com/mozilla/TTS
☆15Apr 11, 2020Updated 6 years ago
hongwen-sun / speech-aligner
View on GitHub
speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…
☆15Dec 19, 2018Updated 7 years ago
msalhab96 / SNR-Estimation-Using-Deep-Learning
View on GitHub
An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning
☆43Mar 23, 2022Updated 4 years ago
thuhcsi / SnakeGAN
View on GitHub
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37Apr 25, 2023Updated 3 years ago
jasonppy / syllable-discovery
View on GitHub
Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
☆35Aug 27, 2023Updated 2 years ago
Mddct / transformer-vocos
View on GitHub
☆35Sep 6, 2025Updated 10 months ago
omarperacha / GANkyoku
View on GitHub
A Generative Adversarial Network for Shakuhachi Music
☆14Jul 2, 2019Updated 7 years ago
XinyuZhou2000 / Spoken-Dialogue
View on GitHub
☆18Dec 7, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Tencent / SongBench
View on GitHub
☆51Apr 30, 2026Updated 2 months ago
i3thuan5 / hts_engine_python
View on GitHub
python wrap for hts engine
☆14Jan 30, 2018Updated 8 years ago
ljuvela / multiscale-GAN
View on GitHub
Code for ICASSP 2019 paper
☆18Oct 29, 2018Updated 7 years ago
xcmyz / Lifelong-Learning-Tacotron2
View on GitHub
MultiSpeaker Tacotron2 using LifeLong Learning.
☆13Sep 27, 2019Updated 6 years ago
pc2752 / ss_synthesis
View on GitHub
☆17Jul 31, 2019Updated 6 years ago
streichgeorg / autosing
View on GitHub
☆18Jan 20, 2025Updated last year
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
neonbjb / tts-scores
View on GitHub
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
☆175Dec 18, 2023Updated 2 years ago
FantSun / CycleFlow
View on GitHub
his code is a pytorch version for CycleFlow model in "CycleFlow: Purify Information Factors by Cycle Loss"
☆15Jan 14, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
seungheondoh / hi_kia
View on GitHub
wake-up word emotion recognition [APSIPA 2022]
☆17Nov 11, 2022Updated 3 years ago
OpenNSP / Hifi-vaegan
View on GitHub
☆47Aug 31, 2024Updated last year
xiquan-li / Resonate
View on GitHub
[INTERSPEECH 2026] Pre-training, SFT, DPO and GRPO for Text-to-Audio Generation
☆48Apr 17, 2026Updated 3 months ago
W-Wu / ERC-SLT22
View on GitHub
Code for "Distribution-based Emotion Recognition in Conversation"
☆18Feb 6, 2023Updated 3 years ago
SparkAudio / VoxBox
View on GitHub
A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.
☆115May 5, 2025Updated last year
xiaomi-research / tts-prism
View on GitHub
☆47Apr 27, 2026Updated 3 months ago
mcf330 / efts2code
View on GitHub
source code of EfficientTTS 2
☆21Feb 18, 2024Updated 2 years ago