ishine/PnG-BERT

PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS

☆24

Alternatives and similar repositories for PnG-BERT

Users who are interested in PnG-BERT are comparing it to the libraries listed below.

Infinity-INF / fast-phasr
Phonemes and durations labeling based on whisper small
☆11 · Jul 7, 2024 · Updated 2 years ago
yl4579 / PL-BERT
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
☆270 · Jan 13, 2025 · Updated last year
google-research-datasets / WikipediaHomographData
Labeled data for homograph disambiguation
☆62 · Jun 1, 2023 · Updated 3 years ago
MiscellaneousStuff / PhoneLM
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48 · Sep 4, 2023 · Updated 2 years ago
WX-Wei / HarmoF0
☆108 · Aug 23, 2024 · Updated last year
XierHacker / Model_Fusion_Based_Prosody_Prediction
Model Fusion Based Prosody Prediction
☆17 · Mar 18, 2018 · Updated 8 years ago
AlexandaJerry / SingingVoice-MFA-Training
MFA acoustic model training based on Opencpop
☆15 · Sep 23, 2022 · Updated 3 years ago
choiHkk / Transformer-TTS-V2
☆25 · Mar 6, 2024 · Updated 2 years ago
sh-lee-prml / BigVGAN
Unofficial PyTorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training
☆136 · Feb 18, 2023 · Updated 3 years ago
y-chan / hifi-gan-misrnet
Unofficial PyTorch implementation of HiFi-GAN with fast MISR.
☆15 · Mar 21, 2023 · Updated 3 years ago
BridgetteSong / Tacotron2
☆13 · Sep 21, 2022 · Updated 3 years ago
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11 · Apr 10, 2025 · Updated last year
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18 · Oct 20, 2024 · Updated last year
LCF2764 / autoKWS2021_1st_solution
Auto-KWS 2021 Challenge 1st place solution.
☆11 · Jul 20, 2021 · Updated 5 years ago
MiniXC / LightningFastSpeech2
☆55 · Jan 13, 2023 · Updated 3 years ago
makerjackie / tts-frontend-dataset
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
☆104 · Feb 5, 2024 · Updated 2 years ago
lifeiteng / SoundStorm
☆71 · Jul 13, 2023 · Updated 3 years ago
xcmyz / CLONE
☆20 · Jul 13, 2022 · Updated 4 years ago
hs-oh-prml / DurFlexEVC
☆82 · Jan 22, 2025 · Updated last year
walker-hyf / GPT-Talker
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆78 · Nov 1, 2024 · Updated last year
entn-at / DurIAN-1
Implementation of "DurIAN: Duration Informed Attention Network For Multimodal Synthesis".
☆15 · Jul 6, 2020 · Updated 6 years ago
p1an-lin-jung / wv_tts
☆19 · Mar 22, 2024 · Updated 2 years ago
JoungheeKim / Non-Attentive-Tacotron
A PyTorch implementation of Google's Non-Attentive Tacotron.
☆57 · Dec 21, 2022 · Updated 3 years ago
innnky / vispeech
TTS model based on VITS, FastSpeech2, and VISinger
☆24 · Mar 9, 2023 · Updated 3 years ago
yanggeng1995 / Multi-band-WaveRNN
☆45 · Dec 16, 2019 · Updated 6 years ago
youngsheen / GPST
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
☆70 · Nov 1, 2024 · Updated last year
reppy4620 / vocoders
My vocoder experiments
☆31 · Jul 26, 2025 · Updated last year
skit-ai / slu-prosody
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27 · May 17, 2023 · Updated 3 years ago
shengcanxu / canoSpeech
text to speech
☆10 · Mar 19, 2024 · Updated 2 years ago
zjwang21 / mix-phoneme-bert
An unofficial PyTorch implementation of Mix-Phoneme-Bert
☆40 · Jul 10, 2023 · Updated 3 years ago
janson9192 / autokws2021
☆13 · Mar 25, 2021 · Updated 5 years ago
ex3ndr / supervoice-flow
SpeechFlow neural network implementation
☆23 · Aug 8, 2024 · Updated last year
huutuongtu / Lightvoc
LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform
☆18 · May 17, 2024 · Updated 2 years ago
lifeiteng / VoiceBox
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
☆29 · Aug 4, 2023 · Updated 2 years ago
shang0712 / HierTTS
☆47 · Apr 16, 2023 · Updated 3 years ago
lucidrains / spear-tts-pytorch
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in PyTorch
☆277 · Oct 30, 2023 · Updated 2 years ago
wonjune-kang / expressive-speech-retrieval
Expressive Speech Retrieval using Natural Language Descriptions of Speaking Style
☆15 · Aug 18, 2025 · Updated 11 months ago
monglechap / fluenttts
FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS
☆20 · Nov 15, 2022 · Updated 3 years ago
NVIDIA / radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …
☆291 · Apr 6, 2023 · Updated 3 years ago