kyamauchi1023 / PL-BERT-jaView external linksLinks
A repository of Japanese Phoneme-Level BERT
☆22Dec 16, 2023Updated 2 years ago
Alternatives and similar repositories for PL-BERT-ja
Users that are interested in PL-BERT-ja are comparing it to the libraries listed below
Sorting:
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- 日本音響学会誌用BibTeXスタイルファイル☆11Jan 24, 2022Updated 4 years ago
- xvector model on jtubespeech☆47Nov 5, 2023Updated 2 years ago
- ☆36Sep 20, 2022Updated 3 years ago
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- ☆60Jan 8, 2025Updated last year
- 44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。☆16May 21, 2023Updated 2 years ago
- ☆26Aug 8, 2024Updated last year
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆29Mar 28, 2024Updated last year
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Oct 19, 2023Updated 2 years ago
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆54Sep 25, 2023Updated 2 years ago
- Taiwanese Speech Synthesis with Tacotron2☆25Oct 2, 2022Updated 3 years ago
- ☆11Aug 11, 2023Updated 2 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- ☆22Jun 24, 2024Updated last year
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- pyopenjtalk-plus: A Python wrapper for OpenJTalk with additional improvements☆55Nov 18, 2025Updated 2 months ago
- ☆25Jan 24, 2023Updated 3 years ago
- ☆26Jun 5, 2024Updated last year
- text to speech☆10Mar 19, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- ☆10May 16, 2024Updated last year
- ☆11Nov 7, 2024Updated last year
- ☆10Oct 16, 2025Updated 4 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …☆18Aug 17, 2025Updated 6 months ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- ☆14Aug 16, 2023Updated 2 years ago
- ☆15Nov 10, 2025Updated 3 months ago
- ☆14Aug 1, 2025Updated 6 months ago
- ☆15Nov 11, 2024Updated last year
- DysfluentWFST☆17Nov 13, 2025Updated 3 months ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Jun 6, 2022Updated 3 years ago
- ☆13Mar 11, 2025Updated 11 months ago
- ☆11Oct 14, 2023Updated 2 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆31May 14, 2024Updated last year
- ☆26Sep 22, 2022Updated 3 years ago
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆33Oct 16, 2023Updated 2 years ago
- Chinese polyphone disambiguation for Text-to-Speech application☆42Jun 11, 2024Updated last year