MahtaFetrat / LLM-Powered-G2P
Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench and Kaamel-Dict.
☆11Updated 2 months ago
Alternatives and similar repositories for LLM-Powered-G2P
Users who are interested in LLM-Powered-G2P are comparing it to the repositories listed below.
- Persian Grapheme-to-Phoneme (G2P) converter☆20Updated 4 years ago
- ☆11Updated 9 months ago
- Sing any popular song with your voice☆11Updated 3 years ago
- Forced alignment decoder for Whisper.☆14Updated last year
- The Vokan Architecture (Tsukasa speech based)☆10Updated 6 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆12Updated last month
- Persian Grapheme To Phoneme with Transformer in Pytorch☆11Updated last year
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.☆11Updated 6 months ago
- Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis…☆14Updated 4 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Updated 4 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 10 months ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 4 months ago
- ☆13Updated last week
- ☆18Updated last month
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…☆16Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- ☆16Updated last week
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆20Updated 2 years ago
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Updated 4 years ago
- ☆38Updated 3 weeks ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 2 months ago
- The official repository for NonVerbalSpeech-38K.☆15Updated this week
- Transfer learning approach to pronunciation scoring☆10Updated last year
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform☆18Updated last year
- text to speech☆10Updated last year
- Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil…☆29Updated last month
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆19Updated 2 months ago