MahtaFetrat / LLM-Powered-G2P
Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench and Kaamel-Dict.
☆8 Updated 2 months ago
Alternatives and similar repositories for LLM-Powered-G2P:
Users who are interested in LLM-Powered-G2P are comparing it to the repositories listed below.
- ManaTTS is the largest open Persian speech dataset with 100+ hours of transcribed audio. Includes data collection pipeline and tools. Sui… ☆25 Updated 2 months ago
- Persian Grapheme-to-Phoneme with Transformer in PyTorch ☆11 Updated last year
- Persian Grapheme-to-Phoneme (G2P) converter ☆20 Updated 4 years ago
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language. ☆10 Updated 2 months ago
- Persian Grapheme-to-Phoneme (G2P) converter ☆40 Updated 9 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆17 Updated 6 months ago
- ☆11 Updated 6 months ago
- Tihu dictionary for Persian language ☆12 Updated 5 years ago
- A collection of inspiring lists, repos, datasets, models, tools and more for Persian language speech-to-text (STT) and text-to-speech (TTS)… ☆63 Updated 4 months ago
- Sharif Emotional Speech Database ☆34 Updated 4 years ago
- A Grapheme-to-Phoneme model using LSTM, implemented in PyTorch ☆12 Updated 2 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering ☆20 Updated last year
- ☆11 Updated 2 months ago
- A simple command line tool to calculate WER for ASR. ☆14 Updated 6 months ago
- ☆44 Updated last year
- Forced alignment decoder for Whisper. ☆14 Updated last year
- Simple tool for speech dataset augmentation for modeling various prosodies. ☆14 Updated 4 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding" ☆13 Updated 2 weeks ago
- Mason-Alberta Phonetic Segmenter ☆9 Updated 3 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆14 Updated last month
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech ☆16 Updated 2 years ago
- Example Python scripts to evaluate various ASR methods ☆12 Updated 3 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 Updated last year
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for… ☆9 Updated 3 years ago
- ☆11 Updated 2 years ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr… ☆14 Updated last year
- Whisper Speech Quality Assessment (WhiSQA) ☆9 Updated 4 months ago
- Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023] ☆8 Updated 9 months ago
- text to speech ☆10 Updated last year
- ☆13 Updated 8 months ago