MahtaFetrat / LLM-Powered-G2PLinks
Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench and Kaamel-Dict.
☆11Updated last month
Alternatives and similar repositories for LLM-Powered-G2P
Users that are interested in LLM-Powered-G2P are comparing it to the libraries listed below
Sorting:
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.☆10Updated 5 months ago
- ☆11Updated 2 years ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 9 months ago
- MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation☆13Updated 3 months ago
- Onset-and-Offset-Aware Sound Event Detection☆17Updated 5 months ago
- A semi-supervised sequence-to-sequence ASR☆10Updated 2 years ago
- Persian Grapheme To Phoneme with Transformer in Pytorch☆11Updated last year
- Sing any popular song with your voice☆11Updated 3 years ago
- ☆11Updated last year
- ☆13Updated 8 months ago
- ☆14Updated last year
- The Vokan Architecture (Tsukasa speech based)☆10Updated 5 months ago
- ☆11Updated 8 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- ☆11Updated 4 months ago
- Speech Resynthesis and Language Modeling☆20Updated last month
- Forced alignment decoder for Whisper.☆14Updated last year
- Voice activity detection and speaker gender segmentation audiovisual corpus☆15Updated 5 months ago
- ☆11Updated last year
- Paper, Code and Statistics for Speech Generatation.☆10Updated 2 years ago
- ☆12Updated 5 months ago
- ☆13Updated 10 months ago
- ☆15Updated last week
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…☆15Updated 2 years ago
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi…☆11Updated last month
- DysfluentWFST☆13Updated last month
- Code for ICML25 Paper "Overcoming Non-monotonicity in Transducer-based Streaming Generation"☆11Updated last month
- Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]☆8Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆14Updated 7 months ago