MahtaFetrat / LLM-Powered-G2P
Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench and Kaamel-Dict.
☆13Updated 4 months ago
Alternatives and similar repositories for LLM-Powered-G2P
Users who are interested in LLM-Powered-G2P are comparing it to the libraries listed below
- The Vokan Architecture (Tsukasa speech based)☆10Updated 8 months ago
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Updated 4 years ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆18Updated last month
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…☆16Updated 2 years ago
- Transfer learning approach to pronunciation scoring☆11Updated last year
- Forced alignment decoder for Whisper.☆14Updated last year
- ☆14Updated 2 months ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆21Updated 2 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated 11 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Updated 6 months ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Updated 2 years ago
- An upgraded framework for training and validation, compared with icefall, using Lightning.☆13Updated 6 months ago
- Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis…☆15Updated 7 months ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆15Updated last year
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Updated 9 months ago
- Persian Grapheme-to-Phoneme (G2P) converter☆41Updated last year
- ☆13Updated 11 months ago
- This is the experimental description of MnTTS2.☆11Updated last year
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"☆21Updated 3 years ago
- Sing any popular song with your voice☆11Updated 3 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆21Updated 2 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 5 months ago
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Updated 4 years ago
- Distillation of Self-Supervised Representation-Based Speech Quality Assessment☆34Updated 5 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆20Updated 3 weeks ago
- Persian Grapheme-to-Phoneme (G2P) converter☆20Updated 4 years ago
- StyleTTS 2 Optimized Training Fork☆33Updated 8 months ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆27Updated last month