MahtaFetrat / LLM-Powered-G2P
Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench and Kaamel-Dict.
☆13Updated 6 months ago
Alternatives and similar repositories for LLM-Powered-G2P
Users who are interested in LLM-Powered-G2P are comparing it to the libraries listed below.
- Sing any popular song with your voice☆11Updated 3 years ago
- The Vokan Architecture (Tsukasa speech based)☆10Updated 9 months ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…☆16Updated 2 years ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆34Updated 3 months ago
- ☆14Updated 3 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Updated last year
- ☆13Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆29Updated 3 weeks ago
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Updated 4 years ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Updated 3 weeks ago
- ☆11Updated 2 years ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆16Updated last year
- PyTorch implementation of the paper MultiSpeech: Multi-Speaker Text to Speech with Transformer☆21Updated 3 years ago
- ☆13Updated 8 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Updated 2 months ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆21Updated 2 weeks ago
- Onset-and-Offset-Aware Sound Event Detection☆20Updated 9 months ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 8 months ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆28Updated 2 months ago
- ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models☆31Updated last week
- Pybind11 bindings for Kaldi☆14Updated 2 months ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Updated 2 years ago
- An upgraded framework for training and validation, compared with icefall, using Lightning.☆13Updated 8 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆22Updated 2 years ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12Updated last year
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Updated 2 years ago
- Distillation of Self-Supervised Representation-Based Speech Quality Assessment☆38Updated 6 months ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 3 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Updated 2 years ago
- ☆18Updated last year