slp-rl / HebTTS
The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"
☆89Updated 8 months ago
Alternatives and similar repositories for HebTTS:
Users that are interested in HebTTS are comparing it to the libraries listed below
- Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730☆128Updated last year
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆68Updated 5 months ago
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆45Updated 3 weeks ago
- ☆88Updated this week
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆101Updated 2 months ago
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆85Updated 11 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆186Updated this week
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆84Updated 3 months ago
- Official repository of Wavehax vocoder☆46Updated 4 months ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆150Updated last year
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆94Updated 5 months ago
- VoiceBox neural network implementation☆105Updated 7 months ago
- Official Implementation of StyleTTS-VC☆177Updated 2 months ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆35Updated last week
- The official implementation of EmoSphere++☆80Updated last week
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Updated 8 months ago
- ☆39Updated last month
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- Official implementation for FlowSep☆34Updated 2 months ago
- ☆16Updated 2 months ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆128Updated 2 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆206Updated 11 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆60Updated 2 weeks ago
- ☆69Updated 2 months ago
- Pytorch implementation of SoundCTM☆85Updated last month
- [Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation