younengma / eden-ttsView external linksLinks
☆12Dec 29, 2023Updated 2 years ago
Alternatives and similar repositories for eden-tts
Users that are interested in eden-tts are comparing it to the libraries listed below
Sorting:
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- ICASSP 2024 paper - A Fully Differentiable Model for Unsupervised Singing Voice Separation☆14Mar 7, 2025Updated 11 months ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆15Oct 27, 2023Updated 2 years ago
- Megatts2 use HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- ☆17Jan 26, 2021Updated 5 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- Voice conversion with just linear regression.☆33Sep 25, 2025Updated 4 months ago
- ☆32Apr 22, 2024Updated last year
- Phoneme segmentation using pre-trained speech models☆55Nov 4, 2022Updated 3 years ago
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆27May 30, 2025Updated 8 months ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆29Sep 6, 2023Updated 2 years ago
- ☆64Jan 15, 2024Updated 2 years ago
- SDX23 startkit for the Demucs baselines.☆30Mar 3, 2023Updated 2 years ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆77Dec 3, 2025Updated 2 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Mar 14, 2025Updated 11 months ago
- ☆30Nov 5, 2023Updated 2 years ago
- working on parallel wavenet☆25Apr 19, 2018Updated 7 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Jul 2, 2024Updated last year
- ☆69May 19, 2023Updated 2 years ago
- ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation☆38Nov 20, 2024Updated last year
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 2 years ago
- VALL-E 2 reproduction☆134Jul 14, 2024Updated last year
- [INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 8 months ago
- ☆82Jan 22, 2025Updated last year
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Aug 29, 2023Updated 2 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆37Oct 27, 2025Updated 3 months ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Aug 21, 2023Updated 2 years ago
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆235Apr 20, 2024Updated last year
- PyTorch version of Spotify's Basic Pitch☆44Apr 19, 2024Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆159Jun 13, 2024Updated last year
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Feb 9, 2026Updated last week
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Aug 16, 2024Updated last year
- PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to Enhancing Monotonicity for Robust Autoregressive Transformer …☆39May 16, 2021Updated 4 years ago
- [ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"☆98Nov 14, 2024Updated last year
- Frequency tracking in time-frequency representations☆13Jan 19, 2021Updated 5 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year