josephrocca / lyra-v2-soundstream-webLinks
Lyra V2 (SoundStream) running in the browser
☆19Updated last year
Alternatives and similar repositories for lyra-v2-soundstream-web
Users that are interested in lyra-v2-soundstream-web are comparing it to the libraries listed below
Sorting:
- ☆23Updated 2 years ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆13Updated 11 months ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Updated last year
- A simple voice conversion tool☆17Updated 3 years ago
- GPT for FACodec☆13Updated last year
- ☆24Updated 2 months ago
- Acoustic Neighbor Embeddings☆24Updated 7 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆29Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆100Updated 9 months ago
- Simple PyTorch Denoisers for Waveform Audio☆35Updated 2 months ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated 2 years ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 3 years ago
- ☆40Updated 5 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated last year
- Demo for 2022 ICASSP☆64Updated 3 years ago
- AudioSR-Upsampling (any -> 48kHz)☆41Updated last year
- Zero-Shot Emotion Style Transfer☆48Updated 2 months ago
- ☆29Updated last year
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆27Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 6 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆21Updated 3 weeks ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 3 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆25Updated last year
- ☆41Updated 2 years ago
- Melody Lyric Transformer Implementation and Model☆10Updated 2 years ago
- The demo page of UniAudio☆34Updated last year
- speaker-disentangled speech linguistic content quantizer☆21Updated 3 months ago
- ☆13Updated 10 months ago
- My vocoder experiments☆30Updated this week