rishikksh20 / SoundStorm-pytorch
Google's SoundStorm: Efficient Parallel Audio Generation
☆130 · Updated last year
Alternatives and similar repositories for SoundStorm-pytorch:
Users who are interested in SoundStorm-pytorch are comparing it to the libraries listed below.
- Codec for paper: LLaSA: Scaling Train-time and Test-time Compute for LLaMA-based Speech Synthesis ☆126 · Updated 2 weeks ago
- VoiceLDM: Text-to-Speech with Environmental Context ☆168 · Updated 5 months ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis ☆118 · Updated 3 weeks ago
- Implementation of SoundStorm built upon SpeechTokenizer. ☆108 · Updated last year
- Audiogen Codec ☆130 · Updated 6 months ago
- Barkify: an unofficial training implementation of Bark TTS by suno-ai ☆126 · Updated last year
- An unofficial PyTorch implementation of VALL-E ☆87 · Updated this week
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis ☆145 · Updated last year
- Train the next generation of TTS systems. ☆162 · Updated 4 months ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec" ☆123 · Updated 4 months ago
- ☆70 · Updated last year
- VoiceBox neural network implementation ☆100 · Updated 5 months ago
- ☆69 · Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E ☆136 · Updated 3 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆90 · Updated 3 months ago
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch ☆265 · Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆114 · Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆121 · Updated 2 years ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning ☆120 · Updated 7 months ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations ☆139 · Updated 10 months ago
- Unsupervised Rhythm Modeling for Voice Conversion ☆80 · Updated last year
- ☆66 · Updated 4 months ago
- Official Implementation of StyleTTS-VC ☆174 · Updated 2 weeks ago
- The reproduced code for Google's SoundStorm ☆262 · Updated last year
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm" ☆129 · Updated last year
- Unofficial implementation of NVIDIA P-Flow TTS paper ☆220 · Updated last month
- Putting flows on top of neural transducers for better TTS ☆63 · Updated this week
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature ☆70 · Updated 4 months ago
- The open source code for SimpleSpeech series ☆122 · Updated 3 months ago
- Create training data for training a voice cloner for bark text to speech. ☆43 · Updated last year