sony / soundctm
Pytorch implementation of SoundCTM
☆71Updated last month
Related projects ⓘ
Alternatives and complementary repositories for soundctm
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆66Updated last week
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 9 months ago
- The official Implementation of PeriodWave and PeriodWave-Turbo☆132Updated 3 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆93Updated 3 weeks ago
- Codebase and project page for EDMSound☆29Updated last year
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆83Updated 3 weeks ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆54Updated 7 months ago
- ☆40Updated 5 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆70Updated 7 months ago
- Zero-Shot Emotion Style Transfer☆37Updated 7 months ago
- ☆81Updated 2 months ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆98Updated 3 weeks ago
- All generative model in one for better TTS model☆66Updated 2 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆71Updated 2 months ago
- ☆66Updated last year
- ☆45Updated last month
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆18Updated 2 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆42Updated 2 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- An unofficial PyTorch implementation of VALL-E☆77Updated this week
- E2E TTS using Conditional Flow Matching (Experimental*)☆66Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆83Updated last month
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆82Updated 2 months ago
- ☆34Updated 5 months ago
- GPT-style network for phonemization with durations of text☆62Updated 8 months ago
- FlashSpeech: Efficient Zero-Shot Speech Synthesis☆95Updated 2 months ago
- ☆57Updated 2 months ago
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆60Updated last month