sony / silentcipher
☆53Updated 9 months ago
Alternatives and similar repositories for silentcipher
Users that are interested in silentcipher are comparing it to the libraries listed below
Sorting:
- Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for V…☆225Updated 9 months ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆134Updated 4 months ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆51Updated last year
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆101Updated 4 months ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆78Updated 2 months ago
- An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.☆144Updated last month
- ☆74Updated 3 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆53Updated last year
- The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector (TAFFC 20…☆90Updated last month
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆120Updated last month
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆76Updated 5 months ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆97Updated 3 months ago
- ☆68Updated 8 months ago
- Audiogen Codec☆135Updated 10 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆90Updated last year
- Training code for FAcodec presented in NaturalSpeech3☆205Updated 8 months ago
- ☆134Updated 3 weeks ago
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions☆92Updated last year
- It's a repository for implementations of neural speech editing algorithms.☆198Updated last year
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆207Updated last year
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆144Updated last year
- Official Implementation of StyleTTS-VC☆179Updated 4 months ago
- An unofficial PyTorch implementation of VALL-E☆87Updated last week
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆91Updated 10 months ago
- ☆38Updated 7 months ago
- Reference-aware automatic speech evaluation toolkit☆153Updated 5 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆61Updated 4 months ago
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆76Updated 7 months ago
- A sequence-to-sequence voice conversion toolkit.☆97Updated 10 months ago
- UTokyo-SaruLab MOS Prediction System☆178Updated last month