ORI-Muchim / AudioSR-Upsampling
AudioSR-Upsampling (any -> 48kHz)
☆40Updated last year
Alternatives and similar repositories for AudioSR-Upsampling:
Users that are interested in AudioSR-Upsampling are comparing it to the libraries listed below
- ☆43Updated 10 months ago
- ☆45Updated 3 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆59Updated 2 months ago
- Unofficial implementation of wavenext vocoder☆44Updated 7 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆60Updated 2 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 10 months ago
- Adaptive Vocoder for Custom Voice☆59Updated 2 years ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆23Updated 3 weeks ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Updated last year
- Implementation of Emo-StarGAN☆45Updated last year
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆89Updated 9 months ago
- Official implementation for FlowSep☆40Updated 3 months ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆15Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆41Updated 2 weeks ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Updated last year
- BigVGAN with Neural Source-Filter☆54Updated last year
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆84Updated 3 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆33Updated last year
- The source code for the paper XiaoiceSing2 (interspeech2023)☆47Updated last year
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆38Updated 9 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆53Updated last year
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆51Updated 5 months ago
- ☆48Updated last week
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Updated 2 years ago
- High quality text-to-speech based on StyleTTS 2.☆35Updated this week
- Prosody and Pronunciation Modification Network☆52Updated 2 weeks ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆15Updated last year
- Sequence alignement methods with helpers for PyTorch.☆24Updated 2 years ago
- ☆39Updated last year