DakeQQ / STFT-ISTFT-ONNX
Export the STFT or ISTFT process in ONNX format.
☆19 Updated last week
Alternatives and similar repositories for STFT-ISTFT-ONNX:
Users who are interested in STFT-ISTFT-ONNX are comparing it to the libraries listed below:
- Prosody and Pronunciation Modification Network ☆47 Updated 5 months ago
- Unofficial implementation of wavenext vocoder ☆40 Updated 5 months ago
- ☆38 Updated 4 months ago
- Megatts2 use HierSpeechpp's vocoder ☆17 Updated last month
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion ☆34 Updated 7 months ago
- AudioSR-Upsampling (any -> 48kHz) ☆38 Updated 11 months ago
- dog-can-sing-song ☆18 Updated 2 months ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter ☆86 Updated 6 months ago
- Alignment examples for Interspeech 2024 ☆18 Updated 6 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 Updated 3 months ago
- ☆12 Updated 5 months ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-… ☆24 Updated 4 months ago
- ☆24 Updated 2 weeks ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation ☆34 Updated last year
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer. ☆29 Updated last week
- ☆46 Updated 2 months ago
- ☆43 Updated 7 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction ☆37 Updated 2 months ago
- VITS with phoneme-level prosody modeling based on MaskGIT ☆79 Updated 5 months ago
- ☆18 Updated 8 months ago
- ☆18 Updated 4 months ago
- ☆21 Updated 5 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion ☆39 Updated 2 weeks ago
- a lightweight voice conversion ☆78 Updated 4 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE ☆63 Updated 9 months ago
- ☆44 Updated last year
- Just another FastSpeech 2 but cleaner code :) ☆25 Updated 7 months ago
- Speech Human Evaluation Estimation Toolkit (SHEET) ☆51 Updated 2 months ago
- ☆65 Updated last week
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake… ☆57 Updated last year