Emrys365 / torch_stft
PyTorch-based implementations of short-time Fourier transform
☆15Updated 2 years ago
Alternatives and similar repositories for torch_stft:
Users that are interested in torch_stft are comparing it to the libraries listed below
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Updated 3 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆27Updated 6 months ago
- real-time speech enhance☆12Updated 11 months ago
- ☆16Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- A neural speech codec based on discrete WavLM representations☆22Updated 4 months ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated last year
- SRTNet☆24Updated last year
- A small tool to calculate the distribution of audio durations in a directory☆14Updated last year
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated last year
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆15Updated 3 months ago
- Spherical residual vector quantization (SRVQ)☆27Updated 4 months ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 2 years ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 5 months ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆11Updated 5 months ago
- Mutiband version of HIFIGAN☆17Updated 4 years ago
- This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.☆11Updated 3 years ago
- The source code for the paper CrossSinger (asru2023)☆18Updated last year
- ☆14Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated 5 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆34Updated last year
- Aligner for text-to-speech☆15Updated 6 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆22Updated 9 months ago
- Streaming Vocos☆19Updated last week
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆60Updated last year