Emrys365 / torch_stftLinks
PyTorch-based implementations of short-time Fourier transform
☆15Updated 6 months ago
Alternatives and similar repositories for torch_stft
Users that are interested in torch_stft are comparing it to the libraries listed below
Sorting:
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆32Updated last year
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Updated 4 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Updated 2 years ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 3 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆72Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging☆51Updated last year
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Updated 2 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated 2 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Updated 4 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
- A Neural Audio Codec (NAC) for Universal Audio☆44Updated 8 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Updated 2 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Updated 3 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Updated 6 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Updated last year
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Updated 2 years ago
- This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.☆13Updated 4 years ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆25Updated last month
- A toolkit for researchers in the multimodal sound separation.☆16Updated 2 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Updated 2 years ago
- ICASSP2022 TTS&VC Summary☆14Updated 3 years ago
- ☆16Updated 3 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Updated 3 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆13Updated last year
- ☆46Updated 2 years ago
- ☆16Updated 4 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Updated last year