Emrys365 / torch_stftLinks
PyTorch-based implementations of short-time Fourier transform
☆15Updated 6 months ago
Alternatives and similar repositories for torch_stft
Users that are interested in torch_stft are comparing it to the libraries listed below
Sorting:
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆32Updated last year
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 3 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Updated 2 years ago
- his code is a pytorch version for CycleFlow model in "CycleFlow: Purify Information Factors by Cycle Loss"☆15Updated 4 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated 2 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆72Updated 3 years ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Updated 2 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging☆51Updated last year
- Based on https://github.com/fatchord/WaveRNN☆24Updated 5 years ago
- A Neural Audio Codec (NAC) for Universal Audio☆44Updated 8 months ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.☆13Updated 4 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Updated 3 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Updated 4 years ago
- ICASSP2022 TTS&VC Summary☆14Updated 3 years ago
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Updated 5 years ago
- ☆16Updated 3 years ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆29Updated 2 years ago
- Parallel waveform generation with DiffusionGAN☆17Updated 3 years ago
- ☆66Updated 2 years ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆28Updated last year
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Updated 3 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆14Updated 3 years ago
- Simple sinc interpolation in PyTorch.☆15Updated 2 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Updated 2 years ago
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆43Updated 3 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Updated 2 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Updated last year
- My vocoder experiments☆31Updated 6 months ago