Emrys365 / torch_stft
PyTorch-based implementations of short-time Fourier transform
☆15Updated 3 weeks ago
Alternatives and similar repositories for torch_stft
Users that are interested in torch_stft are comparing it to the libraries listed below
- An STFT/iSTFT written in PyTorch using 1D convolutions☆31Updated last year
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Updated last year
- This code is a PyTorch version of the CycleFlow model in "CycleFlow: Purify Information Factors by Cycle Loss"☆15Updated 3 years ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 3 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆44Updated 2 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Updated 3 years ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆61Updated 2 years ago
- Simple sinc interpolation in PyTorch.☆14Updated 2 years ago
- A Neural Audio Codec (NAC) for Universal Audio☆39Updated 2 months ago
- Streaming Audiotransformers for online Audio tagging☆46Updated last year
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Updated 4 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆55Updated 2 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Updated 11 months ago
- Multiband version of HiFi-GAN☆18Updated 4 years ago
- ICASSP2022 TTS&VC Summary☆14Updated 3 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆69Updated 3 years ago
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆41Updated 2 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971)☆39Updated 5 years ago
- Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT s…☆56Updated last week
- Based on https://github.com/fatchord/WaveRNN☆24Updated 5 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the ID to implement voice cloning.☆30Updated 2 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Updated 2 years ago
- a pytorch implementation of Google GEDLoss☆32Updated 4 years ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated last year
- This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.☆13Updated 3 years ago
- ☆22Updated 2 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆14Updated 2 years ago
- WaveNet autoencoders for ZeroSpeech challenge 2020☆37Updated 3 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 9 months ago