pseeth / torch-stft
An STFT/iSTFT for PyTorch.
☆356 Updated last year
Alternatives and similar repositories for torch-stft:
Users who are interested in torch-stft are comparing it to the libraries listed below.
- Speech Enhancement Generative Adversarial Network in PyTorch ☆388 Updated last year
- A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https:/… ☆209 Updated last year
- SEGAN PyTorch implementation https://arxiv.org/abs/1703.09452 ☆108 Updated 6 years ago
- ☆298 Updated 5 years ago
- Improved Wave-U-Net implemented in PyTorch ☆329 Updated 8 months ago
- Deep learning based speech source separation using PyTorch ☆316 Updated 4 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente… ☆218 Updated 2 years ago
- A library for speech data augmentation in the time domain ☆656 Updated 3 years ago
- Audio Denoising with Deep Network Priors ☆163 Updated 4 years ago
- 🔦 A PyTorch implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ☆492 Updated 3 years ago
- A test bed for updates and new features | pytorch/audio ☆169 Updated 4 years ago
- Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM ☆360 Updated 2 years ago
- An open-source speech separation and enhancement library ☆211 Updated 4 years ago
- Speech Denoising with Deep Feature Losses ☆186 Updated 4 years ago
- A PyTorch implementation of DNN-based source separation. ☆296 Updated 3 years ago
- A PyTorch implementation of Wave-U-Net, adapted to speech enhancement. ☆327 Updated 2 years ago
- Python implementation of the Short Term Objective Intelligibility measure ☆337 Updated last year
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR. ☆507 Updated 3 years ago
- Tools for Speech Enhancement integrated with Kaldi ☆410 Updated last year
- An open source dataset for source separation ☆410 Updated last year
- Transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming. ☆269 Updated 3 years ago
- Different implementations of "Weighted Prediction Error" for speech dereverberation ☆508 Updated last week
- A pure Python module for reading and writing Kaldi ark files ☆256 Updated 3 weeks ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei … ☆208 Updated 2 years ago
- Time delay neural network (TDNN) implementation in PyTorch using the unfold method ☆201 Updated 5 years ago
- Problem Agnostic Speech Encoder ☆440 Updated last year
- Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper ☆141 Updated last year
- ☆226 Updated 5 years ago
- Authors' implementation of DeepSpeech Distances. ☆129 Updated 4 years ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion" ☆357 Updated 8 months ago