facebookresearch / tdfbanksLinks
Pytorch implementation of time-domain filterbanks
☆112Updated 4 years ago
Alternatives and similar repositories for tdfbanks
Users that are interested in tdfbanks are comparing it to the libraries listed below
Sorting:
- Utils and data sets for audio and PyTorch☆86Updated 4 years ago
- This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and…☆52Updated 7 years ago
- Time Delayed NN implemented in pytorch☆81Updated 8 years ago
- A test bed for updates and new features | pytorch/audio☆170Updated 5 years ago
- This repository contains the code to reproduce the core results from the paper "Unsupervised Learning of Disentangled and Interpretable R…☆156Updated 7 years ago
- audio processing module for pytorch:stft, istft☆36Updated 6 years ago
- Tensorflow implementation of the speech model described in Neural Discrete Representation Learning (a.k.a. VQ-VAE)☆128Updated 7 years ago
- Wavenet Autoencoder for Unsupervised speech representation learning (after Chorowski, Jan 2019)☆176Updated 5 years ago
- This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders…☆53Updated 7 years ago
- Fetch and use Google's AudioSet dataset☆126Updated 8 years ago
- Pytorch and TensorFlow data loaders for several audio datasets☆113Updated 5 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆99Updated 6 years ago
- FFTNet vocoder implementation☆81Updated 7 years ago
- Learn and L3 embedding from audio/video pairs☆88Updated 3 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 6 years ago
- PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)☆81Updated 7 years ago
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆75Updated 4 years ago
- ☆99Updated 8 years ago
- Dataset and baseline for the first Audiocaption task☆79Updated last year
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 7 years ago
- Chainer implementation of between-class learning for sound recognition https://arxiv.org/abs/1711.10282☆96Updated 7 years ago
- A fast cnn-based vocoder☆78Updated 5 years ago
- ☆27Updated 7 years ago
- The code for the MaD TwinNet. Demo page:☆112Updated 2 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆125Updated 6 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorch☆88Updated last year
- Tensor2tensor experiment with SpecAugment☆46Updated 6 years ago
- Voxceleb1 i-vector based speaker recognition system☆44Updated 7 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆40Updated 7 years ago