an-tran528 / wavetransformer
Code base for WaveTransformer: A novel architecture for automated audio captioning
☆43Updated 4 years ago
Alternatives and similar repositories for wavetransformer
Users that are interested in wavetransformer are comparing it to the libraries listed below
Sorting:
- ☆29Updated 5 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Updated 4 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated 2 years ago
- Zero-shot Learning for Audio-based Music Classification and Tagging (ISMIR 2019)☆41Updated 5 years ago
- ☆18Updated 4 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- Audio captioning baseline system for DCASE 2020 challenge.☆38Updated last year
- An audio classification system for learning with out-of-distribution data☆33Updated 2 years ago
- a pytorch implementation of Google GEDLoss☆32Updated 4 years ago
- Control mechanisms to the U-Net architecture for doing multiple source separation instruments☆51Updated 4 years ago
- ☆32Updated 4 years ago
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆34Updated 7 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- ☆12Updated 7 years ago
- A list of resources that can help in research for automated audio captioning☆34Updated 4 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Updated 3 years ago
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 6 years ago
- ☆10Updated last year
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Updated 4 months ago
- Code accompanying ML4MD ICML 2020 paper - "Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance".☆30Updated 4 years ago
- An implement of SPEECHSPLIT☆15Updated 4 years ago
- The code for the ISMIR 2019 paper “Supervised symbolic music style translation using synthetic data”.☆27Updated 2 years ago
- The training code for the 4th place model at MDX 2021 leaderboard A.☆35Updated 3 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Updated 4 years ago
- Language modelling for sound event detection☆20Updated 5 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 4 months ago
- WaveNet implementation using tf.estimator☆21Updated last year
- SiSEC MUS 2018 Submission System☆43Updated 5 years ago