an-tran528 / wavetransformerLinks
Code base for WaveTransformer: A novel architecture for automated audio captioning
☆43Updated 4 years ago
Alternatives and similar repositories for wavetransformer
Users that are interested in wavetransformer are comparing it to the libraries listed below
Sorting:
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated 2 years ago
- PyTorch implementation of the Feed-Forward Attention Mechanism.☆18Updated 6 years ago
- Zero-shot Learning for Audio-based Music Classification and Tagging (ISMIR 2019)☆41Updated 5 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆34Updated 7 years ago
- ☆32Updated 4 years ago
- An audio classification system for learning with out-of-distribution data☆33Updated 2 years ago
- Audio captioning baseline system for DCASE 2020 challenge.☆38Updated last year
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated 2 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Updated 4 years ago
- ☆29Updated 5 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Updated 4 years ago
- Python code for handling the Clotho dataset.☆82Updated 4 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- Code accompanying ML4MD ICML 2020 paper - "Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance".☆30Updated 4 years ago
- Codes for paper <InteL-VAEs: Adding Inductive Biases to VariationalAuto-Encoders via Intermediary Latents>.☆19Updated 3 years ago
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 6 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 4 years ago
- ☆18Updated 4 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- Experiments and tutorials with and for torchaudio☆13Updated 4 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Updated 5 months ago
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Updated 10 months ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 5 years ago
- The training code for the 4th place model at MDX 2021 leaderboard A.☆36Updated 3 years ago
- Simple baseline model for the HEAR benchmark☆23Updated 2 months ago
- Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acousti…☆14Updated 4 years ago
- a pytorch implementation of Google GEDLoss☆32Updated 4 years ago
- This repo contains code for comparing audio representation sin the task of audio synthesis wth Generative Adversarial Networks (GAN)☆37Updated 2 years ago