cdezapasquale / transfomer-audio-classification
Small experiment on positional encoding
☆19Updated 5 years ago
Alternatives and similar repositories for transfomer-audio-classification
Users that are interested in transfomer-audio-classification are comparing it to the libraries listed below
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆129Updated 4 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆65Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- Audio transformations library for PyTorch☆233Updated 3 years ago
- The official repository for Audio ALBERT☆67Updated 3 years ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆224Updated 2 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Updated 2 years ago
- COLA contrastive pre-training method implemented in PyTorch☆43Updated 4 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆145Updated 3 years ago
- Pytorch implementation of [Learning to match transient sound events using attentional similarity for few-shot sound recognition]☆33Updated 6 years ago
- Freesound Audio Tagging 2019☆95Updated 6 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆90Updated 3 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated 2 years ago
- ☆54Updated 5 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆76Updated 3 years ago
- A collection of Audio and Speech pre-trained models.☆194Updated 5 years ago
- Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"☆17Updated 3 years ago
- Evaluation kit for the HEAR Benchmark☆62Updated this week
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆74Updated 4 years ago
- Learn an L3 embedding from audio/video pairs☆88Updated 3 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆104Updated 2 years ago
- PyTorch reimplementation of per-channel energy normalization for audio.☆103Updated 6 years ago
- PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate audioset category ontology.☆98Updated 7 months ago
- A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation"☆137Updated last year
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp…☆131Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- EVAR ~ Evaluation package for Audio Representations☆68Updated last week
- Audio data augmentation examples☆34Updated 7 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆90Updated 5 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆149Updated 2 years ago