cdezapasquale / transfomer-audio-classification
small experimentation about positional encoding
☆17Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for transfomer-audio-classification
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- The official repository for Audio ALBERT☆64Updated 2 years ago
- Baseline systems for the FSD50K dataset☆67Updated 3 years ago
- Evaluation kit for the HEAR Benchmark☆56Updated 3 weeks ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆60Updated 2 months ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated last year
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 3 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Updated last year
- Sound event detection with depthwise separable and dilated convolutions.☆53Updated 4 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆89Updated 2 years ago
- ☆25Updated 2 years ago
- Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"☆16Updated 2 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆30Updated 6 years ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆123Updated 3 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆72Updated 3 years ago
- ☆53Updated 4 years ago
- Control mechanisms to the U-Net architecture for doing multiple source separation instruments☆48Updated 4 years ago
- Classify the emotions from variable-length speech segments☆11Updated 6 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- ☆17Updated 2 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Updated 3 months ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆89Updated 5 months ago
- Library of TensorFlow layers for audio data processing and data augmentation☆20Updated 2 years ago
- A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation☆137Updated 3 months ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆98Updated last year
- EVAR ~ Evaluation package for Audio Representations☆43Updated 2 weeks ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- experiments about AudioSet☆43Updated last year
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆76Updated 3 months ago
- Pytorch port of Google Research's LEAF Audio paper☆92Updated 3 years ago