nttcslab / eval-audio-repr
EVAR ~ Evaluation package for Audio Representations
☆47Updated 4 months ago
Alternatives and similar repositories for eval-audio-repr:
Users that are interested in eval-audio-repr are comparing it to the libraries listed below
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Inference code for PaSST, using the HEAR API.☆31Updated last year
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆89Updated 7 months ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆36Updated 9 months ago
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆14Updated 6 months ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆15Updated 3 months ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆114Updated 7 months ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆41Updated last year
- experiments about AudioSet☆44Updated last year
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆90Updated 9 months ago
- Official code of ElasticAST (Interspeech 2024 paper)☆29Updated 7 months ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆26Updated last year
- For students who would like to apply for RA, PhD, postdoc in audio research.☆24Updated 5 months ago
- ☆36Updated this week
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆87Updated 2 years ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆14Updated last year
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆42Updated 2 years ago
- ☆47Updated 3 months ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- This package aims at simplifying the download of the AudioSet dataset.☆48Updated last year
- ARCH: Audio Representations benCHmark☆43Updated 7 months ago
- Evaluation kit for the HEAR Benchmark☆58Updated 2 weeks ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated 11 months ago
- Single channel speech source separation by diffusion process (ICASSP 2023)☆100Updated last year
- ☆18Updated 3 years ago
- Adapting a ConvNeXt model to audio classification on AudioSet☆22Updated last month
- Official implementation of EfficientLEAF, a learnable audio frontend.☆40Updated 2 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆132Updated 2 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆36Updated 6 months ago