fschmid56 / EfficientAT_HEAR
Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.
☆23Updated last year
Related projects:
- Learning differentiable temporal resolution on time-series data.☆33Updated last year
- Inference code for PaSST, using the HEAR API.☆28Updated 8 months ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆22Updated 3 months ago
- Experiments on AudioSet☆43Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆21Updated 5 months ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆20Updated 6 months ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆39Updated last year
- A Diffusion Probabilistic Model for Target Sound Extraction☆29Updated 4 months ago
- Code for the paper "Learning Audio-Visual Dereverberation"☆25Updated 2 years ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆41Updated last week
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆66Updated last month
- ARCH: Audio Representations benCHmark☆25Updated 3 weeks ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆40Updated 2 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆82Updated 2 years ago
- This package aims at simplifying the download of the AudioSet dataset.☆38Updated 11 months ago
- SAMO: Speaker Attractor Multi-Center One-Class Learning for Voice Anti-Spoofing☆31Updated last year
- A toolkit for researchers in multimodal sound separation.☆16Updated 11 months ago
- EVAR ~ Evaluation package for Audio Representations☆41Updated last month
- Adapting a ConvNeXt model to audio classification on AudioSet☆17Updated 11 months ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆46Updated 2 years ago
- Code for running the WARP-Q speech quality metric.☆34Updated last year
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆38Updated last year
- For students who would like to apply for RA, PhD, postdoc in audio research.☆22Updated 11 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆42Updated 2 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆29Updated 8 months ago
- A library built for easier audio self-supervised training and downstream task evaluation☆92Updated 3 weeks ago
- [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model☆90Updated this week