fschmid56 / EfficientAT_HEAR
Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.
☆25 · Updated last year
Alternatives and similar repositories for EfficientAT_HEAR
Users who are interested in EfficientAT_HEAR are comparing it to the repositories listed below:
- Repo for source code of EBEN: Extreme Bandwidth Extension Network ☆72 · Updated last month
- ☆82 · Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆46 · Updated 5 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation" ☆24 · Updated 11 months ago
- experiments about AudioSet ☆44 · Updated last year
- Learning differentiable temporal resolution on time-series data. ☆36 · Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆36 · Updated last year
- Inference code for PaSST, using the HEAR API. ☆31 · Updated last year
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" ☆13 · Updated 5 months ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations" ☆35 · Updated 9 months ago
- [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model ☆115 · Updated 4 months ago
- Adapting a ConvNeXt model to audio classification on AudioSet ☆22 · Updated 2 weeks ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆26 · Updated 2 months ago
- EVAR ~ Evaluation package for Audio Representations ☆46 · Updated 4 months ago
- ☆33 · Updated last year
- A Diffusion Probabilistic Model for Target Sound Extraction ☆36 · Updated 5 months ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986 ☆42 · Updated 4 months ago
- ☆43 · Updated 8 months ago
- Streaming Audiotransformers for online Audio tagging ☆43 · Updated 8 months ago
- SAMO: Speaker Attractor Multi-Center One-Class Learning for Voice Anti-Spoofing ☆37 · Updated last year
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer ☆87 · Updated 2 years ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024) ☆37 · Updated 8 months ago
- PAM is a no-reference audio quality metric for audio generation tasks ☆57 · Updated 7 months ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022) ☆46 · Updated 3 months ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation ☆41 · Updated 2 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆19 · Updated last year
- ☆51 · Updated last year
- ☆13 · Updated last year
- For students who would like to apply for RA, PhD, postdoc in audio research. ☆24 · Updated 4 months ago