fschmid56/EfficientAT_HEAR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fschmid56/EfficientAT_HEAR)

fschmid56 / EfficientAT_HEAR

Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.

☆34

Alternatives and similar repositories for EfficientAT_HEAR

Users that are interested in EfficientAT_HEAR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kkoutini / passt_hear21
View on GitHub
Inference code for PaSST, using the HEAR API.
☆35Jan 2, 2024Updated 2 years ago
fschmid56 / EfficientAT
View on GitHub
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …
☆353Nov 20, 2024Updated last year
fschmid56 / cpjku_dcase23
View on GitHub
This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"
☆32Sep 18, 2023Updated 2 years ago
OptimusPrimus / tacos
View on GitHub
Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
☆16Oct 12, 2025Updated 9 months ago
CPJKU / cpjku_dcase22
View on GitHub
☆19Jul 15, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
merlresearch / sebbs
View on GitHub
Prediction of sound event bounding boxes (SEBBs)
☆35Aug 2, 2024Updated last year
kkoutini / PaSST
View on GitHub
Efficient Training of Audio Transformers with Patchout
☆386Jan 12, 2024Updated 2 years ago
b-sigpro / sed-hsmm
View on GitHub
Onset-and-Offset-Aware Sound Event Detection
☆21Feb 10, 2025Updated last year
RicherMans / CED
View on GitHub
Source code for Consistent ensemble distillation for audio tagging
☆75Mar 20, 2026Updated 4 months ago
CPJKU / cpjku_dcase24
View on GitHub
☆29Oct 17, 2024Updated last year
hearbenchmark / hear-eval-kit
View on GitHub
Evaluation kit for the HEAR Benchmark
☆65Feb 12, 2026Updated 5 months ago
theMoro / DIRAugmentation
View on GitHub
Improving Recording Device Generalization using Impulse Response Augmentation
☆21Apr 24, 2025Updated last year
theMoro / EfficientSED
View on GitHub
☆22Jun 12, 2025Updated last year
Audio-WestlakeU / audiossl
View on GitHub
A library built for easier audio self-supervised training, downstream tasks evaluation
☆140Sep 25, 2025Updated 10 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
ilaria-manco / mulap
View on GitHub
Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)
☆47Dec 3, 2024Updated last year
Dream-High / DJCM
View on GitHub
☆30Apr 22, 2024Updated 2 years ago
minzwon / semi-supervised-music-tagging-transformer
View on GitHub
☆99Nov 25, 2021Updated 4 years ago
xjuspeech / YOLOPitch
View on GitHub
☆10Jun 11, 2024Updated 2 years ago
GasserElbanna / serab-byols
View on GitHub
(Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.
☆27Apr 20, 2024Updated 2 years ago
seungheondoh / music-text-representation-pp
View on GitHub
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]
☆43Oct 7, 2024Updated last year
groupmm / libf0
View on GitHub
A Python Library for Fundamental Frequency Estimation in Music Recordings
☆55Jun 5, 2026Updated last month
seungheondoh / musical-word-embedding
View on GitHub
Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]
☆29Apr 23, 2024Updated 2 years ago
prompteus / audio-captioning
View on GitHub
Audio captioning - DCASE challenge 2023 task 6a
☆30Dec 26, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
CarlWangChina / SongDriver-Real-time-Music-Accompaniment-Generation-without-Logical-Latency-nor-Exposure-Bias
View on GitHub
SongDriver uses a parallel mechanism of prediction and arrangement phases to achieve zero logical latency in real-time accompaniment gene…
☆16Jan 5, 2026Updated 6 months ago
CPJKU / dcase2024_task1_baseline
View on GitHub
☆10Jun 6, 2024Updated 2 years ago
ETH-DISCO / discoder
View on GitHub
Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025
☆42Feb 24, 2025Updated last year
JuanFMontesinos / VoViT
View on GitHub
VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer
☆35Mar 18, 2023Updated 3 years ago
BUTSpeechFIT / SOT-DiCoW
View on GitHub
Multi-talker ASR based on DiCoW with Serialized Output Training
☆20Sep 18, 2025Updated 10 months ago
kyegomez / Audio-xLSTMs
View on GitHub
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆20Updated this week
gabolsgabs / cunet
View on GitHub
Control mechanisms to the U-Net architecture for doing multiple source separation instruments
☆55Jun 1, 2020Updated 6 years ago
zengchang233 / CrossSinger
View on GitHub
The source code for the paper CrossSinger (asru2023)
☆18Oct 12, 2023Updated 2 years ago
holken / polite
View on GitHub
code for polite
☆12Feb 28, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
XZWY / MSLDM
View on GitHub
Implementation of Multi-Source Music Generation with Latent Diffusion.
☆29Sep 12, 2024Updated last year
nttcslab / msm-mae
View on GitHub
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
☆99Feb 20, 2026Updated 5 months ago
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10Dec 15, 2022Updated 3 years ago
rodrigo-castellon / jukemirlib
View on GitHub
A simple library for extracting representations from Jukebox
☆39Nov 16, 2025Updated 8 months ago
sungnyun / ARMHuBERT
View on GitHub
(Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT
☆41Aug 29, 2024Updated last year
google-deepmind / slowfast_nfnets
View on GitHub
☆30Jun 22, 2022Updated 4 years ago
mdx-tutorial / mdx-tutorial.github.io
View on GitHub
Tutorial covering Open Source tools for Source Separation.
☆15Nov 12, 2021Updated 4 years ago