nttcslab/eval-audio-repr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nttcslab/eval-audio-repr)

nttcslab / eval-audio-repr

EVAR ~ Evaluation package for Audio Representations

☆81

Alternatives and similar repositories for eval-audio-repr

Users that are interested in eval-audio-repr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nttcslab / m2d
View on GitHub
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
☆162Feb 23, 2026Updated 5 months ago
nttcslab / byol-a
View on GitHub
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
☆237Apr 26, 2023Updated 3 years ago
nttcslab / msm-mae
View on GitHub
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
☆99Feb 20, 2026Updated 5 months ago
nttcslab / dcase2023_task2_evaluator
View on GitHub
☆12Aug 10, 2023Updated 2 years ago
evelyn0414 / OPERA
View on GitHub
This is the official code release for OPERA: OPEn Respiratory Acoustic foundation models
☆83Mar 11, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
torchopenl3 / torchopenl3
View on GitHub
☆20Aug 26, 2022Updated 3 years ago
SonyCSLParis / Stem-JEPA
View on GitHub
Joint Embedding Predictive Architecture for Musical Stem Compatibility Estimation
☆55Aug 6, 2024Updated last year
Torabiy / HLS-CMDS
View on GitHub
Heart and Lung Sounds Dataset Recorded from a Clinical Manikin using Digital Stethoscope (HLS-CMDS)
☆19May 13, 2026Updated 2 months ago
jimbozhang / xares
View on GitHub
A benchmark for evaluating audio encoders on various audio tasks.
☆55Apr 27, 2026Updated 2 months ago
Benjamin-Walker / heart-murmur-detection
View on GitHub
Dual Bayesian ResNet: A Deep Learning Approach to Heart Murmur Detection (Physionet Challenge 2022)
☆23Oct 1, 2025Updated 9 months ago
fschmid56 / PretrainedSED
View on GitHub
☆145May 13, 2025Updated last year
Sara-Ahmed / ASiT
View on GitHub
ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation
☆30Mar 10, 2024Updated 2 years ago
Neclow / SERAB
View on GitHub
SERAB: a multi-lingual benchmark for speech emotion recognition
☆28Dec 16, 2022Updated 3 years ago
yluo42 / SRVQ
View on GitHub
Spherical residual vector quantization (SRVQ)
☆31Aug 25, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
RicherMans / Dasheng
View on GitHub
Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"
☆86Nov 7, 2025Updated 8 months ago
xiquan-li / TinyMU
View on GitHub
[ICASSP 2026] TinyMU: A Compact Audio Language Model for Music Understanding
☆36Apr 20, 2026Updated 3 months ago
facebookresearch / AudioMAE
View on GitHub
This repo hosts the code and models of "Masked Autoencoders that Listen".
☆673Apr 5, 2024Updated 2 years ago
LudovicTuncay / Audio-JEPA
View on GitHub
Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil…
☆65Jul 16, 2026Updated last week
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
nttcslab / composing-general-audio-repr
View on GitHub
Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model
☆26Apr 26, 2023Updated 3 years ago
hearbenchmark / hear-eval-kit
View on GitHub
Evaluation kit for the HEAR Benchmark
☆65Feb 12, 2026Updated 5 months ago
habla-liaa / encodecmae
View on GitHub
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
☆101Jul 24, 2024Updated 2 years ago
antonioalmudevar / dcase2022_task2
View on GitHub
☆11Jul 6, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
line / WaveTrainerFit
View on GitHub
Official implementation of "Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech G…
☆16Feb 6, 2026Updated 5 months ago
johnmartinsson / differentiable-mel-spectrogram
View on GitHub
The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …
☆24Dec 21, 2024Updated last year
ta012 / DTFAT
View on GitHub
[AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification
☆12Mar 10, 2025Updated last year
nervjack2 / Speech2Unit
View on GitHub
☆13Sep 25, 2024Updated last year
fakufaku / torchiva
View on GitHub
Blind source separation with independent vector analysis family of algorithm in torch
☆108Jan 30, 2023Updated 3 years ago
SonyCSLParis / audio-representations
View on GitHub
JEPAs for audio representation learning
☆26Jun 11, 2026Updated last month
qiuqiangkong / audioflow
View on GitHub
☆130Updated this week
barisbozkurt / MASTmelody_dataset
View on GitHub
A dataset of pitch curves for music performance assessment
☆11Jun 5, 2023Updated 3 years ago
kaen2891 / adversarial_fine-tuning_using_generated_respiratory_sound
View on GitHub
(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…
☆19Dec 5, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Wataru-Nakata / latentlm-tts
View on GitHub
☆29Jul 3, 2026Updated 3 weeks ago
fgnt / sed_scores_eval
View on GitHub
☆41Feb 18, 2026Updated 5 months ago
SarthakYadav / axlstm-official
View on GitHub
Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"
☆21Sep 7, 2025Updated 10 months ago
AlanBaade / MAE-AST-Public
View on GitHub
Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer
☆93Jun 9, 2022Updated 4 years ago
unilight / jatts
View on GitHub
JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit
☆43Mar 13, 2026Updated 4 months ago
CVxTz / COLA_pytorch
View on GitHub
COLA contrastive pre-training method implemented in PyTorch
☆44Jan 27, 2021Updated 5 years ago
kaen2891 / stethoscope-guided_supervised_contrastive_learning
View on GitHub
(ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domin Adaptation on Respiratory So…
☆18Dec 5, 2024Updated last year