nttcslab/composing-general-audio-repr

Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model

☆26

Alternatives and similar repositories for composing-general-audio-repr

Users who are interested in composing-general-audio-repr are comparing it to the libraries listed below.

nttcslab / Generalized-Domain-Adaptation
☆12 · Jun 18, 2021 · Updated 5 years ago
Torabiy / HLS-CMDS
Heart and Lung Sounds Dataset Recorded from a Clinical Manikin using Digital Stethoscope (HLS-CMDS)
☆19 · May 13, 2026 · Updated 2 months ago
ilyassmoummad / scl_icbhi2017
PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)
☆33 · Feb 4, 2024 · Updated 2 years ago
GasserElbanna / serab-byols
(Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.
☆27 · Apr 20, 2024 · Updated 2 years ago
TaikiMiyagawa / SPRT-TANDEM
Tensorflow 2.0.0 implementation of SPRT-TANDEM
☆13 · Jun 21, 2022 · Updated 4 years ago
y-kawagu / dcase2021_task2_baseline_mobile_net_v2
MobileNetV2-based baseline system for DCASE2021 Challenge Task 2.
☆25 · Jun 9, 2021 · Updated 5 years ago
nttcslab / eval-audio-repr
EVAR ~ Evaluation package for Audio Representations
☆81 · Feb 19, 2026 · Updated 5 months ago
midas-research / speechmix
☆12 · Oct 2, 2020 · Updated 5 years ago
AlbertoAncilotto / NeSsi
Keras/Pytorch neural network size, operations and parameters counter
☆16 · Mar 23, 2023 · Updated 3 years ago
Kazuhito00 / RO-GAN-using-Lightweight-GAN
A GAN that generates Ragnarok Online character images using Lightweight GAN
☆12 · May 13, 2021 · Updated 5 years ago
nttcslab / byol-a
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
☆237 · Apr 26, 2023 · Updated 3 years ago
JinhuaLiang / LaD-ProtoNet
☆16 · Sep 14, 2023 · Updated 2 years ago
SarthakYadav / axlstm-official
Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"
☆21 · Sep 7, 2025 · Updated 10 months ago
g9nkn / p3p_problem
An official MATLAB implementation of the paper "A Simple Direct Solution to the Perspective-Three-Point Problem", BMVC2019.
☆14 · Oct 1, 2021 · Updated 4 years ago
ta012 / DTFAT
[AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification
☆12 · Mar 10, 2025 · Updated last year
AMLAB-Wakayama / gammachirp-filterbank
An original package of the dynamic compressive gammachirp filterbank (dcGC-FB)
☆14 · Jul 7, 2026 · Updated 2 weeks ago
lijuncheng16 / AudioTaggingDoneRight
experiments about AudioSet
☆43 · Jul 22, 2023 · Updated 3 years ago
mil-tokyo / bc_learning_sound
Chainer implementation of between-class learning for sound recognition https://arxiv.org/abs/1711.10282
☆95 · Mar 27, 2018 · Updated 8 years ago
evelyn0414 / OPERA
This is the official code release for OPERA: OPEn Respiratory Acoustic foundation models
☆83 · Mar 11, 2025 · Updated last year
raymin0223 / patch-mix_contrastive_learning
Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)
☆76 · Mar 11, 2025 · Updated last year
OpenGVLab / Siamese-Image-Modeling
[CVPR 2023] Implementation of Siamese Image Modeling for Self-Supervised Vision Representation Learning
☆41 · Jun 6, 2024 · Updated 2 years ago
nttcslab / ToyADMOS2-dataset
ToyADMOS2: Another dataset of miniature-machine operating sounds for anomalous sound detection under domain shift conditions 🚗 🚃
☆21 · Apr 16, 2024 · Updated 2 years ago
RicherMans / PSL
Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"
☆31 · Apr 29, 2022 · Updated 4 years ago
Hadryan / TFNet-for-Environmental-Sound-Classification
Learning discriminative and robust time-frequency representations for environmental sound classification: Convolutional neural networks (…
☆31 · Dec 19, 2019 · Updated 6 years ago
joymallyac / Fair-SMOTE
GitHub repo for FSE 2021 Paper - "Bias in Machine Learning Software: Why? How? What to do?"
☆17 · May 7, 2022 · Updated 4 years ago
lucidrains / CLAP
Contrastive Language-Audio Pretraining
☆15 · May 18, 2021 · Updated 5 years ago
sungnyun / cav2vec
(ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
☆16 · Apr 29, 2025 · Updated last year
hearbenchmark / hear2021-submitted-models
Open-source audio embedding models, submitted to the HEAR 2021 challenge
☆11 · Feb 15, 2026 · Updated 5 months ago
haoheliu / ontology-aware-audio-tagging
☆14 · Nov 22, 2022 · Updated 3 years ago
aeesha-T / parkinsons_prediction_using_speech
☆18 · Nov 15, 2021 · Updated 4 years ago
MGitHubL / TMac
☆14 · Feb 26, 2024 · Updated 2 years ago
y-kawagu / dcase2021_task2_baseline_ae
Autoencoder-based baseline system for DCASE2021 Challenge Task 2.
☆27 · Jun 9, 2021 · Updated 5 years ago
dzryk / cliptalk
☆19 · Aug 19, 2021 · Updated 4 years ago
Vaibhavs10 / dcase-2023-workshop
☆14 · Sep 20, 2023 · Updated 2 years ago
Kazuhito00 / simple-virtual-mouse-using-mediapipe
A program for simple mouse control via hand gestures using MediaPipe.
☆12 · Mar 17, 2021 · Updated 5 years ago
Zzzzz1 / CSKD
Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICC…
☆15 · Nov 5, 2023 · Updated 2 years ago
eurecom-asp / raw-pc-darts-anti-spoofing
This repository includes the code to reproduce our paper "Raw Differentiable Architecture Search for Speech Deepfake and Spoofing Detecti…
☆11 · Jul 11, 2023 · Updated 3 years ago
yangdongchao / DCASE2021Task5
The code for DCASE2021 task5 submission.
☆20 · Feb 21, 2022 · Updated 4 years ago
AndreevP / speech_distances
Deep Speech Distances PyTorch
☆29 · Feb 21, 2022 · Updated 4 years ago