Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model
☆26Apr 26, 2023Updated 2 years ago
Alternatives and similar repositories for composing-general-audio-repr
Users that are interested in composing-general-audio-repr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)☆31Feb 4, 2024Updated 2 years ago
- EVAR ~ Evaluation package for Audio Representations☆75Feb 19, 2026Updated last month
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Apr 20, 2024Updated last year
- MobileNetV2-based baseline system for DCASE2021 Challenge Task 2.☆24Jun 9, 2021Updated 4 years ago
- Lightweight GANを用いてラグナロクオンラインのキャラクター画像を生成するGAN☆12May 13, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Oct 2, 2020Updated 5 years ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆229Apr 26, 2023Updated 2 years ago
- Keras/Pytorch neural network size, operations and parameters counter☆16Mar 23, 2023Updated 3 years ago
- ☆16Dec 7, 2022Updated 3 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Dec 30, 2017Updated 8 years ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆15Apr 29, 2025Updated 10 months ago
- A lightwight Framework for the Respiratory Sound Classification☆11Feb 12, 2025Updated last year
- Chainer implementation of between-class learning for sound recognition https://arxiv.org/abs/1711.10282☆96Mar 27, 2018Updated 7 years ago
- [CVPR 2023]Implementation of Siamese Image Modeling for Self-Supervised Vision Representation Learning☆41Jun 6, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Voice Analysis Pipeline for DigiPsych Lab☆10Sep 15, 2019Updated 6 years ago
- Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)☆73Mar 11, 2025Updated last year
- Pytorch implementation of additive margin softmax loss☆12Aug 5, 2021Updated 4 years ago
- ToyADMOS2: Another dataset of miniature-machine operating sounds for anomalous sound detection under domain shift conditions 🚗 🚃☆21Apr 16, 2024Updated last year
- An original package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆14Oct 27, 2024Updated last year
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- ☆14Aug 30, 2022Updated 3 years ago
- GitHub repo for FSE 2021 Paper - ``Bias in Machine Learning Software: Why? How? What to do?''☆16May 7, 2022Updated 3 years ago
- ☆13Mar 10, 2023Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Open-source audio embedding models, submitted to the HEAR 2021 challenge☆11Feb 15, 2026Updated last month
- ☆14Sep 20, 2023Updated 2 years ago
- ☆14Nov 22, 2022Updated 3 years ago
- Autoencoder-based baseline system for DCASE2021 Challenge Task 2.☆27Jun 9, 2021Updated 4 years ago
- ☆17Nov 15, 2021Updated 4 years ago
- ☆13Feb 26, 2024Updated 2 years ago
- ☆20Aug 19, 2021Updated 4 years ago
- ☆10Apr 22, 2016Updated 9 years ago
- Ensemble of Exemplar-SVMs for Object Detection and Beyond☆21Mar 24, 2014Updated 12 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆12Jun 9, 2025Updated 9 months ago
- Multi-lingual AudioCaps☆12Nov 20, 2023Updated 2 years ago
- The code for DCASE2021 task5 submission.☆20Feb 21, 2022Updated 4 years ago
- This repository includes the code to reproduce our paper "Raw Differentiable Architecture Search for Speech Deepfake and Spoofing Detecti…☆11Jul 11, 2023Updated 2 years ago
- Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICC…☆15Nov 5, 2023Updated 2 years ago
- Deep Speech Distances PyTorch☆29Feb 21, 2022Updated 4 years ago
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆23Aug 20, 2023Updated 2 years ago