theMoro/DIRAugmentation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/theMoro/DIRAugmentation)

theMoro / DIRAugmentation

Improving Recording Device Generalization using Impulse Response Augmentation

☆21

Alternatives and similar repositories for DIRAugmentation

Users that are interested in DIRAugmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

b-sigpro / sed-hsmm
View on GitHub
Onset-and-Offset-Aware Sound Event Detection
☆21Feb 10, 2025Updated last year
nttcslab / dcase2025_task4_baseline
View on GitHub
☆18Apr 16, 2026Updated 3 months ago
fschmid56 / cpjku_dcase23
View on GitHub
This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"
☆32Sep 18, 2023Updated 2 years ago
CPJKU / cpjku_dcase24
View on GitHub
☆29Oct 17, 2024Updated last year
OptimusPrimus / tacos
View on GitHub
Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
☆16Oct 12, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kkoutini / passt_hear21
View on GitHub
Inference code for PaSST, using the HEAR API.
☆35Jan 2, 2024Updated 2 years ago
theMoro / EfficientSED
View on GitHub
☆22Jun 12, 2025Updated last year
fschmid56 / EfficientAT_HEAR
View on GitHub
Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.
☆34Jun 23, 2023Updated 3 years ago
kinggongzilla / DCASE2023_Task2
View on GitHub
☆23May 15, 2023Updated 3 years ago
merlresearch / sebbs
View on GitHub
Prediction of sound event bounding boxes (SEBBs)
☆35Aug 2, 2024Updated last year
fschmid56 / EfficientAT
View on GitHub
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …
☆353Nov 20, 2024Updated last year
freds0 / data_augmentation_for_asr
View on GitHub
A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.
☆49Oct 15, 2021Updated 4 years ago
kkoutini / cpjku_dcase20
View on GitHub
CP-JKU submission to DCASE 20
☆44Apr 19, 2021Updated 5 years ago
CPJKU / dcase2024_task1_baseline
View on GitHub
☆10Jun 6, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
kkoutini / PaSST
View on GitHub
Efficient Training of Audio Transformers with Patchout
☆386Jan 12, 2024Updated 2 years ago
glory20h / FitHuBERT
View on GitHub
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)
☆19Nov 15, 2023Updated 2 years ago
danielgomezmarin / rhythmtoolbox
View on GitHub
Python code used to analyze and process symbolic drum patterns
☆14May 8, 2023Updated 3 years ago
jh-jeong / smoothmix
View on GitHub
Code for the paper "SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness" (NeurIPS 2021)
☆21Sep 27, 2022Updated 3 years ago
tarepan / Scyclone-PyTorch
View on GitHub
Reproduction of "Scyclone" with PyTorch
☆16Jan 6, 2021Updated 5 years ago
RicherMans / CED
View on GitHub
Source code for Consistent ensemble distillation for audio tagging
☆75Mar 20, 2026Updated 4 months ago
felixgontier / dcase-2023-baseline
View on GitHub
☆14Mar 25, 2023Updated 3 years ago
atosystem / SSL_Interface
View on GitHub
Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024
☆16Nov 19, 2024Updated last year
yyf17 / SAAVN
View on GitHub
SAAVN Code release for paper "Sound Adversarial Audio-Visual Navigation,ICLR2022" (In PyTorch)
☆21Nov 9, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
yangdongchao / Target-sound-event-detection
View on GitHub
The source code for target sound detection
☆15Feb 26, 2022Updated 4 years ago
RicherMans / Dasheng
View on GitHub
Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"
☆86Nov 7, 2025Updated 8 months ago
fschmid56 / PretrainedSED
View on GitHub
☆144May 13, 2025Updated last year
msaadsaeed / FOP
View on GitHub
Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"
☆23Dec 31, 2025Updated 6 months ago
Audio-WestlakeU / audiossl
View on GitHub
A library built for easier audio self-supervised training, downstream tasks evaluation
☆140Sep 25, 2025Updated 9 months ago
nikolakopoulos / Personalized-Diffusions
View on GitHub
Personalized Item Exploration Processes for Recommendation
☆15Sep 19, 2019Updated 6 years ago
osimarr / nary-tree
View on GitHub
A vec-backed tree structure with tree-specific generational indexes.
☆20Aug 3, 2025Updated 11 months ago
CPJKU / beat_this_annotations
View on GitHub
Beat annotations for the beat tracker Beat This!
☆14Mar 2, 2026Updated 4 months ago
SDNNetSim / FUSION
View on GitHub
FUSION is an open-source project aimed at revolutionizing networking through the simulation of advanced SD-EONs and AI-enhanced networks,…
☆15Jun 23, 2026Updated 3 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yqcai888 / easy_dcase_task1
View on GitHub
This repository provides an easy way to train your models on the datasets of DCASE task 1.
☆20May 28, 2025Updated last year
Vaibhavs10 / dcase-2023-workshop
View on GitHub
☆14Sep 20, 2023Updated 2 years ago
behzadhaki / GrooveTransformer
View on GitHub
Variational version of Monotonic Groove Transformer
☆19Sep 7, 2023Updated 2 years ago
Audio-AGI / dcase2024_task9_baseline
View on GitHub
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26Mar 27, 2024Updated 2 years ago
monome / monome-max-package
View on GitHub
☆16Jul 10, 2025Updated last year
mavceleb / mavceleb_baseline
View on GitHub
☆11Nov 5, 2025Updated 8 months ago
MTG / discogs-vi-dataset
View on GitHub
Discogs-VI dataset and code
☆21Dec 13, 2024Updated last year