nttcslab/dcase2025_task4_baseline

☆18

Alternatives and similar repositories for dcase2025_task4_baseline

Users who are interested in dcase2025_task4_baseline are comparing it to the repositories listed below.

b-sigpro / sed-hsmm
Onset-and-Offset-Aware Sound Event Detection
☆21 · Feb 10, 2025 · Updated last year
fschmid56 / cpjku_dcase23
This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"
☆32 · Sep 18, 2023 · Updated 2 years ago
theMoro / DIRAugmentation
Improving Recording Device Generalization using Impulse Response Augmentation
☆21 · Apr 24, 2025 · Updated last year
donghoney0416 / DeepASA
Official page of "DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis"
☆26 · Apr 15, 2026 · Updated 3 months ago
CPJKU / cpjku_dcase24
☆29 · Oct 17, 2024 · Updated last year
nttrd-mdlab / wearable-seld-dataset
☆10 · Feb 18, 2022 · Updated 4 years ago
muuda / MFF-EINV2
MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection
☆22 · Jul 17, 2024 · Updated 2 years ago
jiwonix / Sound-Event-Detection-papers
Sound Event Detection (SED) paper collection
☆15 · Jun 26, 2024 · Updated 2 years ago
frednam93 / MDFD-SED
☆21 · Mar 6, 2025 · Updated last year
SmartSoundKAIST / 6DRIR-DL
6 DoF Directional Room Impulse Response (RIR) with Dense Loudspeaker Grid
☆17 · Aug 31, 2023 · Updated 2 years ago
Orlllem / seld_wav2vec2
☆18 · Feb 1, 2026 · Updated 5 months ago
Aisaka0v0 / CLAPSep
Query-conditioned target sound extraction model
☆30 · Mar 25, 2025 · Updated last year
fschmid56 / PretrainedSED
☆144 · May 13, 2025 · Updated last year
apple-yinhan / TQ-SED
☆23 · Mar 19, 2025 · Updated last year
merlresearch / sebbs
Prediction of sound event bounding boxes (SEBBs)
☆35 · Aug 2, 2024 · Updated last year
cai525 / Transformer4SED
This repository aims to collect Transformer-based sound event detection (SED) algorithms.
☆104 · Feb 10, 2026 · Updated 5 months ago
Phuriches / GenRepASD
PyTorch implementation of Deep Generic Representations for Domain-Generalized Anomalous Sound Detection: https://arxiv.org/abs/2409.05035
☆28 · Mar 16, 2025 · Updated last year
kyamauchi1023 / PL-BERT-ja
A repository of Japanese Phoneme-Level BERT
☆24 · Dec 16, 2023 · Updated 2 years ago
wilkinghoff / ssl4asd
Code for the paper "Self-Supervised Learning for Anomalous Sound Detection"
☆43 · May 13, 2024 · Updated 2 years ago
merlresearch / reverberation-as-supervision
Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation
☆15 · Aug 1, 2024 · Updated last year
korakoe / VALL-E-X
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
☆16 · Apr 18, 2024 · Updated 2 years ago
CPJKU / dcase2024_task1_baseline
☆10 · Jun 6, 2024 · Updated 2 years ago
Jinbo-Hu / SELD-Data-Generator
Data generator for sound event localization and detection clips, including 4-ch microphone-array-format signals and first-order-ambisonic…
☆22 · Nov 13, 2024 · Updated last year
Audio-WestlakeU / UMA-ASR
This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).
☆35 · Dec 17, 2024 · Updated last year
Audio-WestlakeU / audiossl
A library built for easier audio self-supervised training and downstream task evaluation
☆140 · Sep 25, 2025 · Updated 9 months ago
sony / CLIPSep
☆43 · Feb 21, 2023 · Updated 3 years ago
OptimusPrimus / tacos
Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
☆16 · Oct 12, 2025 · Updated 9 months ago
Audio-WestlakeU / RVAE-EM
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…
☆51 · Mar 6, 2025 · Updated last year
SUSTech-HPCLab / CS305-2025Spring-FinalProject
☆11 · Jun 3, 2025 · Updated last year
iclr2024mcmi / ICLRMCMI
Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information
☆12 · Sep 28, 2023 · Updated 2 years ago
Harper812 / FFDConv
Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection
☆27 · May 13, 2026 · Updated 2 months ago
etzinis / optimal_condition_training
Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…
☆14 · Feb 15, 2023 · Updated 3 years ago
popcornell / FastMSS
☆32 · May 18, 2026 · Updated 2 months ago
kohei0209 / self-remixing
Official implementation of Self-Remixing
☆18 · Feb 3, 2024 · Updated 2 years ago
Audio-WestlakeU / VINP
Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverb…
☆36 · Feb 23, 2026 · Updated 4 months ago
nobutaka-ito / pulse
Official PyTorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)
☆43 · Jul 24, 2023 · Updated 2 years ago
cpystan / PSM
Exploring Unsupervised Cell Recognition with Prior Self-activation Maps (MICCAI 2023)
☆13 · Oct 27, 2023 · Updated 2 years ago
JusperLee / SPMamba
☆227 · Dec 5, 2024 · Updated last year
modern-hobbyist / aesir
Collection of KiCad PCB designs for custom keyboards, including full-sized and split ergonomic layouts with STM32F072CBT6 microcontroller…
☆25 · Jan 11, 2026 · Updated 6 months ago