yqcai888/easy_dcase_task1

This repository provides an easy way to train your models on the datasets of DCASE task 1.

☆20

Alternatives and similar repositories for easy_dcase_task1

Users that are interested in easy_dcase_task1 are comparing it to the libraries listed below.

CPJKU / dcase2024_task1_baseline
☆10 · Jun 6, 2024 · Updated 2 years ago
yqcai888 / DCASE2023
2022 DCASE Challenge
☆14 · Sep 30, 2024 · Updated last year
Dahan-Wang / Adaptive-Convolution-for-CNN-based-Speech-Enhancement-Models
☆16 · Feb 22, 2025 · Updated last year
yhsong06 / LAU-Net
☆16 · May 23, 2025 · Updated last year
fschmid56 / cpjku_dcase23
This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"
☆32 · Sep 18, 2023 · Updated 2 years ago
OptimusPrimus / tacos
Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
☆16 · Oct 12, 2025 · Updated 9 months ago
SRPOL-AUI / spectrum-correction
Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"
☆13 · Feb 22, 2022 · Updated 4 years ago
rfalcon100 / seld_dcase2022_ric
My system for the DCASE 2022 Task 3 Sound Event Localization and Detection.
☆12 · Nov 12, 2022 · Updated 3 years ago
jsvir / vad
[Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection
☆40 · Mar 24, 2025 · Updated last year
Qualcomm-AI-research / bcresnet
☆100 · May 31, 2023 · Updated 3 years ago
DCASE2024-Task7-Sound-Scene-Synthesis / AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
☆14 · Mar 27, 2024 · Updated 2 years ago
b-sigpro / sed-hsmm
Onset-and-Offset-Aware Sound Event Detection
☆21 · Feb 10, 2025 · Updated last year
RonFrancesca / dcase2020-fp
☆10 · Jun 26, 2020 · Updated 6 years ago
nttcslab / dcase2025_task4_baseline
☆18 · Apr 16, 2026 · Updated 3 months ago
denfed / wave-spec-fusion
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…
☆16 · Aug 9, 2021 · Updated 4 years ago
jnwnlee / video-foley
Official implementation of "Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound". IEEE TASLP 20…
☆19 · Feb 27, 2026 · Updated 5 months ago
SpoaLove / XJTLU_ICSY2
Everything ICS Y2: HW, notes, and other stuff
☆12 · Dec 9, 2022 · Updated 3 years ago
ZhaoZeyu1995 / BenNevis
A Differentiable WFST-based End-to-End Automatic Speech Recognition toolkit with flexible topology support
☆12 · Feb 15, 2026 · Updated 5 months ago
CPJKU / cpjku_dcase22
☆19 · Jul 15, 2022 · Updated 4 years ago
theMoro / DIRAugmentation
Improving Recording Device Generalization using Impulse Response Augmentation
☆21 · Apr 24, 2025 · Updated last year
audiolabs / PESQ
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band) - including P.862 Corrigendum 2 (03/…
☆23 · May 27, 2025 · Updated last year
v-iashin / Synchformer
Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024)
☆130 · Sep 15, 2025 · Updated 10 months ago
theMoro / EfficientSED
☆22 · Jun 12, 2025 · Updated last year
mad-lab-fau / tpcp
Pipeline and Dataset helpers for complex algorithm evaluation.
☆20 · Updated this week
bootphon / sustained-phonation-features
Python package for the extraction of speech features for sustained phonation
☆12 · Aug 10, 2020 · Updated 5 years ago
pyvista / pyvistaqt-exe
Create a Windows installable exe from a PyVistaQt application
☆16 · Jul 13, 2026 · Updated 2 weeks ago
robflynnyh / long-context-asr
Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆11 · Jul 3, 2026 · Updated 3 weeks ago
elianap / divexplorer
☆11 · May 5, 2022 · Updated 4 years ago
Engineev / solutions
My personal solutions to some textbook problems
☆12 · Feb 12, 2020 · Updated 6 years ago
fschmid56 / EfficientAT_HEAR
Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.
☆34 · Jun 23, 2023 · Updated 3 years ago
CPJKU / cpjku_dcase24
☆29 · Oct 17, 2024 · Updated last year
zzpDapeng / Transformer-Transducer
A streamable speech recognition model with transformer encoders and RNN-T loss
☆11 · Mar 1, 2021 · Updated 5 years ago
qiuqiangkong / dcase2019_task3
☆16 · Apr 11, 2019 · Updated 7 years ago
wilkinghoff / DCASE2023_task2
Submission for task 2 "First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring" of the DCASE challenge 2023 (h…
☆18 · May 22, 2023 · Updated 3 years ago
bene-ges / nemo_compatible
Useful things that work with the NVIDIA NeMo library
☆14 · Jan 20, 2024 · Updated 2 years ago
xavierfav / feature-comparison-clustering
Comparing Audio Features for Unsupervised Sound Classification
☆10 · Jun 22, 2022 · Updated 4 years ago
lugan113 / SynTTS-Commands-Official
SynTTS-Commands is a large-scale, multilingual (English & Chinese) synthetic speech command dataset designed for low-power Keyword Spotti…
☆17 · Feb 5, 2026 · Updated 5 months ago
KathyReid / cvaccents
A set of tools for working with accent data in Mozilla's Common Voice dataset
☆14 · Nov 3, 2023 · Updated 2 years ago
snap-research / GenAU
☆53 · Mar 24, 2026 · Updated 4 months ago