theMoro/EfficientSED

☆22

Alternatives and similar repositories for EfficientSED

Users who are interested in EfficientSED are comparing it to the repositories listed below.

fschmid56 / PretrainedSED
☆144 · May 13, 2025 · Updated last year
b-sigpro / sed-hsmm
Onset-and-Offset-Aware Sound Event Detection
☆21 · Feb 10, 2025 · Updated last year
OptimusPrimus / tacos
Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
☆16 · Oct 12, 2025 · Updated 9 months ago
fschmid56 / cpjku_dcase23
This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"
☆32 · Sep 18, 2023 · Updated 2 years ago
cai525 / Transformer4SED
This repository aims to collect Transformer-based sound event detection (SED) algorithms.
☆104 · Feb 10, 2026 · Updated 5 months ago
theMoro / DIRAugmentation
Improving Recording Device Generalization using Impulse Response Augmentation
☆21 · Apr 24, 2025 · Updated last year
RicherMans / CED
Source code for Consistent ensemble distillation for audio tagging
☆75 · Mar 20, 2026 · Updated 4 months ago
CPJKU / cpjku_dcase24
☆29 · Oct 17, 2024 · Updated last year
daihuangyu / speex_aec_kf
speex aec kalman filter
☆15 · Mar 17, 2024 · Updated 2 years ago
fschmid56 / EfficientAT_HEAR
Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.
☆34 · Jun 23, 2023 · Updated 3 years ago
merlresearch / sebbs
Prediction of sound event bounding boxes (SEBBs)
☆35 · Aug 2, 2024 · Updated last year
fgnt / sed_scores_eval
☆41 · Feb 18, 2026 · Updated 5 months ago
CPJKU / dcase2024_task1_baseline
☆10 · Jun 6, 2024 · Updated 2 years ago
minthanthtoo / myanmar-collation-stats
Myanmar lexicon analyzer - Sorting and Segmentation
☆10 · Aug 11, 2021 · Updated 4 years ago
fosfrancesco / pkspell
Predict the correct pitch spelling and key signatures given a sequence of midi notes by using a deep-learning approach.
☆18 · Jul 26, 2022 · Updated 3 years ago
tpt-adasp / salt
SALT: STANDARDIZED AUDIO EVENT LABEL TAXONOMY
☆15 · Nov 28, 2024 · Updated last year
backspacetg / distilXLSR
Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13 · Mar 30, 2025 · Updated last year
ye-kyaw-thu / Spectrograms-of-Myanmar-Speech
Myanmar consonant and vowel audio files that I recorded at University of Computer Studies Banmaw
☆11 · Mar 2, 2019 · Updated 7 years ago
f0k / minimp3py
Python bindings for minimp3
☆17 · Sep 11, 2023 · Updated 2 years ago
daniel03c1 / NAS_VAD
☆26 · Oct 25, 2024 · Updated last year
nttcslab / dcase2025_task4_baseline
☆18 · Apr 16, 2026 · Updated 3 months ago
HuwCheston / Jazz-Trio-Database
The Jazz Trio Database is a dataset composed of about 45 hours of jazz performances annotated by an automated signal processing pipeline.
☆16 · Sep 27, 2025 · Updated 9 months ago
ShiningLab / POS-Tagger-for-Punctuation-Restoration
This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…
☆11 · May 24, 2026 · Updated 2 months ago
csukuangfj / icefall
☆11 · Jul 16, 2026 · Updated last week
ondrejklejch / acoustic_punctuation
NMT based punctuation prediction system using lexical and acoustic features.
☆14 · Mar 30, 2020 · Updated 6 years ago
mehedihasanbijoy / DPCSpell
[Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages
☆14 · Aug 9, 2024 · Updated last year
catherine-qian / cocosda-SSL
pytorch code for sound event localization and classification
☆13 · Aug 12, 2021 · Updated 4 years ago
zzhhfut / CCNet-AAAI2025
This repository contains code for AAAI2025 paper "Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal …
☆24 · Aug 18, 2025 · Updated 11 months ago
yangjingyuan / ConstDecoder
☆11 · Oct 24, 2022 · Updated 3 years ago
donghoney0416 / DeepASA
Official page of "DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis"
☆26 · Apr 15, 2026 · Updated 3 months ago
CPJKU / cpjku_dcase22
☆19 · Jul 15, 2022 · Updated 4 years ago
huispaty / batik_plays_mozart
A note-aligned performance-to-score-to-annotations dataset of 12 complete Mozart piano sonatas for expressive performance analysis
☆23 · Feb 2, 2026 · Updated 5 months ago
ohbendy / Myanmar-font-resources
Bits and bobs for making and checking Myanmar fonts
☆12 · Feb 2, 2026 · Updated 5 months ago
YongyuG / dnn_aec_data_process
pre-process script for timit data for dnn-aec works
☆38 · Mar 3, 2022 · Updated 4 years ago
w-transposed-x / hifi-gan-denoising
An unofficial PyTorch implementation of "HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversari…
☆24 · Feb 5, 2021 · Updated 5 years ago
worldveil / musical_mel_transform_torch
Musical mel transform for semi/quarter-tone features, written in ONNX-compatible PyTorch for audio AI neural networks
☆20 · Feb 20, 2026 · Updated 5 months ago
fosfrancesco / musicparser
Deep learning based dependency parsing for music sequences
☆26 · Jul 19, 2023 · Updated 3 years ago
blmoistawinde / fense
Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval…
☆21 · Feb 1, 2023 · Updated 3 years ago
fschmid56 / EfficientAT
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …
☆353 · Nov 20, 2024 · Updated last year