merlresearch/sebbs

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/merlresearch/sebbs)

merlresearch / sebbs

Prediction of sound event bounding boxes (SEBBs)

☆35

Alternatives and similar repositories for sebbs

Users that are interested in sebbs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CPJKU / cpjku_dcase24
View on GitHub
☆29Oct 17, 2024Updated last year
Audio-WestlakeU / ATST-SED
View on GitHub
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
☆174Jun 8, 2026Updated last month
b-sigpro / sed-hsmm
View on GitHub
Onset-and-Offset-Aware Sound Event Detection
☆21Feb 10, 2025Updated last year
BUTSpeechFIT / SOT-DiCoW
View on GitHub
Multi-talker ASR based on DiCoW with Serialized Output Training
☆20Sep 18, 2025Updated 10 months ago
fgnt / sed_scores_eval
View on GitHub
☆41Feb 18, 2026Updated 5 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
fschmid56 / PretrainedSED
View on GitHub
☆145May 13, 2025Updated last year
Sreyan88 / ReCLAP
View on GitHub
☆33Dec 23, 2025Updated 7 months ago
Mddct / usm-tokenizer
View on GitHub
semantic tokenizer for speech and music
☆20Jul 6, 2025Updated last year
eloimoliner / unconditional-diff-STFT
View on GitHub
Unconditional music synthesis using a diffusion model in the STFT domain
☆12May 31, 2022Updated 4 years ago
Mddct / transformer-vocos
View on GitHub
☆35Sep 6, 2025Updated 10 months ago
asigalov61 / Euterpe-X
View on GitHub
[DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…
☆33Nov 23, 2023Updated 2 years ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
View on GitHub
☆13Mar 23, 2026Updated 4 months ago
jiwonix / Sound-Event-Detection-papers
View on GitHub
Sound Event Detection (SED) paper collection
☆15Jun 26, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Audio-WestlakeU / audiossl
View on GitHub
A library built for easier audio self-supervised training, downstream tasks evaluation
☆140Sep 25, 2025Updated 10 months ago
fschmid56 / cpjku_dcase23
View on GitHub
This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"
☆32Sep 18, 2023Updated 2 years ago
fgnt / pb_sed
View on GitHub
Paderborn Sound Event Detection
☆80Jul 18, 2023Updated 3 years ago
Audio-AGI / dcase2024_task9_baseline
View on GitHub
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26Mar 27, 2024Updated 2 years ago
snap-research / GenAU
View on GitHub
☆53Mar 24, 2026Updated 4 months ago
yuhanghe01 / RiTTA
View on GitHub
Event Relation in Text-to-Audio (TTA) Generation
☆21Feb 26, 2025Updated last year
qiuqiangkong / audio_understanding
View on GitHub
☆131Feb 6, 2025Updated last year
frednam93 / MDFD-SED
View on GitHub
☆21Mar 6, 2025Updated last year
Ming-er / LGC-SED
View on GitHub
☆13Jan 3, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
wx9Songs / MOSS-Music-Data-Pipeline
View on GitHub
☆44Apr 26, 2026Updated 2 months ago
Mddct / cosyvoice2-flow-optimized
View on GitHub
faster inference
☆27Jan 20, 2025Updated last year
merlresearch / tssep
View on GitHub
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
☆43Oct 27, 2025Updated 8 months ago
Ming-er / Audio-Free-P-Tuning
View on GitHub
☆11Dec 28, 2023Updated 2 years ago
BUTSpeechFIT / DeCRED
View on GitHub
☆18Aug 13, 2025Updated 11 months ago
pengzhendong / ngram-punctuator
View on GitHub
An N-gram punctuator for Chinese and English.
☆18Oct 14, 2025Updated 9 months ago
fschmid56 / EfficientAT_HEAR
View on GitHub
Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.
☆34Jun 23, 2023Updated 3 years ago
primepake / dac_vae
View on GitHub
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38Aug 30, 2025Updated 10 months ago
OptimusPrimus / tacos
View on GitHub
Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
☆16Oct 12, 2025Updated 9 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
lavendery / AudioComposer
View on GitHub
☆27Sep 10, 2025Updated 10 months ago
qiuqiangkong / audioflow
View on GitHub
☆130Updated this week
Audio-WestlakeU / RVAE-EM
View on GitHub
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…
☆51Mar 6, 2025Updated last year
mubingshen / MLC-SLM-Baseline
View on GitHub
The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…
☆51May 14, 2025Updated last year
BUTSpeechFIT / mt-asr-data-prep
View on GitHub
☆25Feb 26, 2026Updated 4 months ago
frednam93 / FDY-SED
View on GitHub
☆96Jun 22, 2023Updated 3 years ago
ilpoviertola / V-AURA
View on GitHub
The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral)
☆35Feb 11, 2026Updated 5 months ago