RicherMans/Dcase2018_pooling

Repo for our pooling approach on the DCASE2018 task4

☆16

Alternatives and similar repositories for Dcase2018_pooling

Users that are interested in Dcase2018_pooling are comparing it to the libraries listed below.

qiuqiangkong / dcase2019_task4
☆21 · Apr 11, 2019 · Updated 7 years ago
RicherMans / CDur
Repository for the paper "Towards duration robust weakly supervised sound event detection"
☆23 · Aug 3, 2023 · Updated 2 years ago
LCAV / localization-icassp2018
Code of paper "Combining range and direction for improved localization" presented at ICASSP2018
☆10 · Apr 20, 2018 · Updated 8 years ago
RicherMans / GPV
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
☆141 · Aug 3, 2023 · Updated 2 years ago
echocatzh / Demo-of-DeepComplexAEC
☆11 · Jun 15, 2022 · Updated 4 years ago
qiuqiangkong / gan_separation_deconvolution
☆11 · Jun 2, 2019 · Updated 7 years ago
midas-research / speechmix
☆12 · Oct 2, 2020 · Updated 5 years ago
Yifei-ZHAO96 / STAM-pytorch
Pytorch implementation of "spectro-temporal attention-based voice activity detection"
☆13 · Jun 4, 2024 · Updated 2 years ago
Le-Xiaohuai-speech / GMM_VAD
☆17 · Apr 3, 2022 · Updated 4 years ago
RicherMans / Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
☆93 · Aug 3, 2023 · Updated 2 years ago
qiuqiangkong / audioset_source_separation
☆17 · Feb 14, 2020 · Updated 6 years ago
pahud / amazon-eks-gpu-scale
NVIDIA GPU autoscaling on Amazon EKS
☆12 · Jul 7, 2019 · Updated 7 years ago
RicherMans / SpokenLanguageClassifiers
Pretrained spoken language classifiers from audio.
☆10 · Jan 21, 2021 · Updated 5 years ago
swagshaw / WildDESED
WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection
☆18 · Nov 19, 2024 · Updated last year
tqbl / gccaps
An implementation of capsule routing for sound event detection
☆15 · Jan 29, 2019 · Updated 7 years ago
soobinseo / wavenet
Audio source separation (mixture to vocal) using the Wavenet
☆21 · Sep 6, 2017 · Updated 8 years ago
dr-costas / undaw
Unsupervised Domain Adaptation for Acoustic Scene Classification with Wasserstein Distance
☆14 · Sep 16, 2020 · Updated 5 years ago
jeongHwarr / sednn_modify
Python 3.5 and Windows version of Speech Enhancement using DNN by Yong Xu and Qiuqiang Kong
☆15 · Mar 13, 2019 · Updated 7 years ago
rhasspy / ipa2kaldi
Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)
☆10 · Jun 2, 2021 · Updated 5 years ago
FlorianKrey / DNC
Discriminative Neural Clustering for Speaker Diarisation
☆79 · Apr 8, 2022 · Updated 4 years ago
robd003 / sph2pipe
provide SPHERE-formatted output as well as RIFF, AU, AIFF and raw
☆14 · Dec 18, 2021 · Updated 4 years ago
kevinco27 / attentional-similarity
Pytorch implementation of [Learning to match transient sound events using attentional similarity for few-shot sound recognition]
☆33 · Feb 27, 2019 · Updated 7 years ago
tli725 / JL-Corpus
For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…
☆11 · Oct 29, 2018 · Updated 7 years ago
LCAV / TimeDomainAcousticRakeReceiver
Software design and analysis tools for the acoustic rake receiver, a microphone beamformer that uses echoes to improve the noise and inte…
☆14 · May 5, 2015 · Updated 11 years ago
turpaultn / DCASE2019_task4
Baseline of dcase 2019 task 4
☆61 · Sep 2, 2022 · Updated 3 years ago
dr-costas / SEDLM
Language modelling for sound event detection
☆20 · Jan 2, 2020 · Updated 6 years ago
LCAV / pylocus
Localization package using distance and/or angle measurements
☆16 · Mar 11, 2022 · Updated 4 years ago
qiuqiangkong / dcase2019_task3
☆16 · Apr 11, 2019 · Updated 7 years ago
MaigoAkisame / cmu-thesis
Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling
☆169 · May 14, 2022 · Updated 4 years ago
vlsi-nanocomputing / dynamic-sound
DynamicSound Simulator is a modular Python library for generating virtual acoustic scenes with configurable microphones, sound sources, a…
☆18 · Jul 15, 2026 · Updated last week
karolpiczak / echonet
Convolutional neural networks for sound classification
☆20 · Dec 30, 2017 · Updated 8 years ago
ZiangLong / LPCNet_pytorch
A Pytorch version of LPCNet, including dump weight
☆36 · May 5, 2022 · Updated 4 years ago
faroit / commonfate
☆17 · Feb 27, 2020 · Updated 6 years ago
amusi / awesome-semantic-segmentation
awesome-semantic-segmentation
☆11 · Jun 6, 2018 · Updated 8 years ago
toni-heittola / dcase2019_task1_baseline
DCASE2019 Challenge Task 1 baseline system
☆20 · Oct 11, 2019 · Updated 6 years ago
qiuqiangkong / dcase2019_task1
☆20 · May 13, 2019 · Updated 7 years ago
CaA23187 / VAD-based-on-LSTM
An LSTM for voice activity detection. In fact, this is a homework which I didn't expect.
☆13 · Dec 3, 2020 · Updated 5 years ago
hongfeixue / StutteringSpeechChallenge
SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
☆12 · Jun 11, 2024 · Updated 2 years ago
qiuqiangkong / dcase2018_task4
☆13 · Aug 26, 2018 · Updated 7 years ago