jim-schwoebel/sound_event_detection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jim-schwoebel/sound_event_detection)

jim-schwoebel / sound_event_detection

🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.

☆47

Alternatives and similar repositories for sound_event_detection

Users that are interested in sound_event_detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dr-costas / SEDLM
View on GitHub
Language modelling for sound event detection
☆20Jan 2, 2020Updated 6 years ago
MTG / DCASE-models
View on GitHub
Python library for rapid prototyping of environmental sound analysis systems
☆44May 20, 2022Updated 4 years ago
sithu31296 / audio-tagging
View on GitHub
Easy to use Audio Tagging in PyTorch
☆23Aug 22, 2021Updated 4 years ago
soham97 / awesome-sound_event_detection
View on GitHub
Reading list for research topics in Sound AI
☆201Aug 8, 2024Updated last year
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
BUTSpeechFIT / mt-asr-data-prep
View on GitHub
☆25Feb 26, 2026Updated 5 months ago
qiuqiangkong / dcase2019_task3
View on GitHub
☆16Apr 11, 2019Updated 7 years ago
TWOEARS / documentation
View on GitHub
Documentation of the Two!Ears Auditory Model
☆13Feb 14, 2019Updated 7 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
gyx-gloria / DMT
View on GitHub
Official Implementation of DMT: Dual Mean-Teacher in PyTorch.
☆10Oct 27, 2023Updated 2 years ago
asteroid-team / pytorch-pit
View on GitHub
Permutation invariant training in PyTorch
☆13Oct 2, 2020Updated 5 years ago
vadimkantorov / readaudio
View on GitHub
Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module)
☆11Aug 12, 2020Updated 5 years ago
cnlinxi / speech_emotion
View on GitHub
Detect emotion from audio
☆14Nov 20, 2018Updated 7 years ago
JRMeyer / speakerID-challenge
View on GitHub
A recipe for creating a Speaker Identification system built on Kaldi.
☆15Jan 2, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
TUIlmenauAMS / FilterBanks_PythonKerasNeuralNetworkImplemention
View on GitHub
Filter Bank Implementaion as Convolutional Neural Network using Python Keras
☆17Dec 18, 2024Updated last year
gooofy / kaldi-adapt-lm
View on GitHub
Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model
☆33Jan 26, 2020Updated 6 years ago
alumae / torch-xvectors-wav
View on GitHub
☆22Jun 30, 2021Updated 5 years ago
JaesungHuh / VoxMovies
View on GitHub
Evaluation script for VoxMovies dataset in PyTorch
☆23Jan 12, 2024Updated 2 years ago
rfalcon100 / seld_dcase2022_ric
View on GitHub
My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.
☆12Nov 12, 2022Updated 3 years ago
a-nagrani / VoxSRC2020
View on GitHub
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020
☆43Jul 17, 2020Updated 6 years ago
i3thuan5 / FaNT
View on GitHub
Filtering and Noise Adding Tool
☆29May 27, 2022Updated 4 years ago
VITA-Group / Audio-Lottery
View on GitHub
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32Apr 8, 2022Updated 4 years ago
kahst / AcousticEventDetection
View on GitHub
Source code complementing our paper for acoustic event classification using convolutional neural networks.
☆70Jan 31, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
zhaoyi2 / audio_augment
View on GitHub
A tool/script for batch speech data enhancement with speed/volume/RIRS/MUSAN
☆25Jun 28, 2020Updated 6 years ago
dr-costas / dnd-sed
View on GitHub
Sound event detection with depthwise separable and dilated convolutions.
☆53Mar 30, 2020Updated 6 years ago
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
zhao-shuyang / active_learning
View on GitHub
The active learning algorithm, mismatch-first farthest-traversal. Implementation and visualization.
☆12Dec 25, 2021Updated 4 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
jim-schwoebel / audioset_models
View on GitHub
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
☆31Jun 17, 2024Updated 2 years ago
Akshat4112 / voicenet
View on GitHub
Comprehensive Python library for speech and voice.
☆32Dec 8, 2022Updated 3 years ago
BUTSpeechFIT / AMI-diarization-setup
View on GitHub
☆54Oct 17, 2023Updated 2 years ago
fakufaku / separake
View on GitHub
Echo aware source separation
☆13May 29, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
dense-analysis / vim-speech
View on GitHub
Vim Speech Recognition Experiments
☆20May 30, 2025Updated last year
asteroid-team / Libri_VAD
View on GitHub
Script to generate VAD dataset used in Asteroid recipe
☆21Sep 30, 2021Updated 4 years ago
claritychallenge / clarity_CC
View on GitHub
Support for Clarity Enhancement and Prediction Challenges (obsolete - see README)
☆48Apr 14, 2022Updated 4 years ago
MiukkaZh / MGT
View on GitHub
Learning Domain-Invariant Transformation for Speaker Verification.
☆11Jun 13, 2023Updated 3 years ago
desh2608 / pytorch-tdnn
View on GitHub
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆41Dec 18, 2020Updated 5 years ago
miras-tech / MirasVoice
View on GitHub
MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…
☆19Mar 15, 2020Updated 6 years ago
LeBenchmark / Interspeech2021
View on GitHub
This repository describes our reproducible framework for assessing self-supervised representation learning from speech
☆52Oct 8, 2021Updated 4 years ago