🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
☆47Feb 20, 2022Updated 4 years ago
Alternatives and similar repositories for sound_event_detection
Users that are interested in sound_event_detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Easy to use Audio Tagging in PyTorch☆23Aug 22, 2021Updated 4 years ago
- Python library for rapid prototyping of environmental sound analysis systems☆44May 20, 2022Updated 3 years ago
- Official Implementation of DMT: Dual Mean-Teacher in PyTorch.☆10Oct 27, 2023Updated 2 years ago
- My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.☆12Nov 12, 2022Updated 3 years ago
- Sound event detection with depthwise separable and dilated convolutions.☆53Mar 30, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- ☆17Apr 16, 2026Updated 3 weeks ago
- [CVPR2025] Official code for Lost in Translation Found in Context☆24Jan 14, 2026Updated 3 months ago
- Polyphonic Sound Detection Score (PSDS)☆16Jan 20, 2020Updated 6 years ago
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202…☆37Mar 10, 2026Updated last month
- Efficient Speech Processing Tookit for Automatic Speaker Recognition☆17Feb 8, 2023Updated 3 years ago
- ☆15Oct 15, 2020Updated 5 years ago
- A tool designed to extract numerical data from scanned historical weather documents.☆13Dec 1, 2024Updated last year
- Deep learning model for animal sound classification.☆35May 4, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Support library for the MaskRCNN masks extracted on EPIC-KITCHENS-100☆14Dec 1, 2020Updated 5 years ago
- Ono laboratory audio signal processing exercise for beginners.☆19May 10, 2023Updated 3 years ago
- CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection (ICASSP 2024)☆36May 20, 2025Updated 11 months ago
- Official code for the paper "Understanding Co-speech Gestures in-the-wild"☆24Oct 31, 2025Updated 6 months ago
- Chinese edition. Based on KotlinBy`s one.☆22Jan 17, 2020Updated 6 years ago
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging☆30Mar 19, 2021Updated 5 years ago
- Initial repo for behavioral analyses☆10Aug 24, 2022Updated 3 years ago
- Source code complementing our paper for acoustic event classification using convolutional neural networks.☆70Jan 31, 2021Updated 5 years ago
- Speech2Action CVPR Poster Source Code☆20Apr 29, 2020Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Pure Python MGRS coordinate converter.☆15Nov 23, 2025Updated 5 months ago
- Sound Event Detection (SED) paper collection☆17Jun 26, 2024Updated last year
- simple NMT With Attention For Arabic to English☆11Mar 5, 2022Updated 4 years ago
- [Not Official] Implementation of TC-Resnet, INTERSPEECH 2019☆22Jan 24, 2024Updated 2 years ago
- 2019☆11Aug 11, 2018Updated 7 years ago
- ☆10Mar 15, 2022Updated 4 years ago
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, IEEE ICASSP 2023☆34Dec 21, 2025Updated 4 months ago
- Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"☆12Nov 25, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆30Jun 17, 2024Updated last year
- Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch☆73Sep 27, 2021Updated 4 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Dec 18, 2024Updated last year
- Evaluation script for VoxMovies dataset in PyTorch☆23Jan 12, 2024Updated 2 years ago
- Source data, scripts and makefiles of the experiment for the Speex codec quality evaluation☆22Aug 29, 2011Updated 14 years ago
- Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch☆20Dec 16, 2021Updated 4 years ago
- ☆21Mar 6, 2025Updated last year