giusenso / seld-tcn
SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflow
☆66Oct 1, 2020Updated 5 years ago
Alternatives and similar repositories for seld-tcn
Users that are interested in seld-tcn are comparing it to the libraries listed below
- Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional re…☆379Nov 21, 2022Updated 3 years ago
- This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.☆104May 31, 2022Updated 3 years ago
- Baseline method for sound event localization task of DCASE 2021 challenge☆42Jun 15, 2021Updated 4 years ago
- ☆19Jun 10, 2021Updated 4 years ago
- Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"☆17Nov 9, 2022Updated 3 years ago
- A machine learning algorithm that estimates the directions of arrival and relative levels of an arbitrary number of sound sources using r…☆12Dec 10, 2022Updated 3 years ago
- A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…☆10Dec 16, 2017Updated 8 years ago
- Acoustic event detection using recurrent neural networks.☆11Sep 4, 2018Updated 7 years ago
- A Direction-of-Arrival estimation code repo accompanying our research paper.☆83Feb 9, 2020Updated 6 years ago
- Benchmarking deep learning models for real-time object detection on various platforms☆13Jan 26, 2018Updated 8 years ago
- Robot (or Device) Localization Using Particle Filter over DOA of Wireless Signals☆15Jul 21, 2024Updated last year
- ☆15Apr 9, 2022Updated 3 years ago
- ☆12Jun 8, 2017Updated 8 years ago
- Source code of the DCASE 2020 SELD submission "Audio Event Detection and Localization with Multitask Regression Network"☆16Jul 8, 2020Updated 5 years ago
- Graph Neural Networks for Sound Source Localization☆26Oct 31, 2023Updated 2 years ago
- Pytorch implementation of the icosahedral CNNs☆20Apr 24, 2023Updated 2 years ago
- ☆15May 18, 2024Updated last year
- Official source code of Arrhythmia Detection☆16Jun 25, 2019Updated 6 years ago
- ☆47Nov 12, 2021Updated 4 years ago
- Code repository for the paper Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks☆87Mar 24, 2023Updated 2 years ago
- CNN based single speaker localization☆50Aug 28, 2020Updated 5 years ago
- ☆20Jun 21, 2022Updated 3 years ago
- ☆23Jan 6, 2023Updated 3 years ago
- Training code of Cornell Birdcall Identification Challenge 6th place solution☆50Oct 12, 2020Updated 5 years ago
- A large-scale evaluation benchmark called DeepFaceGen, aimed at quantitatively assessing the effectiveness of face forgery detection and …☆25May 31, 2025Updated 8 months ago
- ☆95Jun 22, 2023Updated 2 years ago
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- LogicCircuit is a program that helps build/simulate simple circuits using logic gates. It is meant to teach people the basics of how logi…☆10Jan 22, 2025Updated last year
- Extract frequency, power, width and dissonance of formants from wav files☆28Jun 3, 2022Updated 3 years ago
- ☆67Sep 13, 2024Updated last year
- PyTorch Code for Feature Boosting, Suppression, and Diversification for Fine-Grained Visual Classification☆23Apr 16, 2021Updated 4 years ago
- Spectrogram is selected as preprocessing feature of audio clips and a feature representation method based on deep residual network (Spec-…☆26Sep 13, 2020Updated 5 years ago
- 🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.☆46Feb 20, 2022Updated 3 years ago
- Parameter Estimation in Multi-standard Wideband Receivers via Deep Learning. (DOA - Direction of Arrival)☆27Apr 22, 2022Updated 3 years ago
- Adapting a ConvNeXt model to audio classification on AudioSet☆27Feb 19, 2025Updated 11 months ago
- A DNN based Normalized Time-frequency Weighted Criterion for Robust Wideband DoA Estimation☆34Feb 21, 2023Updated 2 years ago
- EARS: Environmental Audio Recognition System☆121Apr 4, 2018Updated 7 years ago
- A TFLite-compatible fork of YAMNet from tensorflow/models☆31Jun 13, 2020Updated 5 years ago
- A repository to collect coding material for the Polimi's course "Creative Programming and Computing" held by prof. Massimiliano Zanoni in…☆11Dec 11, 2019Updated 6 years ago