multitel-ai / urban-sound-classification-and-comparison
Urban Sound Classification : striving towards a fair comparison
☆17Dec 11, 2020Updated 5 years ago
Alternatives and similar repositories for urban-sound-classification-and-comparison
Users who are interested in urban-sound-classification-and-comparison are comparing it to the libraries listed below
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Bag-of-Features Acoustic Event Detection☆14Oct 5, 2016Updated 9 years ago
- PyTorch implementation of Robust Subspace Recovery Layer for Unsupervised Anomaly Detection https://arxiv.org/abs/1904.00152☆14Apr 26, 2021Updated 4 years ago
- PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate audioset category ontology.☆101Apr 16, 2025Updated 10 months ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch☆73Sep 27, 2021Updated 4 years ago
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging☆30Mar 19, 2021Updated 4 years ago
- This repository contains example code to build models on TPUs☆30Feb 17, 2023Updated 3 years ago
- Audio classification via transfer learning☆35Oct 3, 2019Updated 6 years ago
- Visual Relationship Understanding☆10Oct 2, 2021Updated 4 years ago
- Automate issue discovery for your projects against Lightning nightly and releases.☆46May 6, 2025Updated 9 months ago
- A bookshelf generator Kit Extension created using OpenUSD☆11Jul 25, 2024Updated last year
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 4 years ago
- Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"☆11Sep 20, 2021Updated 4 years ago
- Deep learning and standard machine learning methods are developed and compared in classifying audio samples from microphones deployed abo…☆11Jan 17, 2020Updated 6 years ago
- Code for "Proposition-Level Clustering for Multi-Document Summarization" paper☆10Apr 5, 2024Updated last year
- Deep Audio Segmenter, unsupervised☆10Jan 29, 2022Updated 4 years ago
- ML Project control panel☆10Sep 30, 2022Updated 3 years ago
- Optimizable stack of images at different resolutions, a useful representation of images for deep learning tasks. Docs: https://johnowhita…☆11Sep 8, 2022Updated 3 years ago
- `junior must know his place` team solution☆10Aug 15, 2023Updated 2 years ago
- A fine multimodality fusion network :)☆11Aug 9, 2021Updated 4 years ago
- Tools to build knowledge graphs from multi-modal extractions☆12Apr 2, 2020Updated 5 years ago
- Caracal for python☆12Jun 3, 2022Updated 3 years ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 3 months ago
- ☆10Nov 15, 2021Updated 4 years ago
- Reference implementation and test synthetic data for Sorted Center Time echo density measure for acoustic impulse responses☆15Mar 18, 2020Updated 5 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- https://first-starfyre-app.netlify.app/☆10Jun 27, 2023Updated 2 years ago
- ☆11Mar 25, 2024Updated last year
- A Playground for Variational Autoencoders☆12Feb 11, 2018Updated 8 years ago
- ☆10Nov 10, 2021Updated 4 years ago
- Evaluation metrics and submission file creation scripts for the Action Recognition challenge☆14Feb 9, 2026Updated last week
- https://arxiv.org/pdf/1806.03589v2.pdf☆11Mar 24, 2021Updated 4 years ago
- A Tensorflow implementation of Speech Emotion Recognition using Audio signals and Text Data☆12May 16, 2022Updated 3 years ago
- Annotation of in source LC/MS data☆12Oct 19, 2024Updated last year
- Siamese network for unsupervised speech representation learning☆11Oct 12, 2018Updated 7 years ago
- This program determines the age range of a person from their voice. It uses a simple Mel-log spectrogram approach with a multi-layer perc…☆15Nov 3, 2017Updated 8 years ago
- Code for Paper "Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models"☆11Oct 25, 2021Updated 4 years ago
- ☆11May 18, 2022Updated 3 years ago