jim-schwoebel / audioset_modelsView external linksLinks
π Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
β31Jun 17, 2024Updated last year
Alternatives and similar repositories for audioset_models
Users that are interested in audioset_models are comparing it to the libraries listed below
Sorting:
- π This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).β104Aug 1, 2023Updated 2 years ago
- Model drift detectionβ11Jul 22, 2023Updated 2 years ago
- Whisp - Environmental Sound Classifierβ13Aug 14, 2023Updated 2 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasetsβ14Nov 9, 2021Updated 4 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GANβ12Oct 22, 2020Updated 5 years ago
- This repository contains code that was used as an example of how to use Python to download part of the AudioSet dataset and use Tensorfloβ¦β13Aug 24, 2017Updated 8 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.β13Jun 2, 2023Updated 2 years ago
- This is my Masters thesis project titled "Speaker Detection and Conversation Analysis on Mobile Devices".β15May 21, 2017Updated 8 years ago
- π£οΈ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).β387Dec 8, 2022Updated 3 years ago
- βοΈβοΈ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).β90Jun 17, 2024Updated last year
- use baidu voice-api to add subtitle to a vedioβ15Mar 17, 2019Updated 6 years ago
- Learning embeddings for laughter categorizationβ34Nov 3, 2018Updated 7 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Kerasβ17Dec 18, 2024Updated last year
- π¦ Nala is an agile open-source voice assistant framework (20+ actions).β36Aug 8, 2023Updated 2 years ago
- EARS: Environmental Audio Recognition Systemβ121Apr 4, 2018Updated 7 years ago
- Download and create a tfreader for the audioset datasetβ16Apr 16, 2020Updated 5 years ago
- Building a Sound Classification iOS Application using AIβ19Aug 26, 2019Updated 6 years ago
- This repository consists of all the work done regarding Heart sound classification employing ANN, CNN and other methods, Android Applicatβ¦β18Jun 7, 2019Updated 6 years ago
- β14Sep 15, 2025Updated 5 months ago
- Small-footprint Keyword Spottingβ18Jul 28, 2019Updated 6 years ago
- Environmental sound classification using Deep Learning with extracted featuresβ168Jan 22, 2020Updated 6 years ago
- Fetch and use Google's AudioSet datasetβ126Apr 13, 2017Updated 8 years ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)β22Feb 24, 2022Updated 3 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)β45Jun 29, 2021Updated 4 years ago
- Surrey CVSSP DCASE 2018 Task 2 systemβ20Dec 26, 2022Updated 3 years ago
- Collaborative audio annotation toolβ18Sep 16, 2022Updated 3 years ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: spβ¦β131Jul 24, 2020Updated 5 years ago
- Classification of environmental sounds using first order statistics and GLCM (Gray-Level Co-Occurrence Matrix ) features of a spectrogramβ¦β25Jul 14, 2020Updated 5 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.β32Sep 13, 2023Updated 2 years ago
- An open-source CoreML model trained on the ESC10 datasetβ26Nov 6, 2020Updated 5 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16β22Jun 10, 2022Updated 3 years ago
- LogicCircuit is a program that helps build/simulate simple circuits using logic gates. It is meant to teach people the basics of how logiβ¦β10Jan 22, 2025Updated last year
- MATLAB Simulator for localizing a mobile wireless device using RSSI-Distance estimation.β27Feb 23, 2017Updated 8 years ago
- Extract frequency, power, width and dissonance of formants from wav filesβ28Jun 3, 2022Updated 3 years ago
- A TFLite-compatible fork of YAMNet from tensorflow/modelsβ31Jun 13, 2020Updated 5 years ago
- SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflowβ66Oct 1, 2020Updated 5 years ago
- The package 'data-driven density estimation x' (dddex) turns any standard point forecasting model into an estimator of the underlying conβ¦β10Dec 1, 2025Updated 2 months ago
- DCASE 2016 Baseline system, python implementationβ53Jul 20, 2017Updated 8 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should statβ¦β64Jan 8, 2021Updated 5 years ago