π Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
β31Jun 17, 2024Updated last year
Alternatives and similar repositories for audioset_models
Users that are interested in audioset_models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).β105Aug 1, 2023Updated 2 years ago
- This repository contains code that was used as an example of how to use Python to download part of the AudioSet dataset and use Tensorfloβ¦β13Aug 24, 2017Updated 8 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasetsβ15Nov 9, 2021Updated 4 years ago
- Download and create a tfreader for the audioset datasetβ16Apr 16, 2020Updated 5 years ago
- π¦ Nala is an agile open-source voice assistant framework (20+ actions).β36Aug 8, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Whisp - Environmental Sound Classifierβ13Aug 14, 2023Updated 2 years ago
- π£οΈ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).β388Dec 8, 2022Updated 3 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Kerasβ17Dec 18, 2024Updated last year
- βοΈβοΈ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).β90Jun 17, 2024Updated last year
- unsupervised ASR (mainly phone classifier) using EODM and GANβ12Oct 22, 2020Updated 5 years ago
- Classification of environmental sounds using first order statistics and GLCM (Gray-Level Co-Occurrence Matrix ) features of a spectrogramβ¦β25Jul 14, 2020Updated 5 years ago
- This is my Masters thesis project titled "Speaker Detection and Conversation Analysis on Mobile Devices".β15May 21, 2017Updated 8 years ago
- Fetch and use Google's AudioSet datasetβ127Apr 13, 2017Updated 8 years ago
- EARS: Environmental Audio Recognition Systemβ121Apr 4, 2018Updated 7 years ago
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A NoLimits Roller Coaster 1 and 2 Library written in C++β11Feb 16, 2023Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.β13Jun 2, 2023Updated 2 years ago
- Building a Sound Classification iOS Application using AIβ19Aug 26, 2019Updated 6 years ago
- use baidu voice-api to add subtitle to a vedioβ15Mar 17, 2019Updated 7 years ago
- Environmental sound classification using Deep Learning with extracted featuresβ168Jan 22, 2020Updated 6 years ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spottingβ17Aug 26, 2025Updated 7 months ago
- Model drift detectionβ11Jul 22, 2023Updated 2 years ago
- Android sound localization and classification app.β14Jul 4, 2025Updated 8 months ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)β23Feb 24, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- DCASE 2016 Baseline system, python implementationβ53Jul 20, 2017Updated 8 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16β22Jun 10, 2022Updated 3 years ago
- Implementation of semi-supervised learning: UDA, MixMatch, Mean-teacher, focusing on NLP, powered by Pytorchβ12Jan 6, 2021Updated 5 years ago
- γ Unity x Live2d x NaverClova x DL γ Personal Assistant bot that has an avatar from Live2d and connecting it with Unityβ10Dec 1, 2023Updated 2 years ago
- Learning embeddings for laughter categorizationβ34Nov 3, 2018Updated 7 years ago
- created based on universal_tool_template.py, together with module_photoshop.py, it allow you to run a cross-platform, cross-application Qβ¦β11Feb 8, 2017Updated 9 years ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: spβ¦β131Jul 24, 2020Updated 5 years ago
- Urban sounds classification with Covnolutional Neural Networksβ37Nov 15, 2019Updated 6 years ago
- This repository consists of all the work done regarding Heart sound classification employing ANN, CNN and other methods, Android Applicatβ¦β18Jun 7, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.β32Sep 13, 2023Updated 2 years ago
- Classifies percussion audio samples with a CNN-LSTM, written in python and pytorch. Also exports to Drumkv1 (lv2 plugin)β14Aug 20, 2020Updated 5 years ago
- Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks".β16Dec 7, 2021Updated 4 years ago
- OoD Minimum Anomaly Score GAN - Code for the Paper 'OMASGAN: Out-of-Distribution Minimum Anomaly Score GAN for Sample Generation on the Bβ¦β15Jun 3, 2021Updated 4 years ago
- Small-footprint Keyword Spottingβ18Jul 28, 2019Updated 6 years ago
- CSCI572: Information Retrieval and Web Search Enginesβ10Jul 3, 2020Updated 5 years ago
- implementing beamforming algorithm in C++β11Jan 9, 2020Updated 6 years ago