π Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
β30Jun 17, 2024Updated 2 years ago
Alternatives and similar repositories for audioset_models
Users that are interested in audioset_models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).β106Aug 1, 2023Updated 2 years ago
- This repository contains code that was used as an example of how to use Python to download part of the AudioSet dataset and use Tensorfloβ¦β13Aug 24, 2017Updated 8 years ago
- Download and create a tfreader for the audioset datasetβ17Apr 16, 2020Updated 6 years ago
- π¦ Nala is an agile open-source voice assistant framework (20+ actions).β36Aug 8, 2023Updated 2 years ago
- Whisp - Environmental Sound Classifierβ13Aug 14, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- π£οΈ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).β388Dec 8, 2022Updated 3 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Kerasβ17Dec 18, 2024Updated last year
- βοΈβοΈ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).β91Jun 17, 2024Updated 2 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GANβ12Oct 22, 2020Updated 5 years ago
- Classification of environmental sounds using first order statistics and GLCM (Gray-Level Co-Occurrence Matrix ) features of a spectrogramβ¦β25Jul 14, 2020Updated 5 years ago
- This is my Masters thesis project titled "Speaker Detection and Conversation Analysis on Mobile Devices".β15May 21, 2017Updated 9 years ago
- Fetch and use Google's AudioSet datasetβ127Apr 13, 2017Updated 9 years ago
- EARS: Environmental Audio Recognition Systemβ122Apr 4, 2018Updated 8 years ago
- A NoLimits Roller Coaster 1 and 2 Library written in C++β12Feb 16, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.β13Jun 2, 2023Updated 3 years ago
- This code submission for the ICCV 17 Real Versus Fake Expressed Emotion Challenge provides source code to extract the features and classiβ¦β11Aug 28, 2017Updated 8 years ago
- Building a Sound Classification iOS Application using AIβ19Aug 26, 2019Updated 6 years ago
- Environmental sound classification using Deep Learning with extracted featuresβ168Jan 22, 2020Updated 6 years ago
- Android sound localization and classification app.β14Jul 4, 2025Updated 11 months ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spottingβ18Aug 26, 2025Updated 9 months ago
- β15Jan 22, 2025Updated last year
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)β23Feb 24, 2022Updated 4 years ago
- DCASE 2016 Baseline system, python implementationβ53Jul 20, 2017Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Sound event detection in real life audio with CNN submitted to DCASE16β22Jun 10, 2022Updated 4 years ago
- Implementation of semi-supervised learning: UDA, MixMatch, Mean-teacher, focusing on NLP, powered by Pytorchβ12Jan 6, 2021Updated 5 years ago
- γ Unity x Live2d x NaverClova x DL γ Personal Assistant bot that has an avatar from Live2d and connecting it with Unityβ10Dec 1, 2023Updated 2 years ago
- β49Aug 30, 2024Updated last year
- Learning embeddings for laughter categorizationβ34Nov 3, 2018Updated 7 years ago
- created based on universal_tool_template.py, together with module_photoshop.py, it allow you to run a cross-platform, cross-application Qβ¦β11Feb 8, 2017Updated 9 years ago
- β10Nov 29, 2019Updated 6 years ago
- β16Jan 25, 2023Updated 3 years ago
- Urban sounds classification with Covnolutional Neural Networksβ37Nov 15, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.β32Sep 13, 2023Updated 2 years ago
- Classifies percussion audio samples with a CNN-LSTM, written in python and pytorch. Also exports to Drumkv1 (lv2 plugin)β14Aug 20, 2020Updated 5 years ago
- Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks".β16Dec 7, 2021Updated 4 years ago
- β15Oct 19, 2018Updated 7 years ago
- OoD Minimum Anomaly Score GAN - Code for the Paper 'OMASGAN: Out-of-Distribution Minimum Anomaly Score GAN for Sample Generation on the Bβ¦β16Jun 3, 2021Updated 5 years ago
- Small-footprint Keyword Spottingβ18Jul 28, 2019Updated 6 years ago
- CSCI572: Information Retrieval and Web Search Enginesβ10Jul 3, 2020Updated 5 years ago