π Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
β30Jun 17, 2024Updated last year
Alternatives and similar repositories for audioset_models
Users that are interested in audioset_models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).β106Aug 1, 2023Updated 2 years ago
- This repository contains code that was used as an example of how to use Python to download part of the AudioSet dataset and use Tensorfloβ¦β13Aug 24, 2017Updated 8 years ago
- Download and create a tfreader for the audioset datasetβ17Apr 16, 2020Updated 6 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasetsβ16Nov 9, 2021Updated 4 years ago
- π¦ Nala is an agile open-source voice assistant framework (20+ actions).β36Aug 8, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Whisp - Environmental Sound Classifierβ13Aug 14, 2023Updated 2 years ago
- π£οΈ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).β388Dec 8, 2022Updated 3 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Kerasβ17Dec 18, 2024Updated last year
- βοΈβοΈ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).β91Jun 17, 2024Updated last year
- unsupervised ASR (mainly phone classifier) using EODM and GANβ12Oct 22, 2020Updated 5 years ago
- Classification of environmental sounds using first order statistics and GLCM (Gray-Level Co-Occurrence Matrix ) features of a spectrogramβ¦β25Jul 14, 2020Updated 5 years ago
- This is my Masters thesis project titled "Speaker Detection and Conversation Analysis on Mobile Devices".β15May 21, 2017Updated 9 years ago
- Fetch and use Google's AudioSet datasetβ127Apr 13, 2017Updated 9 years ago
- EARS: Environmental Audio Recognition Systemβ121Apr 4, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A NoLimits Roller Coaster 1 and 2 Library written in C++β12Feb 16, 2023Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.β13Jun 2, 2023Updated 2 years ago
- Building a Sound Classification iOS Application using AIβ19Aug 26, 2019Updated 6 years ago
- use baidu voice-api to add subtitle to a vedioβ15Mar 17, 2019Updated 7 years ago
- Environmental sound classification using Deep Learning with extracted featuresβ168Jan 22, 2020Updated 6 years ago
- Android sound localization and classification app.β14Jul 4, 2025Updated 10 months ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spottingβ17Aug 26, 2025Updated 9 months ago
- Model drift detectionβ11Jul 22, 2023Updated 2 years ago
- β15Jan 22, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)β23Feb 24, 2022Updated 4 years ago
- DCASE 2016 Baseline system, python implementationβ53Jul 20, 2017Updated 8 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16β22Jun 10, 2022Updated 3 years ago
- Implementation of semi-supervised learning: UDA, MixMatch, Mean-teacher, focusing on NLP, powered by Pytorchβ12Jan 6, 2021Updated 5 years ago
- γ Unity x Live2d x NaverClova x DL γ Personal Assistant bot that has an avatar from Live2d and connecting it with Unityβ10Dec 1, 2023Updated 2 years ago
- β49Aug 30, 2024Updated last year
- The code for the paper, 'Meta-Curvature, Eunbyung Park and Junier Oliver, NeurIPS 2019'β11Jan 20, 2020Updated 6 years ago
- Learning embeddings for laughter categorizationβ34Nov 3, 2018Updated 7 years ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: spβ¦β129Jul 24, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This repository consists of all the work done regarding Heart sound classification employing ANN, CNN and other methods, Android Applicatβ¦β18Jun 7, 2019Updated 6 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.β32Sep 13, 2023Updated 2 years ago
- Classifies percussion audio samples with a CNN-LSTM, written in python and pytorch. Also exports to Drumkv1 (lv2 plugin)β14Aug 20, 2020Updated 5 years ago
- C++ Program to detect Microphone Wind Noise in audio Filesβ51Oct 22, 2015Updated 10 years ago
- OoD Minimum Anomaly Score GAN - Code for the Paper 'OMASGAN: Out-of-Distribution Minimum Anomaly Score GAN for Sample Generation on the Bβ¦β15Jun 3, 2021Updated 4 years ago
- Small-footprint Keyword Spottingβ18Jul 28, 2019Updated 6 years ago
- implementing beamforming algorithm in C++β11Jan 9, 2020Updated 6 years ago