unixpickle / audioset
Fetch and use Google's AudioSet dataset
☆126Apr 13, 2017Updated 8 years ago
Alternatives and similar repositories for audioset
Users who are interested in audioset are comparing it to the libraries listed below
- This repository contains code that was used as an example of how to use Python to download part of the AudioSet dataset and use Tensorflo…☆13Aug 24, 2017Updated 8 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆104Aug 1, 2023Updated 2 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆44Aug 20, 2024Updated last year
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆697May 21, 2018Updated 7 years ago
- ☆231Feb 9, 2020Updated 6 years ago
- ☆17Feb 14, 2020Updated 6 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆31Jun 17, 2024Updated last year
- Download and create a tfreader for the audioset dataset☆16Apr 16, 2020Updated 5 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Jul 25, 2024Updated last year
- An implementation of the Prism layer (https://arxiv.org/abs/2011.04823)☆12Nov 13, 2020Updated 5 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Python library for handling audio datasets.☆138Jul 6, 2023Updated 2 years ago
- An implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆656Apr 5, 2022Updated 3 years ago
- Toolkit for downloading and processing Google's AudioSet dataset.☆175Aug 22, 2025Updated 5 months ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆56Apr 25, 2023Updated 2 years ago
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆169May 14, 2022Updated 3 years ago
- An audio classification system for learning with out-of-distribution data☆33Dec 8, 2022Updated 3 years ago
- Paderbox: A collection of utilities for audio / speech processing☆43Jul 21, 2025Updated 6 months ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆57Oct 8, 2025Updated 4 months ago
- Pytorch implementation of the paper: Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆13Sep 18, 2020Updated 5 years ago
- Tutorial covering Open Source tools for Source Separation.☆15Nov 12, 2021Updated 4 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 5 months ago
- Filter Banks, Fast Python Implementation☆42Jul 9, 2022Updated 3 years ago
- Frechet Audio Distance evaluation in PyTorch☆36Jun 9, 2023Updated 2 years ago
- VGGSound: A Large-scale Audio-Visual Dataset☆350Sep 13, 2021Updated 4 years ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆37Feb 24, 2025Updated 11 months ago
- Big Impulse Response Dataset☆156Oct 19, 2022Updated 3 years ago
- Phoneme recognition using MFCC feature extraction and DTW analysis☆17Jul 13, 2019Updated 6 years ago
- Source code of the DCASE 2020 SELD submission "Audio Event Detection and Localization with Multitask Regression Network"☆16Jul 8, 2020Updated 5 years ago
- Tutorials and examples for Google AudioSet☆17Sep 19, 2017Updated 8 years ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆120Aug 8, 2025Updated 6 months ago
- DCASE 2017 Baseline system☆82Jun 26, 2020Updated 5 years ago
- ☆46Dec 17, 2018Updated 7 years ago
- A TFLite-compatible fork of YAMNet from tensorflow/models☆31Jun 13, 2020Updated 5 years ago
- A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https:/…☆218Jul 6, 2023Updated 2 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,132Nov 24, 2025Updated 2 months ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43May 24, 2022Updated 3 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Apr 11, 2022Updated 3 years ago