jim-schwoebel / audioset_modelsLinks
π Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
β30Updated last year
Alternatives and similar repositories for audioset_models
Users that are interested in audioset_models are comparing it to the libraries listed below
Sorting:
- Download and create a tfreader for the audioset datasetβ16Updated 5 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, aβ¦β41Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presentβ¦β25Updated 2 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".β13Updated 2 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)β29Updated 6 years ago
- Constrained Permutation Invariant Training, Speech Separationβ47Updated 4 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Lβ¦β55Updated last year
- β16Updated 4 years ago
- π This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).β103Updated last year
- Baseline systems for the FSD50K datasetβ69Updated 3 years ago
- β46Updated 9 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detectionβ28Updated 2 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of aβ¦β95Updated 5 years ago
- COLA contrastive pre-training method implemented in PyTorchβ43Updated 4 years ago
- Benchmark for sound event localization task of DCASE 2019 challengeβ77Updated 4 years ago
- β18Updated 4 years ago
- β13Updated last year
- Paderbox: A collection of utilities for audio / speech processingβ38Updated last month
- Audio data augmentation examplesβ34Updated 7 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.β32Updated last year
- Filtering and Noise Adding Toolβ29Updated 3 years ago
- β24Updated 6 years ago
- Various algorithms for voice activity detectionβ22Updated 8 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.β42Updated 2 years ago
- Deep Speech Distances PyTorchβ29Updated 3 years ago
- β16Updated 6 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.β25Updated 6 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"β33Updated 3 years ago
- Streaming Audiotransformers for online Audio taggingβ45Updated last year
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarityβ12Updated 5 years ago