Download and create a tfreader for the audioset dataset
☆16Apr 16, 2020Updated 6 years ago
Alternatives and similar repositories for audioset_raw
Users that are interested in audioset_raw are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020☆16Oct 20, 2020Updated 5 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- The source code for target sound detection☆15Feb 26, 2022Updated 4 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆30Jun 17, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Conferencing Speech Challenge☆95Apr 6, 2021Updated 5 years ago
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆14Sep 18, 2020Updated 5 years ago
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- Backpropagable pytorch implementation of https://craffel.github.io/mir_eval/.☆35Jul 8, 2024Updated last year
- ☆17Feb 14, 2020Updated 6 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Rainbow Keywords - Official PyTorch Implementation☆14Jun 27, 2024Updated last year
- ICASSP2019 Tutorial: Detection and Classification of Acoustic Scenes and Events / Code examples☆42Jun 3, 2025Updated 10 months ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Mar 7, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Subband PCA feature calculation☆16Nov 5, 2018Updated 7 years ago
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆23Apr 15, 2020Updated 6 years ago
- ☆134Jul 21, 2021Updated 4 years ago
- Fetch and use Google's AudioSet dataset☆127Apr 13, 2017Updated 9 years ago
- ☆12Oct 2, 2020Updated 5 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Siamese network for unsupervised speech representation learning☆11Oct 12, 2018Updated 7 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 5 years ago
- Zicx's Notebook.☆10Nov 7, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- crawl profiles of Japanese PornStars from Javhoo.com☆12Feb 8, 2020Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Official code for "Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model"☆12Oct 29, 2022Updated 3 years ago
- Official repository of "TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection"☆11May 25, 2025Updated 10 months ago
- ☆49Aug 30, 2024Updated last year
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- University of Edinbrugh-Johns Hopkins University's system for ASVspoof 2017 Version 2.0 dataset.☆50May 1, 2019Updated 6 years ago
- Phoneme recognizer based on long temporal context (with ALIZE VAD command added)☆17Apr 7, 2012Updated 14 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Attacks against proposed image encryption schemes☆10Apr 27, 2020Updated 5 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆106Aug 1, 2023Updated 2 years ago
- Phone generation model/VAE/GAN/VAE+GAN☆20Jun 26, 2018Updated 7 years ago
- ☆10Nov 29, 2019Updated 6 years ago
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- Cross-Layer Similarity Knowledge Distillation for Speech Enhancement☆11Jun 22, 2023Updated 2 years ago