Download and create a tfreader for the audioset dataset
☆16Apr 16, 2020Updated 5 years ago
Alternatives and similar repositories for audioset_raw
Users that are interested in audioset_raw are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains code that was used as an example of how to use Python to download part of the AudioSet dataset and use Tensorflo…☆13Aug 24, 2017Updated 8 years ago
- Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020☆16Oct 20, 2020Updated 5 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- The source code for target sound detection☆15Feb 26, 2022Updated 4 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆31Jun 17, 2024Updated last year
- Conferencing Speech Challenge☆95Apr 6, 2021Updated 4 years ago
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- Backpropagable pytorch implementation of https://craffel.github.io/mir_eval/.☆35Jul 8, 2024Updated last year
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Mar 7, 2021Updated 5 years ago
- Subband PCA feature calculation☆16Nov 5, 2018Updated 7 years ago
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆23Apr 15, 2020Updated 5 years ago
- ☆132Jul 21, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Fetch and use Google's AudioSet dataset☆127Apr 13, 2017Updated 8 years ago
- ☆12Oct 2, 2020Updated 5 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Siamese network for unsupervised speech representation learning☆11Oct 12, 2018Updated 7 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Zicx's Notebook.☆10Nov 7, 2025Updated 4 months ago
- crawl profiles of Japanese PornStars from Javhoo.com☆12Feb 8, 2020Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Official code for "Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model"☆12Oct 29, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official repository of "TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection"☆11May 25, 2025Updated 10 months ago
- ☆48Aug 30, 2024Updated last year
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- University of Edinbrugh-Johns Hopkins University's system for ASVspoof 2017 Version 2.0 dataset.☆50May 1, 2019Updated 6 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- Phoneme recognizer based on long temporal context (with ALIZE VAD command added)☆17Apr 7, 2012Updated 13 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆105Aug 1, 2023Updated 2 years ago
- Phone generation model/VAE/GAN/VAE+GAN☆20Jun 26, 2018Updated 7 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆10Nov 29, 2019Updated 6 years ago
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- Cross-Layer Similarity Knowledge Distillation for Speech Enhancement☆11Jun 22, 2023Updated 2 years ago
- 计算机图形学课程设计带报告,OpenGL、Qt,图形绘制系统,画图板,release版,exe直接运行☆11Feb 9, 2022Updated 4 years ago
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago
- Chinese word segmentation with the neural seq2seq model implement in pytorch☆10Dec 13, 2017Updated 8 years ago
- ☆15Jul 9, 2019Updated 6 years ago