This package aims at simplifying the download of the AudioSet dataset.
☆58 · Jul 17, 2025 · Updated 8 months ago
Alternatives and similar repositories for audioset-download
Users that are interested in audioset-download are comparing it to the libraries listed below.
- Download audioset data super fastly with youtube-dl, ffmpeg and python multiprocessing ☆48 · Aug 1, 2024 · Updated last year
- This package aims at simplifying the download of the AudioCaps dataset. ☆36 · Dec 1, 2023 · Updated 2 years ago
- Toolkit for downloading and processing Google's AudioSet dataset. ☆179 · Aug 22, 2025 · Updated 7 months ago
- A spoken version of the textual story cloze benchmark ☆21 · Aug 6, 2023 · Updated 2 years ago
- Simple voice activity detection (VAD) algorithm in Python ☆15 · Aug 10, 2023 · Updated 2 years ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection ☆17 · Nov 19, 2024 · Updated last year
- Pre-training BART model for the Italian Language ☆16 · Dec 28, 2022 · Updated 3 years ago
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation ☆29 · Dec 19, 2024 · Updated last year
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation ☆61 · Jul 2, 2025 · Updated 8 months ago
- [ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis ☆52 · Apr 9, 2025 · Updated 11 months ago
- Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021. ☆14 · Jul 25, 2023 · Updated 2 years ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation ☆45 · Mar 27, 2024 · Updated 2 years ago
- AudioLDM training, finetuning, evaluation and inference. ☆298 · Dec 13, 2024 · Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning ☆15 · Jun 23, 2024 · Updated last year
- provide SPHERE-formatted output as well as RIFF, AU, AIFF and raw ☆14 · Dec 18, 2021 · Updated 4 years ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024 ☆32 · Mar 14, 2025 · Updated last year
- Unofficial implementation of FSD50k baselines for Sound Event Recognition ☆26 · Apr 27, 2024 · Updated last year
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline ☆198 · Dec 13, 2024 · Updated last year
- ARCH: Audio Representations benCHmark ☆54 · Aug 26, 2024 · Updated last year
- Code for paper Learning Audio-Visual Dereverberation ☆31 · Aug 10, 2022 · Updated 3 years ago
- Code to train a custom time-domain autoencoder to dereverb audio ☆16 · Nov 30, 2023 · Updated 2 years ago
- iSeparate library for the SDX2023 challenge ☆15 · Dec 15, 2023 · Updated 2 years ago
- ☆23 · Feb 2, 2022 · Updated 4 years ago
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition… ☆18 · Jul 23, 2024 · Updated last year
- SPEAR Challenge scripts and tools. ☆25 · Mar 17, 2023 · Updated 3 years ago
- ☆10 · Jun 6, 2023 · Updated 2 years ago
- [IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer ☆223 · Nov 30, 2025 · Updated 4 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆39 · Jan 6, 2024 · Updated 2 years ago
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models) ☆74 · Feb 13, 2025 · Updated last year
- VPN and Speed Test UI with SwiftUI ☆11 · Aug 19, 2021 · Updated 4 years ago
- ☆38 · Jul 4, 2024 · Updated last year
- ☆11 · Apr 12, 2024 · Updated last year
- ☆14 · Jun 16, 2023 · Updated 2 years ago
- A Playground for Variational Autoencoders ☆12 · Feb 11, 2018 · Updated 8 years ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation ☆15 · Aug 1, 2024 · Updated last year
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge. ☆32 · Jan 26, 2024 · Updated 2 years ago
- ☆117 · Updated this week
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022 ☆210 · Jul 14, 2022 · Updated 3 years ago
- Generate accompaniment part with chords using Evolutionary algorithm. ☆11 · May 8, 2022 · Updated 3 years ago