Easy to use Audio Tagging in PyTorch
â23Aug 22, 2021Updated 4 years ago
Alternatives and similar repositories for audio-tagging
Users that are interested in audio-tagging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.â12Nov 12, 2022Updated 3 years ago
- ðĩ A repository for manually annotating files to create labeled acoustic datasets for machine learning.â47Feb 20, 2022Updated 4 years ago
- Language modelling for sound event detectionâ20Jan 2, 2020Updated 6 years ago
- This repository created for the NHN ASR hackathon competition.â11Sep 20, 2023Updated 2 years ago
- Reading list for research topics in Sound AIâ196Aug 8, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient âĒ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speechâ11May 14, 2025Updated 10 months ago
- Python library for rapid prototyping of environmental sound analysis systemsâ44May 20, 2022Updated 3 years ago
- Source code for Consistent ensemble distillation for audio taggingâ60Mar 20, 2026Updated last week
- Research_speech_speaker_verification_nist_sre2010â12Mar 1, 2016Updated 10 years ago
- â37Feb 23, 2022Updated 4 years ago
- A toolkit for researchers in the multimodal sound separation.â16Oct 20, 2023Updated 2 years ago
- â26Sep 14, 2017Updated 8 years ago
- Streaming Audiotransformers for online Audio taggingâ53Jun 14, 2024Updated last year
- â17Oct 18, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways âĒ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training âĶâ335Nov 20, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.â18Aug 1, 2025Updated 7 months ago
- KWS demo based on CTC prefix beam search.â17Oct 21, 2023Updated 2 years ago
- â21Jul 15, 2024Updated last year
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp filesâ57Sep 1, 2025Updated 6 months ago
- Sound event detection with depthwise separable and dilated convolutions.â53Mar 30, 2020Updated 5 years ago
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognitionâ18Dec 1, 2024Updated last year
- A CNN-based audio denoiserâ10May 2, 2021Updated 4 years ago
- Decoders from Kaldi using OpenFstâ34Jan 29, 2026Updated last month
- DigitalOcean Gradient AI Platform âĒ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Tracking states of the arts and recent results (bibliography) on sound tasks.â32Jan 10, 2023Updated 3 years ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Modelâ13Dec 29, 2024Updated last year
- â16Nov 17, 2020Updated 5 years ago
- Python implementation of a few speech intelligibility prediction algorithmsâ15May 29, 2024Updated last year
- č―åéĻéģåķåūĄ(Active Noise Control)ãŪ芎æčģæâ33Aug 5, 2022Updated 3 years ago
- Deep learning model for animal sound classification.â35May 4, 2024Updated last year
- Classify the emotions from variable-length speech segmentsâ11Mar 29, 2018Updated 7 years ago
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.â67Sep 9, 2019Updated 6 years ago
- Ono laboratory audio signal processing exercise for beginners.â19May 10, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI âĒ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.â13Feb 6, 2021Updated 5 years ago
- For accessing to the dataset, please send your short bio and objective of the study to Dr.Theerawit Wilaiprasitporn (theerawit dot w at vâĶâ14Apr 29, 2021Updated 4 years ago
- A database of clean and noisy speech for audio researchâ10Jan 26, 2018Updated 8 years ago
- Podcast Summarizer with LLM Technologyâ30May 28, 2025Updated 10 months ago
- This repository is webrtc agc module demo.â12Jan 23, 2019Updated 7 years ago
- AI Music Structure Analyzer + Stem Splitter using Demucs & Mdx-Net with Python-Audio-Separator | Cog | Replicateâ12Mar 3, 2024Updated 2 years ago
- TTS Text Analyzerâ31Jul 20, 2023Updated 2 years ago