sithu31296/audio-tagging

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sithu31296/audio-tagging)

sithu31296 / audio-tagging

Easy to use Audio Tagging in PyTorch

☆23

Alternatives and similar repositories for audio-tagging

Users that are interested in audio-tagging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rfalcon100 / seld_dcase2022_ric
View on GitHub
My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.
☆12Nov 12, 2022Updated 3 years ago
dr-costas / SEDLM
View on GitHub
Language modelling for sound event detection
☆20Jan 2, 2020Updated 6 years ago
jim-schwoebel / sound_event_detection
View on GitHub
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
☆47Feb 20, 2022Updated 4 years ago
JunhoKim94 / ASR_project
View on GitHub
This repository created for the NHN ASR hackathon competition.
☆11Sep 20, 2023Updated 2 years ago
soham97 / awesome-sound_event_detection
View on GitHub
Reading list for research topics in Sound AI
☆201Aug 8, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
MTG / DCASE-models
View on GitHub
Python library for rapid prototyping of environmental sound analysis systems
☆44May 20, 2022Updated 4 years ago
RicherMans / CED
View on GitHub
Source code for Consistent ensemble distillation for audio tagging
☆75Mar 20, 2026Updated 4 months ago
gemengtju / L-SpEx
View on GitHub
☆39Feb 23, 2022Updated 4 years ago
dhimasryan / TMHINT-QI-VoiceMOS2023
View on GitHub
☆17Oct 18, 2023Updated 2 years ago
JusperLee / Look2hear
View on GitHub
A toolkit for researchers in the multimodal sound separation.
☆16Oct 20, 2023Updated 2 years ago
sid0710 / audio_data_augmentation
View on GitHub
☆26Sep 14, 2017Updated 8 years ago
liam-kelley / RIR-in-a-Box
View on GitHub
Code for the paper "RIR-in-a-Box : Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation" presented at Interspeech 20…
☆16Sep 1, 2024Updated last year
RicherMans / SAT
View on GitHub
Streaming Audiotransformers for online Audio tagging
☆57Jun 14, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
fschmid56 / EfficientAT
View on GitHub
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …
☆353Nov 20, 2024Updated last year
Yip-Jia-Qi / codecformer
View on GitHub
☆21Jul 15, 2024Updated 2 years ago
k2-fsa / kaldifst
View on GitHub
Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files
☆56Apr 9, 2026Updated 3 months ago
hecko-yes / tts-dataset-prompts
View on GitHub
Finally, some decent sample sentences
☆24Dec 3, 2023Updated 2 years ago
dr-costas / dnd-sed
View on GitHub
Sound event detection with depthwise separable and dilated convolutions.
☆53Mar 30, 2020Updated 6 years ago
YoshikiMas / madeon-asr
View on GitHub
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
☆19Dec 1, 2024Updated last year
kts707 / real-time-audio-denoiser
View on GitHub
A CNN-based audio denoiser
☆10May 2, 2021Updated 5 years ago
aminEdraki / py-intelligibility
View on GitHub
Python implementation of a few speech intelligibility prediction algorithms
☆15May 29, 2024Updated 2 years ago
soham97 / sound_ai_progress
View on GitHub
Tracking states of the arts and recent results (bibliography) on sound tasks.
☆33Jan 10, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
CUC-MIPG / UnifyEdit
View on GitHub
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model
☆13Dec 29, 2024Updated last year
newjins-papa / android-rnnoise
View on GitHub
☆16Nov 17, 2020Updated 5 years ago
mxmaxi007 / Variable_Length_Emotion_Recognition
View on GitHub
Classify the emotions from variable-length speech segments
☆11Mar 29, 2018Updated 8 years ago
YosukeSugiura / ActiveNoiseControl
View on GitHub
能動騒音制御(Active Noise Control)の説明資料
☆35Aug 5, 2022Updated 3 years ago
WangHelin1997 / GL-AT
View on GitHub
Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.
☆13Feb 6, 2021Updated 5 years ago
sainathadapa / dcase2019-task5-urban-sound-tagging
View on GitHub
1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging
☆30Mar 19, 2021Updated 5 years ago
onolab-tmu / asp-tutorial-2022
View on GitHub
Ono laboratory audio signal processing exercise for beginners.
☆19May 10, 2023Updated 3 years ago
cyrta / awesome-speech-enhancement
View on GitHub
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
☆69Sep 9, 2019Updated 6 years ago
jagger2048 / WebRtc_AGC1
View on GitHub
This repository is webrtc agc module demo.
☆12Jan 23, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yuekaizhang / minutes
View on GitHub
Podcast Summarizer with LLM Technology
☆30May 28, 2025Updated last year
int-brain-lab / analysis
View on GitHub
Initial repo for behavioral analyses
☆11Aug 24, 2022Updated 3 years ago
FrancoisGrondin / steernet
View on GitHub
☆27May 14, 2020Updated 6 years ago
zyfu0000 / lameHelper
View on GitHub
A c++ wrapper for the LAME library that reduces conversion of PCM (*.wav) to mp3 and vice versa to just two lines of codes.
☆12Jan 8, 2015Updated 11 years ago
jovan-stojanovic / Animal-sound-recognition
View on GitHub
Deep learning model for animal sound classification.
☆35May 4, 2024Updated 2 years ago
Xiaobin-Rong / lite-rtse
View on GitHub
An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement
☆14Nov 19, 2023Updated 2 years ago
stdKonjac / DeepComplexCRN
View on GitHub
☆13Mar 22, 2021Updated 5 years ago