sithu31296 / audio-tagging
Easy to use Audio Tagging in PyTorch
★23 · Aug 22, 2021 · Updated 4 years ago
Alternatives and similar repositories for audio-tagging
Users that are interested in audio-tagging are comparing it to the libraries listed below
- This repository was created for the NHN ASR hackathon competition. ★11 · Sep 20, 2023 · Updated 2 years ago
- A repository for manually annotating files to create labeled acoustic datasets for machine learning. ★46 · Feb 20, 2022 · Updated 3 years ago
- My system for the DCASE 2022 Task 3 Sound Event Localization and Detection. ★12 · Nov 12, 2022 · Updated 3 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ★11 · May 14, 2025 · Updated 9 months ago
- ★17 · Oct 18, 2023 · Updated 2 years ago
- KWS demo based on CTC prefix beam search. ★17 · Oct 21, 2023 · Updated 2 years ago
- A toolkit for researchers in multimodal sound separation. ★16 · Oct 20, 2023 · Updated 2 years ago
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition ★18 · Dec 1, 2024 · Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ★18 · Aug 1, 2025 · Updated 6 months ago
- ★36 · Feb 23, 2022 · Updated 3 years ago
- Python implementation of a few speech intelligibility prediction algorithms ★15 · May 29, 2024 · Updated last year
- ★21 · Jul 15, 2024 · Updated last year
- Streaming Audiotransformers for online Audio tagging ★51 · Jun 14, 2024 · Updated last year
- Reading list for research topics in Sound AI ★196 · Aug 8, 2024 · Updated last year
- Language modelling for sound event detection ★20 · Jan 2, 2020 · Updated 6 years ago
- ★25 · May 14, 2020 · Updated 5 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading and writing ark/scp files ★55 · Sep 1, 2025 · Updated 5 months ago
- Finally, some decent sample sentences ★23 · Dec 3, 2023 · Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging ★56 · Jun 12, 2025 · Updated 8 months ago
- Podcast Summarizer with LLM Technology ★30 · May 28, 2025 · Updated 8 months ago
- Decoders from Kaldi using OpenFst ★34 · Jan 29, 2026 · Updated 2 weeks ago
- TTS Text Analyzer ★32 · Jul 20, 2023 · Updated 2 years ago
- ★26 · Sep 14, 2017 · Updated 8 years ago
- A benchmark for evaluating audio encoders on various audio tasks. ★42 · Dec 11, 2025 · Updated 2 months ago
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning) ★30 · Jul 9, 2024 · Updated last year
- Tracking the state of the art and recent results (bibliography) on sound tasks. ★32 · Jan 10, 2023 · Updated 3 years ago
- This is the official implementation of reverberant speech to room impulse response estimator ★40 · Aug 7, 2024 · Updated last year
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB) ★31 · May 14, 2024 · Updated last year
- ★33 · Jun 29, 2023 · Updated 2 years ago
- Neural Dereverberation ★36 · May 22, 2019 · Updated 6 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training … ★328 · Nov 20, 2024 · Updated last year
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model. ★31 · May 22, 2025 · Updated 8 months ago
- Explanatory materials on Active Noise Control ★33 · Aug 5, 2022 · Updated 3 years ago
- ★32 · Nov 24, 2024 · Updated last year
- A simple neural truecaser written in pytorch and allennlp. ★33 · Jun 17, 2024 · Updated last year
- Official code of ElasticAST (Interspeech 2024 paper) ★34 · Jul 30, 2024 · Updated last year
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources. ★67 · Sep 9, 2019 · Updated 6 years ago
- Sharif Emotional Speech Database ★39 · Jan 9, 2021 · Updated 5 years ago
- A collection of audio autoencoders, in PyTorch. ★44 · Mar 7, 2023 · Updated 2 years ago