Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"
☆17Nov 9, 2022Updated 3 years ago
Alternatives and similar repositories for MTL_Weakly_labelled_audio_data
Users that are interested in MTL_Weakly_labelled_audio_data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for the paper "Towards duration robust weakly supervised sound event detection"☆23Aug 3, 2023Updated 2 years ago
- Reading list for research topics in Sound AI☆196Aug 8, 2024Updated last year
- ☆46Dec 17, 2018Updated 7 years ago
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Jan 16, 2025Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Audio Entailment: Deductive Reasoning for Audio Understanding☆17Dec 10, 2024Updated last year
- Acoustic event detection using recurrent neural networks.☆11Sep 4, 2018Updated 7 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆11Nov 6, 2024Updated last year
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Jul 24, 2023Updated 2 years ago
- ☆20Apr 11, 2019Updated 6 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp…☆131Jul 24, 2020Updated 5 years ago
- Myanmar lexicon analyzer - Sorting and Segmentation☆10Aug 11, 2021Updated 4 years ago
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)☆28Sep 16, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆68Sep 13, 2024Updated last year
- ☆10Aug 9, 2018Updated 7 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆31Apr 29, 2022Updated 3 years ago
- [IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer☆221Nov 30, 2025Updated 3 months ago
- ☆12Sep 25, 2023Updated 2 years ago
- ☆11Mar 28, 2023Updated 2 years ago
- ☆11Jul 4, 2024Updated last year
- Myanmar consonant and vowel audio files that I recorded at University of Computer Studies Banmaw☆11Mar 2, 2019Updated 7 years ago
- Official PyTorch implementation of the paper entitled 'Self Attentive Pooling for Efficient Deep Learning'.☆13May 3, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for paper Audio Visual Speaker Localization from EgoCentric Views☆11Jul 3, 2024Updated last year
- Code and Dataset for our CVPR 2022 paper "Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training"☆12Jul 8, 2022Updated 3 years ago
- ☆54Jun 3, 2020Updated 5 years ago
- Baseline code for DCASE 2023 task 4 B☆15Apr 21, 2023Updated 2 years ago
- The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations☆11Jan 17, 2023Updated 3 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification☆16Mar 20, 2024Updated 2 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Mar 30, 2020Updated 5 years ago
- ☆13Jan 3, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This is a [forked version] for author's debugging. Please jump to https://github.com/QualityAssessment/DOVER for stable version to use.☆14Oct 29, 2023Updated 2 years ago
- ☆11Oct 24, 2022Updated 3 years ago
- Tracking states of the arts and recent results (bibliography) on sound tasks.☆32Jan 10, 2023Updated 3 years ago
- ☆10Jul 16, 2024Updated last year
- ☆14Jan 31, 2023Updated 3 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆149Jul 13, 2023Updated 2 years ago
- Fast Contextual Scene Graph Generation with Unbiased Context Augmentation☆12Aug 7, 2023Updated 2 years ago