☆13Jan 3, 2024Updated 2 years ago
Alternatives and similar repositories for LGC-SED
Users that are interested in LGC-SED are comparing it to the libraries listed below
Sorting:
- ☆11Dec 28, 2023Updated 2 years ago
- official implementation of MGA-CLAP (ACM MM 2024)☆30Oct 25, 2024Updated last year
- ☆20Mar 6, 2025Updated last year
- Vabs-Net: Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein Chains☆18Sep 12, 2024Updated last year
- ☆28Oct 17, 2024Updated last year
- Polyphonic Sound Detection Score (PSDS)☆15Jan 20, 2020Updated 6 years ago
- KDD2024-WhoIsWho-Top3☆16Jun 17, 2024Updated last year
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆159Aug 24, 2025Updated 6 months ago
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆50Oct 23, 2025Updated 4 months ago
- baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift☆18Mar 16, 2024Updated last year
- ☆95Jun 22, 2023Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging☆57Jun 12, 2025Updated 8 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- Prediction of sound event bounding boxes (SEBBs)☆32Aug 2, 2024Updated last year
- Implement CBAM:Convolutional Block Attention Module with TensorFlow☆27Jan 17, 2019Updated 7 years ago
- ☆15Feb 10, 2025Updated last year
- ☆11Jul 4, 2024Updated last year
- Code for MME-SID accepted to CIKM 2025 Full Research track.☆27Oct 29, 2025Updated 4 months ago
- ☆10Sep 25, 2024Updated last year
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- Public repository of Google Colab notebooks to use with Phenix☆12Mar 19, 2025Updated 11 months ago
- ☆32Updated this week
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆20Aug 1, 2025Updated 7 months ago
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- ☆16Oct 9, 2024Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago
- [NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?☆43Jun 9, 2024Updated last year
- Non-Intrusive Appliance Load Monitoring (NILM) based on Convolutional Neural Networks for PyTorch☆11Sep 5, 2020Updated 5 years ago
- A Framework for Symbolic MUsic Graph Explanations☆10Jul 30, 2025Updated 7 months ago
- quagga☆10Apr 7, 2020Updated 5 years ago
- Reading list for research topics in Sound AI☆196Aug 8, 2024Updated last year
- "Roll with the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning" by Yue Duan (AAAI 2024…☆13Nov 20, 2025Updated 3 months ago
- Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations☆17Mar 31, 2025Updated 11 months ago
- Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding☆19May 5, 2025Updated 10 months ago
- ☆13Jul 28, 2023Updated 2 years ago
- ☆11Jul 6, 2022Updated 3 years ago
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆15Oct 2, 2025Updated 5 months ago
- ☆13Jul 4, 2022Updated 3 years ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year