Ming-er / LGC-SEDView external linksLinks
☆13Jan 3, 2024Updated 2 years ago
Alternatives and similar repositories for LGC-SED
Users that are interested in LGC-SED are comparing it to the libraries listed below
Sorting:
- ☆11Dec 28, 2023Updated 2 years ago
- official implementation of MGA-CLAP (ACM MM 2024)☆28Oct 25, 2024Updated last year
- Vabs-Net: Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein Chains☆18Sep 12, 2024Updated last year
- AAAI 2025☆16Dec 13, 2024Updated last year
- ☆76Mar 11, 2024Updated last year
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆50Oct 23, 2025Updated 3 months ago
- baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift☆18Mar 16, 2024Updated last year
- Source code for Consistent ensemble distillation for audio tagging☆56Jun 12, 2025Updated 8 months ago
- This repository aims to collect Transformer-based sound event detection (SED) algorithms.☆88Nov 4, 2025Updated 3 months ago
- Prediction of sound event bounding boxes (SEBBs)☆32Aug 2, 2024Updated last year
- ☆37Jul 4, 2024Updated last year
- ☆113May 13, 2025Updated 9 months ago
- ☆15Feb 10, 2025Updated last year
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- ☆13Aug 28, 2024Updated last year
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- Public repository of Google Colab notebooks to use with Phenix☆12Mar 19, 2025Updated 10 months ago
- Agentic Keyframe Search for Video Question Answering☆15Apr 7, 2025Updated 10 months ago
- ☆11Sep 25, 2024Updated last year
- quagga☆10Apr 7, 2020Updated 5 years ago
- Code for MME-SID accepted to CIKM 2025 Full Research track.☆27Oct 29, 2025Updated 3 months ago
- Non-Intrusive Appliance Load Monitoring (NILM) based on Convolutional Neural Networks for PyTorch☆11Sep 5, 2020Updated 5 years ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆20Aug 1, 2025Updated 6 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 4 months ago
- ☆16Oct 9, 2024Updated last year
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆45May 9, 2022Updated 3 years ago
- Reading list for research topics in Sound AI☆196Aug 8, 2024Updated last year
- [EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank P…☆14Mar 4, 2025Updated 11 months ago
- Official PyTorch implementation of the paper entitled 'Self Attentive Pooling for Efficient Deep Learning'.☆13May 3, 2024Updated last year
- Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations☆17Mar 31, 2025Updated 10 months ago
- Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding☆19May 5, 2025Updated 9 months ago
- ☆11Jul 6, 2022Updated 3 years ago
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago
- Official repo for pUniFind, the first open de novo sequencing and open database search rescoring deep learning model.☆29Feb 1, 2026Updated last week
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring☆24Aug 8, 2025Updated 6 months ago
- ☆38Dec 19, 2025Updated last month
- 针对CN-Celeb数据集的基于ECAPA-TDNN的说话人识别的pytorch实现☆13Apr 3, 2023Updated 2 years ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- ☆13Jul 4, 2022Updated 3 years ago