☆11 Dec 28, 2023 Updated 2 years ago
Alternatives and similar repositories for Audio-Free-P-Tuning
Users that are interested in Audio-Free-P-Tuning are comparing it to the libraries listed below
- ☆13 Jan 3, 2024 Updated 2 years ago
- official implementation of MGA-CLAP (ACM MM 2024) ☆30 Oct 25, 2024 Updated last year
- This repository collects papers related to Speech Tokenizer. ☆17 Oct 16, 2024 Updated last year
- KDD2024-WhoIsWho-Top3 ☆16 Jun 17, 2024 Updated last year
- This project made use of both intensity and phase information to recognize orbital angular momentum mode. ☆36 Jul 8, 2024 Updated last year
- Vabs-Net: Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein Chains ☆18 Sep 12, 2024 Updated last year
- The dataset and baseline code for Text-to-Audio Grounding (TAG) ☆50 Oct 23, 2025 Updated 4 months ago
- ☆28 Oct 17, 2024 Updated last year
- Prediction of sound event bounding boxes (SEBBs) ☆32 Aug 2, 2024 Updated last year
- This repository aims to collect Transformer-based sound event detection (SED) algorithms. ☆93 Feb 10, 2026 Updated 3 weeks ago
- ☆76 Mar 11, 2024 Updated last year
- The program ranked first in Audio-only track of DCASE2024 Challenge task3. ☆20 Updated this week
- ☆40 Feb 18, 2026 Updated 2 weeks ago
- ☆114 May 13, 2025 Updated 9 months ago
- ☆50 Apr 13, 2025 Updated 10 months ago
- ☆15 Feb 10, 2025 Updated last year
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 Oct 21, 2022 Updated 3 years ago
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial… ☆40 Jan 27, 2025 Updated last year
- Curated list for papers, codes and resources related to Text-to-Audio (TTA) Generation ☆69 Jan 22, 2026 Updated last month
- Visually-Aware Audio Captioning ☆43 Mar 3, 2023 Updated 3 years ago
- Public repository of Google Colab notebooks to use with Phenix ☆12 Mar 19, 2025 Updated 11 months ago
- Non-Intrusive Appliance Load Monitoring (NILM) based on Convolutional Neural Networks for PyTorch ☆11 Sep 5, 2020 Updated 5 years ago
- C++ PyTorch Examples ☆10 Aug 18, 2019 Updated 6 years ago
- Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval' ☆51 May 17, 2022 Updated 3 years ago
- ☆10 Oct 16, 2025 Updated 4 months ago
- ☆10 Sep 25, 2024 Updated last year
- Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T… ☆12 Mar 9, 2024 Updated last year
- lssvm python version ☆11 Nov 24, 2015 Updated 10 years ago
- Explaining audio differences using language ☆16 Feb 11, 2025 Updated last year
- This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection" ☆13 Jun 5, 2023 Updated 2 years ago
- Code for WACV24 work for multiview acoustic-visual detection ☆13 Mar 22, 2024 Updated last year
- Repository for "Training Audio Captioning Models without Audio" ☆10 Sep 26, 2023 Updated 2 years ago
- ☆43 Jan 13, 2025 Updated last year
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024) ☆43 Jun 13, 2024 Updated last year
- Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations ☆17 Mar 31, 2025 Updated 11 months ago
- Deepfake cross-lingual evaluation dataset (DECRO) is constructed to evaluate the influence of language differences on deepfake detection.… ☆16 Sep 14, 2023 Updated 2 years ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations ☆33 Oct 15, 2025 Updated 4 months ago
- kaldi cnn-tdnnf baseline ☆13 Aug 31, 2021 Updated 4 years ago
- Audio Entailment: Deductive Reasoning for Audio Understanding ☆17 Dec 10, 2024 Updated last year