colaudiolab / AudioCILLinks
Welcome to AudioCIL, the toolbox for audio class-incremental learning with the most implemented methods.
☆32Updated 6 months ago
Alternatives and similar repositories for AudioCIL
Users that are interested in AudioCIL are comparing it to the libraries listed below
Sorting:
- ☆20Updated 4 months ago
- This repository is the official implementation of our paper "Improving Generalization for AI-Synthesized Voice Detection", which has been…☆16Updated last month
- Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs…☆21Updated 4 months ago
- ☆24Updated 9 months ago
- Multimodal Classification and Out-of-distribution Detection☆13Updated 3 months ago
- [NeurIPS 2024] Code, Dataset, Samples for the VATT paper “ Tell What You Hear From What You See - Video to Audio Generation Through Text”☆15Updated this week
- Using Pre-trained SSL Transformer Models for Speaker Verification☆9Updated 9 months ago
- The first opensource platform for multimodal intent analysis☆9Updated 6 months ago
- Training code for MaskGCT-T2S model.☆20Updated 7 months ago
- ☆8Updated 7 months ago
- MBTI dataset,Sentiment Dataset,Micro Emotion,微博情感数据集,multi-label Chinese affective computing dataset. personality traits with six emotion…☆13Updated last month
- ☆16Updated last month
- Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)☆23Updated 4 months ago
- official implementation of MGA-CLAP (ACM MM 2024)☆17Updated 8 months ago
- 本项目主要是2025届浙江大学软件学院夏令营(AI营)的考核项目☆11Updated 4 months ago
- 本项目实现了一个完整的声源定位与声压级分析系统,包括波束形成、DAMAS系列算法以及FISTA算法等多种声源定位方法。系统能够处理多频率声源信号,生成声源定位图像,并分析不同方法下的声压级特性.☆12Updated 6 months ago
- This repository collects papers related to Speech Tokenizer.☆17Updated 9 months ago
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆57Updated 5 months ago
- ☆12Updated 2 years ago
- [ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning"☆27Updated 2 months ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"☆31Updated 4 months ago
- [ICASSP 2025] "Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attentio…☆22Updated 2 months ago
- The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source cod…☆109Updated last year
- Continual Learning Method RAWM for ICML 2023☆23Updated 9 months ago
- This is a general framework for fake audio detection using pytorch lightning☆23Updated last week
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆70Updated 10 months ago
- This is the official repository of ``Scalable Neural Vocoder from Range-Null Space Decomposition'', which is submitted to TPAMI.☆10Updated 6 months ago
- Pytorch implementation of "LEVERAGING POSITIONAL-RELATED LOCAL-GLOBAL DEPENDENCY FOR SYNTHETIC SPEECH DETECTION"☆31Updated last year
- The Official benchmark for continual learning for deepfake audio detection☆20Updated 9 months ago
- Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.☆19Updated 7 months ago