colaudiolab / AudioCILLinks
Welcome to AudioCIL, the toolbox for audio class-incremental learning with the most implemented methods.
☆32Updated 7 months ago
Alternatives and similar repositories for AudioCIL
Users that are interested in AudioCIL are comparing it to the libraries listed below
Sorting:
- official implementation of MGA-CLAP (ACM MM 2024)☆17Updated 9 months ago
- Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs…☆24Updated 5 months ago
- ☆13Updated last year
- ☆12Updated 2 years ago
- Continual Learning Method RWM for AAAI 2024☆22Updated 10 months ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆71Updated 11 months ago
- The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source cod…☆111Updated 3 weeks ago
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆57Updated 5 months ago
- ☆16Updated 3 months ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"☆32Updated 5 months ago
- This repository collects papers related to Speech Tokenizer.☆17Updated 9 months ago
- ☆13Updated 7 months ago
- Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)☆24Updated 5 months ago
- [ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning"☆28Updated 3 months ago
- code for A Large-scale Dataset for Audio-Language Representation Learning☆14Updated 10 months ago
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation☆43Updated last month
- Continual Learning Method RAWM for ICML 2023☆23Updated 10 months ago
- ☆8Updated 8 months ago
- The Official Code Repo for EgoOrientBench [CVPR25]☆13Updated 3 months ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆15Updated 8 months ago
- [CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation☆37Updated 11 months ago
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆42Updated last month
- [ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"☆89Updated 8 months ago
- Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge☆53Updated 4 months ago
- ☆20Updated 5 months ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆48Updated 2 months ago
- The Official benchmark for continual learning for deepfake audio detection☆20Updated 10 months ago
- Few-shot class-incremental audio classification, which can continually recognize novel audio classes without forgetting old ones☆9Updated 2 years ago
- Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix☆161Updated 2 months ago
- Collection of works for evaluating (and analyzing) large audio-language models (LALMs)☆33Updated this week