Sara-Ahmed / ASiT
ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation
☆21Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for ASiT
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- Learning differentiable temporal resolution on time-series data.☆32Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- ☆54Updated last month
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆31Updated 5 months ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆34Updated last month
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆22Updated 7 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆35Updated 3 months ago
- Stable Audio UnOffical Implementation: Latent Diffusion for Audio Generation☆23Updated 8 months ago
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆18Updated last year
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆28Updated last month
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆15Updated 2 years ago
- A neural speech codec based on discrete WavLM representations☆21Updated 2 months ago
- ☆48Updated last year
- AudioLDM training, finetuning, evaluation and inference.☆12Updated 7 months ago
- experiments about AudioSet☆43Updated last year
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆27Updated 2 years ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆42Updated this week
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆39Updated last year
- TODO☆34Updated last year
- EVAR ~ Evaluation package for Audio Representations☆43Updated this week
- ☆12Updated 2 years ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆78Updated 7 months ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆39Updated last year
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆83Updated 2 years ago
- SRTNet☆24Updated last year
- ☆58Updated last year
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆21Updated last month
- ☆15Updated 2 years ago