haoheliu / diffres-python
Learning differentiable temporal resolution on time-series data.
☆32Updated last year
Related projects ⓘ
Alternatives and complementary repositories for diffres-python
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆31Updated 5 months ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆34Updated last month
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆39Updated last year
- ☆62Updated last month
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Updated 2 years ago
- ☆18Updated 2 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆83Updated 2 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆44Updated 2 years ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆18Updated 10 months ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆105Updated 2 months ago
- For students who would like to apply for RA, PhD, postdoc in audio research.☆24Updated 2 weeks ago
- ☆41Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆33Updated 7 months ago
- Pytorch implementation of subband decomposition☆89Updated 2 years ago
- experiments about AudioSet☆43Updated last year
- EVAR ~ Evaluation package for Audio Representations☆43Updated this week
- ☆26Updated last year
- Inference code for PaSST, using the HEAR API.☆29Updated 10 months ago
- ☆45Updated last year
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆27Updated 2 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- ☆64Updated last year
- AudioLDM training, finetuning, evaluation and inference.☆12Updated 7 months ago
- Code for CVSSP submission to DCASE 2021 Task 6☆35Updated last year
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆21Updated 7 months ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆28Updated last month
- Query-conditioned target sound extraction model☆16Updated last week
- ☆54Updated last month
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆32Updated last year