alireza-nasiri / SoundCLRLinks
Implementation for "SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound Classification," in pytorch.
☆28Updated 2 years ago
Alternatives and similar repositories for SoundCLR
Users that are interested in SoundCLR are comparing it to the libraries listed below
Sorting:
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆28Updated last year
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆65Updated last year
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Updated 4 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging☆51Updated last year
- A toolkit for researchers in the multimodal sound separation.☆16Updated 2 years ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆16Updated 4 years ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31Updated 2 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆48Updated 3 years ago
- ☆15Updated 7 months ago
- MSP-Podcast Challenge Baseline Code☆30Updated last year
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Updated 3 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆50Updated last year
- ☆29Updated 3 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆89Updated 3 years ago
- experiments about AudioSet☆43Updated 2 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆45Updated 3 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Updated 3 years ago
- ☆95Updated 2 years ago
- Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge☆60Updated 10 months ago
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆21Updated 7 months ago
- Unofficial implementation of FSD50k baselines for Sound Event Recognition☆26Updated last year
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)☆30Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Updated 2 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆71Updated 3 years ago
- ☆13Updated last year
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆21Updated 3 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆57Updated 3 months ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆35Updated 2 years ago