ilyassmoummad / ProtoCLR
PyTorch implementation of our work "Domain-Invariant Representation Learning of Bird Sounds" (arXiv 2024)
☆10 Updated 6 months ago
Alternatives and similar repositories for ProtoCLR
Users who are interested in ProtoCLR are comparing it to the repositories listed below.
- A benchmark dataset collection for bird sound classification ☆52 Updated last month
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations ☆92 Updated last year
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation ☆217 Updated 2 years ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners" ☆31 Updated 2 years ago
- COLA contrastive pre-training method implemented in PyTorch ☆43 Updated 4 years ago
- Splits for epic-sounds dataset ☆81 Updated last month
- Official PyTorch implementation of the "A Model You Can Hear: Audio Identification with Playable Prototypes" paper ☆37 Updated 3 years ago
- Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch ☆74 Updated 3 years ago
- ☆18 Updated 4 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity ☆25 Updated 2 years ago
- Evaluation script for VoxMovies dataset in PyTorch ☆23 Updated last year
- The PyTorch implementation of the paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training ☆42 Updated 8 months ago
- ☆13 Updated 3 years ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi… ☆38 Updated last year
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models". ☆55 Updated 2 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition ☆12 Updated 10 months ago
- Repo for Visual Acoustic Matching, CVPR 2022 ☆68 Updated 2 years ago
- Source code for the paper 'Audio Captioning Transformer' ☆56 Updated 3 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer ☆88 Updated 3 years ago
- Reference implementation of DecDTW in PyTorch (ICLR 2023) ☆23 Updated 2 years ago
- ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021. ☆61 Updated 3 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021. ☆93 Updated 2 years ago
- Python code for handling the Clotho dataset. ☆83 Updated 4 years ago
- iNatSounds Dataset ☆21 Updated 10 months ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image … ☆85 Updated last year
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch. ☆56 Updated last month
- ☆66 Updated 2 years ago
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods ☆19 Updated 2 years ago
- This repo contains the evaluation code for the INQUIRE benchmark ☆53 Updated 8 months ago
- The repo hosts the code and model of MAViL. ☆44 Updated 2 years ago