ilyassmoummad / ProtoCLRLinks
Pytorch implementation of our work "Domain-Invariant Representation Learning of Bird Sounds" (arXiv 2024)
☆10Updated 7 months ago
Alternatives and similar repositories for ProtoCLR
Users that are interested in ProtoCLR are comparing it to the libraries listed below
Sorting:
- A benchmark dataset collection for bird sound classification☆55Updated last month
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆95Updated last year
- ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021.☆62Updated 3 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity☆25Updated 2 years ago
- Official Pytorch implementation of the "A Model You Can Hear: Audio Identification with Playable Prototypes" paper☆37Updated 3 years ago
- Splits for epic-sounds dataset☆83Updated 2 months ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆218Updated 2 years ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31Updated 2 years ago
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models".☆55Updated 2 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆85Updated last year
- Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"☆153Updated 10 months ago
- Reference implementation of DecDTW in PyTorch (ICLR 2023)☆23Updated 2 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 9 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆38Updated last year
- Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)☆192Updated 3 years ago
- ☆47Updated last year
- ☆15Updated 2 years ago
- Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.o…☆46Updated 4 years ago
- The repo host the code and model of MAViL.☆44Updated 2 years ago
- Project website for "Telling left from right: Learning spatial correspondence between sight and sound"☆23Updated 3 years ago
- COLA contrastive pre-training method implemented in PyTorch☆43Updated 4 years ago
- Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation☆121Updated 2 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆116Updated 2 years ago
- Official PyTorch Implementation☆18Updated 2 years ago
- ☆13Updated 3 years ago
- Source code for the paper 'Audio Captioning Transformer'☆57Updated 3 years ago
- Evaluation script for VoxMovies dataset in PyTorch☆23Updated last year
- Python code for handling the Clotho dataset.☆84Updated 4 years ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated last year
- Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".☆52Updated 2 months ago