ilyassmoummad / ProtoCLR
PyTorch implementation of our work "Domain-Invariant Representation Learning of Bird Sounds" (arXiv 2024)
☆10 Updated 7 months ago
Alternatives and similar repositories for ProtoCLR
Users who are interested in ProtoCLR are comparing it to the libraries listed below
- A benchmark dataset collection for bird sound classification ☆54 Updated last week
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations ☆94 Updated last year
- Evaluation script for VoxMovies dataset in PyTorch ☆23 Updated last year
- Splits for epic-sounds dataset ☆83 Updated last month
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models". ☆55 Updated 2 years ago
- iNatSounds Dataset ☆21 Updated 10 months ago
- ☆13 Updated 3 years ago
- Project website for "Telling left from right: Learning spatial correspondence between sight and sound" ☆23 Updated 3 years ago
- ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021. ☆62 Updated 3 years ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners" ☆31 Updated 2 years ago
- COLA contrastive pre-training method implemented in PyTorch ☆43 Updated 4 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image … ☆85 Updated last year
- Official PyTorch implementation of the "A Model You Can Hear: Audio Identification with Playable Prototypes" paper ☆37 Updated 3 years ago
- The PyTorch implementation of the paper "Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training" ☆42 Updated 9 months ago
- (Hybrid) BYOL-S feature extractor using the serab-byols package in PyTorch. ☆27 Updated last year
- Source code for the paper 'Audio Captioning Transformer' ☆56 Updated 3 years ago
- Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark. ☆32 Updated 2 years ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi… ☆38 Updated last year
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986 ☆48 Updated 11 months ago
- ☆18 Updated 4 years ago
- ☆42 Updated 2 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition ☆13 Updated 11 months ago
- Official implementation of the FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association" ☆19 Updated last year
- Official implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" ☆152 Updated 9 months ago
- ☆65 Updated 3 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity ☆25 Updated 2 years ago
- This repo hosts the code and model of MAViL. ☆44 Updated 2 years ago
- Repo for Visual Acoustic Matching, CVPR 2022 ☆68 Updated 2 years ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization" ☆120 Updated 3 years ago
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods ☆19 Updated 2 years ago