Selimonder / birdclef-2022
☆30 Updated 3 years ago
Alternatives and similar repositories for birdclef-2022
Users who are interested in birdclef-2022 are comparing it to the repositories listed below.
- Codebase for BirdClef 2023 solution ☆46 Updated 2 years ago
- ☆51 Updated 4 years ago
- PyTorch implementation of some learning rate schedulers for deep learning researcher. ☆91 Updated 2 years ago
- BirdCLEF 2021 - Birdcall Identification 4th place solution ☆50 Updated 4 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2 ☆90 Updated 3 years ago
- COLA contrastive pre-training method implemented in PyTorch ☆43 Updated 4 years ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022) ☆144 Updated 2 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022 ☆115 Updated 2 years ago
- [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning. ☆138 Updated last year
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ☆86 Updated 4 years ago
- ☆12 Updated last year
- 1st Place solution to the Cornell Birdcall Identification competition. ☆153 Updated 4 years ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features ☆10 Updated 2 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model ☆26 Updated 2 years ago
- ☆89 Updated 4 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen ☆69 Updated 3 years ago
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial… ☆40 Updated 7 months ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation". ☆148 Updated 2 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations ☆92 Updated last year
- PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI ☆181 Updated 2 years ago
- [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition ☆259 Updated 2 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification ☆33 Updated 4 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training ☆42 Updated 8 months ago
- ☆28 Updated 3 years ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi… ☆38 Updated last year
- 6th Place Solution for the Google - Isolated Sign Language Recognition Kaggle Competition ☆13 Updated 2 years ago
- ☆49 Updated 2 years ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation ☆217 Updated 2 years ago
- ☆18 Updated 2 years ago
- Dippy Synthetic Speech Subnet ☆17 Updated 3 weeks ago