Selimonder / birdclef-2022
☆30 Updated 3 years ago
Alternatives and similar repositories for birdclef-2022
Users who are interested in birdclef-2022 are comparing it to the repositories listed below.
- Codebase for BirdCLEF 2023 solution ☆46 Updated 2 years ago
- ☆51 Updated 4 years ago
- BirdCLEF 2021 - Birdcall Identification 4th place solution ☆50 Updated 4 years ago
- PyTorch implementation of some learning rate schedulers for deep learning researchers. ☆91 Updated 3 years ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022) ☆148 Updated 3 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ☆90 Updated 5 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2 ☆90 Updated 3 years ago
- ☆89 Updated 4 years ago
- ☆29 Updated 4 years ago
- 1st Place solution to the Cornell Birdcall Identification competition. ☆153 Updated 5 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022 ☆117 Updated 3 years ago
- PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI ☆184 Updated 2 years ago
- [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning. ☆137 Updated last year
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model ☆26 Updated 2 years ago
- COLA contrastive pre-training method implemented in PyTorch ☆43 Updated 4 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen ☆70 Updated 3 years ago
- ☆12 Updated last year
- PyTorch implementation of the paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training ☆48 Updated 11 months ago
- [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition ☆263 Updated 2 years ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features ☆10 Updated 3 years ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation ☆224 Updated 2 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations ☆96 Updated last year
- Code for the 3rd place solution in the Feedback Prize - English Language Learning competition hosted on Kaggle ☆20 Updated 2 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration ☆34 Updated 4 years ago
- Dippy Synthetic Speech Subnet ☆17 Updated 3 months ago
- Small experiments with positional encoding ☆19 Updated 5 years ago
- ☆34 Updated last year
- ☆37 Updated last year
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation ☆17 Updated 2 years ago
- A PyTorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification ☆33 Updated 4 years ago