Selimonder / birdclef-2022Links
☆30Updated 3 years ago
Alternatives and similar repositories for birdclef-2022
Users that are interested in birdclef-2022 are comparing it to the libraries listed below
Sorting:
- Codebase for BirdClef 2023 solution☆46Updated 2 years ago
- BirdCLEF 2021 - Birdcall Identification 4th place solution☆50Updated 4 years ago
- ☆51Updated 4 years ago
- PyTorch implementation of some learning rate schedulers for deep learning researcher.☆90Updated 2 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆115Updated 2 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated 2 years ago
- [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.☆139Updated last year
- ☆88Updated 4 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆90Updated 3 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 7 months ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆142Updated 2 years ago
- COLA contrastive pre-training method implemented in PyTorch☆43Updated 4 years ago
- An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras☆44Updated 3 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆66Updated 2 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.☆51Updated 2 years ago
- A Pytorch Implementations for Various Vector Quantization Methods☆30Updated 3 years ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆214Updated 2 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆92Updated last year
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Updated 4 years ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆11Updated 2 years ago
- 6th Place Solution for the Google - Isolated Sign Language Recognition Kaggle Competition☆13Updated 2 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated 2 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆83Updated 4 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆77Updated last year
- PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI☆180Updated 2 years ago
- Dippy Synthetic Speech Subnet☆16Updated last month
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Updated 4 years ago
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆96Updated last month