arxyzan / data2vec-pytorchLinks
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
☆182Updated 2 years ago
Alternatives and similar repositories for data2vec-pytorch
Users that are interested in data2vec-pytorch are comparing it to the libraries listed below
Sorting:
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆116Updated 2 years ago
- [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition☆261Updated 2 years ago
- Implementation of the convolutional module from the Conformer paper, for use in Transformers☆429Updated 2 years ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆147Updated 2 years ago
- ☆72Updated 4 years ago
- Official code for Wav2Seq☆97Updated 3 years ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆221Updated 2 years ago
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆96Updated 2 years ago
- A simple cross attention that updates both the source and target in one step☆185Updated 3 months ago
- A minimal pytorch package implementing a gradient reversal layer.☆158Updated last year
- PyTorch implementation of some learning rate schedulers for deep learning researcher.☆91Updated 3 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆88Updated 5 years ago
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆37Updated last year
- Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning☆165Updated last year
- [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.☆138Updated last year
- COLA contrastive pre-training method implemented in PyTorch☆43Updated 4 years ago
- Unofficial PyTorch Implementation for pNLP-Mixer: an Efficient all-MLP Architecture for Language (https://arxiv.org/abs/2202.04350)☆64Updated 3 years ago
- An implementation of local windowed attention for language modeling☆484Updated 4 months ago
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms☆260Updated 4 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆96Updated last year
- Implementation of Fast Transformer in Pytorch☆177Updated 4 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆206Updated 2 years ago
- Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)☆192Updated 3 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆43Updated 11 months ago
- Python code for handling the Clotho dataset.☆85Updated 4 years ago
- Unofficial PyTorch implementation of Google's FNet: Mixing Tokens with Fourier Transforms. With checkpoints.☆77Updated 3 years ago
- The repo host the code and model of MAViL.☆44Updated 2 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆228Updated 3 years ago
- 🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps☆194Updated last month
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Updated 2 years ago