arxyzan / data2vec-pytorch
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
☆172Updated last year
Related projects ⓘ
Alternatives and complementary repositories for data2vec-pytorch
- Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".☆232Updated 7 months ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆109Updated last year
- An implementation of local windowed attention for language modeling☆383Updated 2 months ago
- [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.☆137Updated 8 months ago
- Sequence modeling with Mega.☆297Updated last year
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆35Updated 11 months ago
- Unofficial PyTorch Implementation for pNLP-Mixer: an Efficient all-MLP Architecture for Language (https://arxiv.org/abs/2202.04350)☆62Updated 2 years ago
- A minimal pytorch package implementing a gradient reversal layer.☆155Updated this week
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆116Updated 3 years ago
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆95Updated last year
- Implementation of Dat2Vec2.0 for vision☆16Updated last year
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆203Updated last year
- A simple cross attention that updates both the source and target in one step☆150Updated 6 months ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆37Updated 3 years ago
- The repo host the code and model of MAViL.☆42Updated last year
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆222Updated 2 years ago
- ☆216Updated 3 years ago
- Implementation of Linformer for Pytorch☆255Updated 10 months ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆205Updated last year
- PyTorch implementation of some learning rate schedulers for deep learning researcher.☆86Updated 2 years ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 2 years ago
- Official code for Wav2Seq☆95Updated 2 years ago
- [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition☆245Updated last year
- Audio Captioning datasets for PyTorch.☆105Updated this week
- Unofficial PyTorch implementation of Google's FNet: Mixing Tokens with Fourier Transforms. With checkpoints.☆67Updated 2 years ago
- Gradient Reversal Layer for Domain Adaptation☆106Updated last year
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆132Updated 9 months ago
- Additive margin softmax loss in pytorch☆45Updated 5 years ago
- This repo hosts the code and models of "Masked Autoencoders that Listen".☆541Updated 7 months ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆348Updated 3 years ago