arxyzan / data2vec-pytorchLinks
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
☆177Updated 2 years ago
Alternatives and similar repositories for data2vec-pytorch
Users that are interested in data2vec-pytorch are comparing it to the libraries listed below
Sorting:
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆115Updated 2 years ago
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆36Updated last year
- An implementation of local windowed attention for language modeling☆450Updated 4 months ago
- Implementation of Linformer for Pytorch☆286Updated last year
- ☆163Updated 2 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆225Updated 3 years ago
- Official code for Wav2Seq☆96Updated 2 years ago
- Sequence modeling with Mega.☆295Updated 2 years ago
- [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition☆252Updated 2 years ago
- [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.☆139Updated last year
- A simple cross attention that updates both the source and target in one step☆171Updated last year
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆119Updated 3 years ago
- 🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps☆172Updated 3 months ago
- This repository implements SummaryMixing, a simpler, faster and much cheaper replacement to self-attention for automatic speech recogniti…☆117Updated 8 months ago
- A minimal pytorch package implementing a gradient reversal layer.☆158Updated 7 months ago
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆96Updated last year
- PyTorch implementation of some learning rate schedulers for deep learning researcher.☆90Updated 2 years ago
- Axial Positional Embedding for Pytorch☆81Updated 3 months ago
- The repo host the code and model of MAViL.☆42Updated last year
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆204Updated last year
- Implementation of Memformer, a Memory-augmented Transformer, in Pytorch☆117Updated 4 years ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆141Updated 2 years ago
- An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper☆18Updated 2 years ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆39Updated 4 years ago
- Implementation of Fast Transformer in Pytorch☆174Updated 3 years ago
- Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning☆160Updated last year
- Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".☆258Updated last year
- Gradient Reversal Layer for Domain Adaptation☆119Updated 2 years ago
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models".☆55Updated 2 years ago
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆193Updated 2 years ago