arxyzan / data2vec-pytorch
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
☆177Updated last year
Alternatives and similar repositories for data2vec-pytorch:
Users that are interested in data2vec-pytorch are comparing it to the libraries listed below
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆97Updated last year
- ☆164Updated 2 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆112Updated 2 years ago
- A simple cross attention that updates both the source and target in one step☆166Updated 10 months ago
- An implementation of local windowed attention for language modeling☆431Updated 2 months ago
- 🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps☆159Updated last month
- Sequence modeling with Mega.☆295Updated 2 years ago
- PyTorch implementation of some learning rate schedulers for deep learning researcher.☆89Updated 2 years ago
- [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition☆250Updated 2 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆225Updated 2 years ago
- This repo hosts the code and models of "Masked Autoencoders that Listen".☆576Updated 11 months ago
- Implementation of Linformer for Pytorch☆276Updated last year
- The repo host the code and model of MAViL.☆42Updated last year
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆76Updated 4 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆204Updated last year
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆188Updated 2 years ago
- Official code for Wav2Seq☆96Updated 2 years ago
- A minimal pytorch package implementing a gradient reversal layer.☆158Updated 4 months ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆90Updated 9 months ago
- Implementation of Memformer, a Memory-augmented Transformer, in Pytorch☆115Updated 4 years ago
- Relative Positional Encoding for Transformers with Linear Complexity☆62Updated 3 years ago
- Implement the paper "Self-Attention with Relative Position Representations"☆128Updated 4 years ago
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆36Updated last year
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆66Updated 2 years ago
- Implementation of the convolutional module from the Conformer paper, for use in Transformers☆389Updated last year
- Implementation of Fast Transformer in Pytorch☆173Updated 3 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆70Updated 2 years ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆42Updated last year
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆210Updated last year
- Unofficial PyTorch Implementation for pNLP-Mixer: an Efficient all-MLP Architecture for Language (https://arxiv.org/abs/2202.04350)☆63Updated 3 years ago