arxyzan / data2vec-pytorch
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
☆177Updated last year
Alternatives and similar repositories for data2vec-pytorch:
Users that are interested in data2vec-pytorch are comparing it to the libraries listed below
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆113Updated 2 years ago
- Unofficial PyTorch Implementation for pNLP-Mixer: an Efficient all-MLP Architecture for Language (https://arxiv.org/abs/2202.04350)☆62Updated 3 years ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆211Updated 2 years ago
- Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".☆256Updated last year
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆97Updated last year
- This repo hosts the code and models of "Masked Autoencoders that Listen".☆581Updated last year
- ☆164Updated 2 years ago
- A simple cross attention that updates both the source and target in one step☆169Updated 11 months ago
- Implementation of Fast Transformer in Pytorch☆173Updated 3 years ago
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆36Updated last year
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆90Updated 10 months ago
- 🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps☆163Updated 2 months ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆144Updated last year
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)