gaasher / data2vec2.0_vision
Implementation of Dat2Vec2.0 for vision
☆16Updated last year
Related projects ⓘ
Alternatives and complementary repositories for data2vec2.0_vision
- A 1D implementation of a deformable convolutional layer in PyTorch with a few tricks.☆36Updated last year
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆181Updated last year
- PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI☆172Updated last year
- PyTorch implementation of Representation Learning with Contrastive Predictive Coding by Van den Oord et al. (2018)☆82Updated 2 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆87Updated 4 months ago
- Official repository for "Orthogonal Projection Loss" (ICCV'21)☆115Updated 2 years ago
- Implementations of Recent Papers in Computer Vision☆39Updated 2 years ago
- [BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?"☆142Updated last year
- Implementation of ResMLP, an all MLP solution to image classification, in Pytorch☆196Updated last year
- Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆28Updated last year
- PyTorch implementation of some learning rate schedulers for deep learning researcher.☆86Updated 2 years ago
- Deep Learning Model for Signal Data☆83Updated 5 years ago
- A Domain-Agnostic Benchmark for Self-Supervised Learning☆107Updated last year
- (NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…☆16Updated 11 months ago
- ☆24Updated 2 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆115Updated last year
- Implementation of self-supervised image-level contrastive pretraining methods using Keras.☆69Updated 3 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆25Updated 4 years ago
- Implementation of Nyström Self-attention, from the paper Nyströmformer☆122Updated 9 months ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 2 years ago
- Official codes: Self-Supervised Learning by Estimating Twin Class Distribution☆96Updated 2 years ago
- ☆164Updated last year
- A simple cross attention that updates both the source and target in one step☆150Updated 6 months ago
- Implementation of Visual Transformer for Small-size Datasets☆117Updated 2 years ago
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆188Updated 3 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆140Updated last year
- A TensorFlow 2.x implementation of Masked Autoencoders Are Scalable Vision Learners☆76Updated 2 years ago
- Implementation of ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks, ICML 2021.☆140Updated 3 years ago
- Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.o…☆44Updated 3 years ago
- PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes☆48Updated 6 months ago