kampta / PatchVAELinks
PyTorch implementation of "PatchVAE: Learning Local Latent Codes for Recognition" to appear in CVPR 2020
☆14Updated 5 years ago
Alternatives and similar repositories for PatchVAE
Users that are interested in PatchVAE are comparing it to the libraries listed below
Sorting:
- ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021.☆62Updated 3 years ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆78Updated 3 years ago
- ☆66Updated 2 years ago
- Robust Contrastive Learning Using Negative Samples with Diminished Semantics (NeurIPS 2021)☆39Updated 3 years ago
- We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances…☆47Updated 4 years ago
- PyTorch implementation of FiLM: Visual Reasoning with a General Conditioning Layer☆63Updated 5 years ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆52Updated 3 years ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆65Updated 3 years ago
- ☆30Updated 2 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆78Updated 3 years ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 3 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- ☆81Updated last year
- [NeurIPS'21] "Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly", Tianlong Chen, Yu Cheng, Zhe …☆84Updated 3 years ago
- Source code for the paper "Structured Attention Graphs for Understanding Deep Image Classifications"☆30Updated 3 years ago
- [TMLR 2022] High-Modality Multimodal Transformer☆117Updated 11 months ago
- ☆34Updated 2 years ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Updated last year
- ImageNetV2 Pytorch Dataset☆41Updated 2 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆33Updated 2 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆48Updated last year
- PyTorch implementation of D2C: Diffuison-Decoding Models for Few-shot Conditional Generation.☆125Updated 3 years ago
- ☆120Updated 2 years ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆35Updated 3 years ago
- ☆16Updated 2 years ago
- ☆53Updated 2 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity☆25Updated 2 years ago
- Visual Representation Learning Benchmark for Self-Supervised Models☆35Updated last year
- ☆85Updated 3 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆90Updated 3 years ago