Xianchao-Wu / perceiver-pytorchLinks
☆46Updated 4 years ago
Alternatives and similar repositories for perceiver-pytorch
Users that are interested in perceiver-pytorch are comparing it to the libraries listed below
Sorting:
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆77Updated last year
- Official Code of ECCV 2022 paper MS-CLIP☆91Updated 3 years ago
- Keras implement of Finite Scalar Quantization☆83Updated 2 years ago
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".☆44Updated 4 years ago
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆197Updated 3 years ago
- code for Explicit Sparse Transformer☆61Updated 2 years ago
- ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021.☆62Updated 4 years ago
- ☆17Updated 2 years ago
- This repository contains the code for our ECCV 2022 paper "Temporal and cross-modal attention for audio-visual zero-shot learning"☆25Updated 4 months ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆53Updated 3 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Updated 4 years ago
- ☆85Updated 3 years ago
- Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch☆71Updated 5 years ago
- ☆73Updated 3 years ago
- ☆26Updated 4 years ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 3 years ago
- Generalized Deep Metric Learning.☆36Updated 3 years ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Updated 2 years ago
- [ACCV 2024] Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, M…☆27Updated 11 months ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆33Updated 2 years ago
- Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".☆58Updated 2 years ago
- PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)☆124Updated 2 years ago
- A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…☆29Updated 4 years ago
- ☆12Updated 2 years ago
- For paper《Gaussian Transformer: A Lightweight Approach for Natural Language Inference》☆28Updated 5 years ago
- ☆31Updated 4 years ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Updated 3 years ago
- Code for the EMNLP 2021 Oral paper "Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search" https://arx…☆12Updated 2 years ago
- ☆23Updated 5 years ago
- [CVPR'23 Highlight] AutoAD: Movie Description in Context.☆102Updated last year