a-nagrani / CVPR2020_Poster
Speech2Action CVPR Poster Source Code
☆19Updated 4 years ago
Alternatives and similar repositories for CVPR2020_Poster:
Users who are interested in CVPR2020_Poster are comparing it to the repositories listed below
- [ECCV 2022] Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation☆19Updated 2 years ago
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆32Updated 2 years ago
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency☆17Updated 3 years ago
- Code for Point-Level Region Contrast (https://arxiv.org/abs/2202.04639)☆34Updated 2 years ago
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning☆33Updated 2 years ago
- CCVS: Context-aware Controllable Video Synthesis☆22Updated 3 years ago
- CVPR 2021 VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild☆29Updated 2 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Updated 2 years ago
- ☆38Updated last year
- ☆62Updated 3 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- Official PyTorch implementation of MultiSiam in ICCV 2021 (https://arxiv.org/abs/2108.12178)☆22Updated 3 years ago
- PyTorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut"☆56Updated last year
- [CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN)☆52Updated 2 years ago
- code base for vision transformers☆36Updated 3 years ago
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations☆26Updated last year
- ☆57Updated last year
- MIST: Multiple Instance Spatial Transformer☆25Updated 3 years ago
- [NeurIPS'22] ReCo: Retrieve and Co-segment for Zero-shot Transfer☆61Updated last year
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated last year
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 2 years ago
- Teach-DETR: Better Training DETR with Teachers☆30Updated 10 months ago
- Posters for all 235 cvpr2023 highlight papers☆27Updated 6 months ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)☆33Updated 3 years ago
- Official codes for ConMIM (ICLR 2023)☆58Updated last year
- ☆18Updated 2 years ago
- Video Instance Segmentation using Inter-Frame Communication Transformers (NeurIPS 2021)☆90Updated last year
- Self-supervised Point Cloud Representation Learning via Separating Mixed Shapes☆19Updated last year
- PIT: Position-Invariant Transform for Cross-FoV Domain Adaptation☆25Updated 3 years ago
- ☆11Updated 3 years ago