a-nagrani / CVPR2020_PosterLinks
Speech2Action CVPR Poster Source Code
☆19Updated 5 years ago
Alternatives and similar repositories for CVPR2020_Poster
Users that are interested in CVPR2020_Poster are comparing it to the libraries listed below
Sorting:
- ☆61Updated 3 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Updated 2 years ago
- Test-Time Training on Video Streams☆64Updated 2 years ago
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency☆17Updated 3 years ago
- CVPR2022: Large-scale Video Panoptic Segmentation in the Wild: A Benchmark☆144Updated 2 years ago
- [ECCV 2022] Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation☆19Updated 3 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆31Updated 2 years ago
- Pytorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut"☆61Updated 2 years ago
- [CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN)☆51Updated 3 years ago
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning☆36Updated 3 years ago
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations☆26Updated 2 years ago
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)☆35Updated 2 years ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)☆34Updated 4 years ago
- A PyTorch implementation of TVC☆24Updated last year
- CCVS: Context-aware Controllable Video Synthesis☆22Updated 3 years ago
- This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"☆19Updated 2 years ago
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆108Updated last year
- ☆64Updated last year
- Video Instance Segmentation using Inter-Frame Communication Transformers (NeurIPS 2021)☆93Updated last year
- Official implementation of "Can Language Understand Depth?"☆82Updated 2 years ago
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"☆135Updated last week
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆85Updated 2 years ago
- code base for vision transformers☆36Updated 3 years ago
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆34Updated 3 years ago
- Official PyTorch implementation of MultiSiam in ICCV 2021 (https://arxiv.org/abs/2108.12178)☆22Updated 3 years ago
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"☆111Updated 5 years ago
- Code repository for "It's About Time: Analog clock Reading in the Wild"☆78Updated last year
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.☆118Updated last year
- Rethinking Self-Supervised Correspondence Learning: A Video Frame-level Similarity Perspective, in ICCV 2021 (Oral)☆146Updated 3 years ago