a-nagrani / CVPR2020_PosterView external linksLinks
Speech2Action CVPR Poster Source Code
☆20Apr 29, 2020Updated 5 years ago
Alternatives and similar repositories for CVPR2020_Poster
Users that are interested in CVPR2020_Poster are comparing it to the libraries listed below
Sorting:
- LaTex Poster for SDPS-Net (CVPR 2019)☆36Jun 11, 2019Updated 6 years ago
- LaTex Poster for TOM-Net (CVPR 2018)☆47Jul 31, 2018Updated 7 years ago
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 3 years ago
- LaTeX Poster and Slides for AMP (CVPR 2021)☆32May 31, 2021Updated 4 years ago
- C3P_code☆11Sep 30, 2022Updated 3 years ago
- [CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".☆54May 25, 2025Updated 8 months ago
- Course review and timetable planning platform used by thousands of CUHK students☆13Aug 19, 2024Updated last year
- Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Jun 5, 2024Updated last year
- Latex template for CUHK PhD Thesis☆11Jun 29, 2025Updated 7 months ago
- ☆10Oct 24, 2024Updated last year
- ☆65May 31, 2025Updated 8 months ago
- [IEEE TVT] FII-CenterNet: an anchor-free detector with foreground attention for traffic object detection☆13Jun 11, 2021Updated 4 years ago
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024☆11Nov 7, 2024Updated last year
- ☆11Jul 16, 2024Updated last year
- pytorch implementation of Semantics-AssistedVideoCaptioning☆11Feb 16, 2023Updated 2 years ago
- ☆15Sep 23, 2024Updated last year
- Reading list for multimodal sequence learning☆14Sep 4, 2023Updated 2 years ago
- ☆10Jun 30, 2023Updated 2 years ago
- ☆12Mar 29, 2024Updated last year
- UCF Sports annotations: This repository provides human bounding box annotations of UCF Sports dataset and a function to read these annota…☆14May 22, 2015Updated 10 years ago
- ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding☆17Aug 8, 2025Updated 6 months ago
- ☆11Jan 6, 2025Updated last year
- video captioning using 3DCNN and LSTM (pytorch)☆11Sep 26, 2019Updated 6 years ago
- ☆14Dec 8, 2025Updated 2 months ago
- Micromodels -- A framework for accurate, explainable, data efficient, and reusable NLP models.☆14Feb 7, 2023Updated 3 years ago
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202…☆29Jan 18, 2026Updated 3 weeks ago
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆14Apr 2, 2025Updated 10 months ago
- ☆14Dec 25, 2020Updated 5 years ago
- ☆15Jun 12, 2022Updated 3 years ago
- ☆11Aug 31, 2023Updated 2 years ago
- ☆33Nov 26, 2025Updated 2 months ago
- SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability☆16May 8, 2025Updated 9 months ago
- Trying to understand alias-free-gan.☆14Dec 28, 2021Updated 4 years ago
- Egocentric Video Description based on Temporally-Linked Sequences☆11Jul 17, 2017Updated 8 years ago
- Code for our Source-free Unsupervised Video Domain Adaptation Paper☆13Jan 17, 2025Updated last year
- Code for "Improving Robustness of Vision Transformers by Reducing Sensitivity to Patch Corruptions"☆14Sep 3, 2023Updated 2 years ago
- ☆17Sep 6, 2024Updated last year
- Official implementation of "An Action Is Worth Multiple Words: Handling Ambiguity in Action Recognition", BMVC 2022☆12Dec 16, 2022Updated 3 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago