AlenUbuntu / Awesome-Vision-and-Language-PreTrain-PapersView external linksLinks
☆14Dec 25, 2020Updated 5 years ago
Alternatives and similar repositories for Awesome-Vision-and-Language-PreTrain-Papers
Users that are interested in Awesome-Vision-and-Language-PreTrain-Papers are comparing it to the libraries listed below
Sorting:
- A video retrieval dataset How2R and a video QA dataset How2QA☆24Oct 15, 2020Updated 5 years ago
- CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training☆34Nov 9, 2021Updated 4 years ago
- Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671☆18May 6, 2021Updated 4 years ago
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆27Aug 20, 2022Updated 3 years ago
- Official PyTorch code for the CVPR 2024 paper 'Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognitio…☆37May 28, 2025Updated 8 months ago
- To appear in the 30th International Joint Conference on Artificial Intelligence (IJCAI 2021).☆32Aug 18, 2021Updated 4 years ago
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…☆38Jan 17, 2024Updated 2 years ago
- ☆33Nov 12, 2018Updated 7 years ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Jul 14, 2021Updated 4 years ago
- Automating the labelling of microstructure patterns in microscope images of welding joints☆11Jun 10, 2018Updated 7 years ago
- Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)☆11Sep 17, 2023Updated 2 years ago
- ☆11Apr 8, 2024Updated last year
- SKFAC Preconditioner for MindSpore☆12Jul 2, 2021Updated 4 years ago
- ☆10Dec 16, 2023Updated 2 years ago
- Re3 in PyTorch☆39Nov 19, 2023Updated 2 years ago
- This project is created in order to compare the differences between the convolutional neural network with and without Gabor's feature ext…☆11Nov 3, 2020Updated 5 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Jan 16, 2022Updated 4 years ago
- Latex template for CUHK PhD Thesis☆11Jun 29, 2025Updated 7 months ago
- MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…☆13Jan 16, 2024Updated 2 years ago
- ☆10Oct 17, 2023Updated 2 years ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 2 months ago
- ☆10Mar 28, 2023Updated 2 years ago
- Code for the article "Accelerated Forward-Backward Optimization using Deep Learning"☆12Sep 15, 2021Updated 4 years ago
- [PRCV-2023, IEEE TMM-2025] Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification☆12Dec 20, 2025Updated last month
- Official repository of the UPAR dataset for pedestrian attribute recognition and attribute-based person retrieval☆14Jan 22, 2024Updated 2 years ago
- Course review and timetable planning platform used by thousands of CUHK students☆13Aug 19, 2024Updated last year
- This repository summarizes the human-centered applications of event data☆13Jan 31, 2025Updated last year
- ☆11Mar 7, 2024Updated last year
- ☆10Sep 8, 2022Updated 3 years ago
- AI wiki☆10Dec 9, 2022Updated 3 years ago
- Unofficial PyTorch implementation of the paper "Multi-Label Image Recognition with Graph Convolutional Networks"☆10Feb 19, 2023Updated 2 years ago
- Fast Autoaugment implementation for PyTorch☆10Jul 24, 2019Updated 6 years ago
- ☆17Nov 16, 2025Updated 3 months ago
- Video to Language Challenge (MSR-VTT Challenge 2016)☆32Dec 28, 2017Updated 8 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch☆46Mar 3, 2021Updated 4 years ago
- Pytorch code for our NeurIPS 2019 paper "Cross-channel Communication Networks"☆41Dec 13, 2019Updated 6 years ago
- I3D feature extractor☆43Dec 26, 2019Updated 6 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts☆188May 1, 2025Updated 9 months ago
- ☆11Mar 5, 2025Updated 11 months ago