pytorch implementation of mvp: a multi-stage vision-language pre-training framework
☆34Mar 1, 2023Updated 3 years ago
Alternatives and similar repositories for mvp_pytorch
Users that are interested in mvp_pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- pytorch implementation of mvp: a multi-stage vision-language pre-training framework☆11Apr 23, 2022Updated 3 years ago
- Controllable mage captioning model with unsupervised modes☆21Apr 14, 2023Updated 2 years ago
- SelfCriticalSequenceTrainingforImageCaptioning☆21May 27, 2017Updated 8 years ago
- Multi-index hashing for the resolution of ANN search problem on large datasets☆15Oct 16, 2018Updated 7 years ago
- ☆60Nov 17, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- MLPs for Vision and Langauge Modeling (Coming Soon)☆27Dec 9, 2021Updated 4 years ago
- Implementation of our ACL2023 paper: Unifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Langua…☆19Jul 5, 2023Updated 2 years ago
- ☆10Aug 20, 2024Updated last year
- ☆12Feb 18, 2020Updated 6 years ago
- ☆28Feb 2, 2026Updated last month
- Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos☆12Oct 8, 2020Updated 5 years ago
- Fast and Modularized CFG-focused Models☆23Nov 8, 2023Updated 2 years ago
- pre-trained vision and language model summary☆12Apr 20, 2021Updated 4 years ago
- The Transformer in PyTorch☆13Aug 7, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Repository for the ACL'22 paper "So Different Yet So Alike! Constrained Unsupervised Text Style Transfer"☆16Jan 19, 2024Updated 2 years ago
- Proximal Asynchronous SAGA☆13Nov 30, 2017Updated 8 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆16Oct 11, 2021Updated 4 years ago
- Implementation for MAF: Multimodal Alignment Framework☆46Nov 25, 2020Updated 5 years ago
- By fine tuning GPT2 on News Aggregator data☆15Jan 24, 2021Updated 5 years ago
- ☆13Jun 26, 2021Updated 4 years ago
- project page for VinVL☆359Jul 26, 2023Updated 2 years ago
- ☆13Mar 27, 2019Updated 6 years ago
- Poincaré Event Temporal Embeddings and Hyperbolic GRU for Event TempRel Extraction☆11Nov 8, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆13Feb 1, 2022Updated 4 years ago
- Code for ACM MM 2021 Paper "Multimodal Relation Extraction with Efficient Graph Alignment".☆109Aug 2, 2022Updated 3 years ago
- ☆13Jun 17, 2024Updated last year
- Oscar and VinVL☆1,052Aug 28, 2023Updated 2 years ago
- ☆28Feb 10, 2025Updated last year
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆800Jun 30, 2021Updated 4 years ago
- The download methods of Vision-language Continual Pretraining Dataset P9D.☆12Jan 3, 2025Updated last year
- A Sparse-tensor Communication Framework for Distributed Deep Learning☆13Nov 1, 2021Updated 4 years ago
- [NeurIPS 2024] SeeClear: This repo is the official implementation of "SeeClear: Semantic Distillation Enhances Pixel Condensation for Vid…☆18Oct 8, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of CVPR2017 paper "A Hierarchical Approach for Generating Descriptive Image Paragraphs" in Tensorflow (in progress...)☆13Jan 27, 2018Updated 8 years ago
- A Full-Scale Dataset for Multi-modal Summarization☆16Dec 8, 2021Updated 4 years ago
- [NeurIPS 2022 Workshop] A Case Study with Negated Prompts using T0 (3B, 11B), InstructGPT (350M-175B), GPT-3 (350M - 175B) & OPT (125M - …☆24Sep 27, 2022Updated 3 years ago
- ☆20Nov 19, 2020Updated 5 years ago
- [ECCV 2022] FashionViL: Fashion-Focused V+L Representation Learning☆61Nov 15, 2022Updated 3 years ago
- ☆36Dec 22, 2021Updated 4 years ago
- ☆10Jul 30, 2021Updated 4 years ago