☆32Mar 7, 2022Updated 3 years ago
Alternatives and similar repositories for gpv2
Users that are interested in gpv2 are comparing it to the libraries listed below
Sorting:
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆13Jun 26, 2021Updated 4 years ago
- Knowledge Infused Decoding☆71Dec 31, 2023Updated 2 years ago
- Download Web-10K data by querying Bing Image Search☆10Feb 1, 2022Updated 4 years ago
- GPT as Knowledger Worker (or if you really want, GPT Sorta' Takes the CPA Exam)☆13Jan 24, 2023Updated 3 years ago
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Jul 14, 2021Updated 4 years ago
- ☆35Oct 21, 2023Updated 2 years ago
- ☆18Jun 10, 2022Updated 3 years ago
- The Image Local Autoregressive Transformer (NIPS2021)☆15Nov 9, 2021Updated 4 years ago
- “Open terminals”, “load CSVs”, “start hacking”☆16May 2, 2017Updated 8 years ago
- ☆14May 31, 2022Updated 3 years ago
- PoE-World: Compositional World Modeling with Products of Programmatic Experts☆39Feb 5, 2026Updated 3 weeks ago
- FaVIQ: Fact Verification from Information-seeking Questions☆43Nov 23, 2022Updated 3 years ago
- PyTorch implementation of Gaussian word embeddings☆19Apr 7, 2018Updated 7 years ago
- Weakly Supervised Grounding for VQA in Vision-Language Transformers☆16May 6, 2023Updated 2 years ago
- PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)☆23Nov 29, 2022Updated 3 years ago
- Transformer model for the Amazon Topical-Chat Corpus. Baselines for DSTC9 Track 3.☆19Jul 9, 2020Updated 5 years ago
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆28Oct 12, 2021Updated 4 years ago
- ☆19Jul 6, 2023Updated 2 years ago
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"☆22Jul 8, 2021Updated 4 years ago
- UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)☆89Jun 12, 2023Updated 2 years ago
- baseline mode for the ObjectNet competition☆18Jan 13, 2021Updated 5 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Jun 13, 2023Updated 2 years ago
- Memory-Bounded GPU Acceleration for Vector Search☆33Dec 29, 2025Updated 2 months ago
- Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone☆131Oct 10, 2023Updated 2 years ago
- Finetune CPM-1☆24Jun 20, 2021Updated 4 years ago
- ☆49Mar 8, 2022Updated 3 years ago
- PyTorch Implementation of Spatially Consistent Representation Learning(SCRL)☆107Jan 3, 2024Updated 2 years ago
- ☆120Jun 11, 2024Updated last year
- ECCV2022 Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection☆27May 26, 2023Updated 2 years ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆116Sep 15, 2022Updated 3 years ago
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"☆113May 13, 2020Updated 5 years ago
- Code for SIGDial 2019 Best Paper: Structured Fusion Networks for Dialog https://arxiv.org/abs/1907.10016☆30Aug 19, 2019Updated 6 years ago
- Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.☆32Sep 26, 2023Updated 2 years ago
- OFA-Compress is a unified framework which provides OFA model finetuning, distillation and inference capabilities in Huggingface version, …☆29Sep 22, 2022Updated 3 years ago
- Code of ICCV paper: https://arxiv.org/abs/2011.10881☆79Nov 20, 2022Updated 3 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)☆36Apr 9, 2022Updated 3 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆31Jul 9, 2020Updated 5 years ago
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆77Jan 27, 2024Updated 2 years ago