allenai/gpv2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/allenai/gpv2)

allenai / gpv2

☆32

Alternatives and similar repositories for gpv2

Users that are interested in gpv2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

allenai / gpv2-web10k
View on GitHub
Download Web-10K data by querying Bing Image Search
☆10Feb 1, 2022Updated 4 years ago
allenai / gpv-1
View on GitHub
A task-agnostic vision-language architecture as a step towards General Purpose Vision
☆92Jul 14, 2021Updated 5 years ago
princeton-nlp / rationale-robustness
View on GitHub
NAACL 2022: Can Rationalization Improve Robustness? https://arxiv.org/abs/2204.11790
☆27Nov 21, 2022Updated 3 years ago
jason9693 / FROZEN
View on GitHub
☆14May 3, 2022Updated 4 years ago
ewrfcas / iLAT
View on GitHub
The Image Local Autoregressive Transformer (NIPS2021)
☆15Nov 9, 2021Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
RobertCsordas / ndr
View on GitHub
The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".
☆34Jun 11, 2025Updated last year
aurooj / WSG-VQA-VLTransformers
View on GitHub
Weakly Supervised Grounding for VQA in Vision-Language Transformers
☆17May 6, 2023Updated 3 years ago
IDEA-Research / DQ-DETR
View on GitHub
[AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding
☆58Nov 28, 2022Updated 3 years ago
mjbommar / gpt-as-knowledge-worker
View on GitHub
GPT as Knowledger Worker (or if you really want, GPT Sorta' Takes the CPA Exam)
☆13Jan 24, 2023Updated 3 years ago
facebookresearch / concurrentqa
View on GitHub
This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems."
☆47Jul 18, 2024Updated 2 years ago
elastic / workplace-search-python
View on GitHub
Elastic Workplace Search Official Python Client
☆10Aug 8, 2024Updated last year
Vision-CAIR / LTVRR
View on GitHub
☆35Oct 21, 2023Updated 2 years ago
GuessWhatGame / vqa
View on GitHub
VQA baseline with Conditional Batch Normalization
☆15Apr 9, 2018Updated 8 years ago
fudan-zvg / TDAS
View on GitHub
☆18Jun 10, 2022Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
uakarsh / latr
View on GitHub
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…
☆56Updated this week
ronghanghu / vqa-maskrcnn-benchmark-m4c
View on GitHub
Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…
☆13Jan 30, 2020Updated 6 years ago
alexandra-chron / hierarchical-domain-adaptation
View on GitHub
Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.
☆32Sep 26, 2023Updated 2 years ago
Daniellli / ECT
View on GitHub
the official repository of 《ECT: Fine-grained Edge Detection with Learned Cause Tokens》
☆16Feb 15, 2024Updated 2 years ago
due-benchmark / du-schema
View on GitHub
JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do…
☆14Nov 5, 2024Updated last year
salesforce / Converse
View on GitHub
☆132Jun 2, 2026Updated last month
junctor / android
View on GitHub
DEF CON Hacker Tracker
☆16Updated this week
rll-research / teachable
View on GitHub
☆17Oct 12, 2023Updated 2 years ago
MuchHair / HQM
View on GitHub
ECCV2022 Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection
☆27May 26, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
OFA-Sys / OFA-Compress
View on GitHub
OFA-Compress is a unified framework which provides OFA model finetuning, distillation and inference capabilities in Huggingface version, …
☆29Sep 22, 2022Updated 3 years ago
salesforce / VD-BERT
View on GitHub
☆45Jun 16, 2025Updated last year
microsoft / FIBER
View on GitHub
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
☆131Oct 10, 2023Updated 2 years ago
mnamysl / nat-acl2020
View on GitHub
☆15May 26, 2021Updated 5 years ago
qagentur / texttunnel
View on GitHub
Python package for extractive NLP using the OpenAI API
☆17Aug 28, 2024Updated last year
facebookresearch / paco
View on GitHub
This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts…
☆300Feb 12, 2024Updated 2 years ago
allenai / ask4help
View on GitHub
Code for the Ask4Help project
☆22Nov 24, 2022Updated 3 years ago
allenai / everyday-things
View on GitHub
☆17Dec 6, 2023Updated 2 years ago
sarthmit / Compositional-Attention
View on GitHub
Code to reproduce the results for Compositional Attention
☆59Nov 16, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
cl-tohoku / AIO2_DPR_baseline
View on GitHub
https://www.nlp.ecei.tohoku.ac.jp/projects/aio/
☆16Aug 4, 2022Updated 3 years ago
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
mlfoundations / patching
View on GitHub
Patching open-vocabulary models by interpolating weights
☆91Sep 28, 2023Updated 2 years ago
srush / transformers-bet
View on GitHub
☆12Mar 3, 2022Updated 4 years ago
agneet42 / revision
View on GitHub
[ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"
☆13Aug 6, 2024Updated last year
cvzoya / visuallydata
View on GitHub
A large-scale infographics dataset from Visual.ly with metadata and additional crowdsourced annotations
☆16Oct 8, 2018Updated 7 years ago
hspark-umn / MulticameraSoftware
View on GitHub
☆12Jul 18, 2017Updated 9 years ago