thunlp/PEVL

Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”

☆49

Alternatives and similar repositories for PEVL

Users interested in PEVL are comparing it to the repositories listed below.

thunlp / VisualDS
☆24 · Apr 16, 2022 · Updated 4 years ago
thunlp / CLEVER
☆22 · Dec 12, 2022 · Updated 3 years ago
microsoft / UniTAB
UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)
☆90 · Jun 12, 2023 · Updated 3 years ago
JacobYuan7 / RLIP
[NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Grap…
☆78 · May 26, 2024 · Updated 2 years ago
Kien085 / SG2Caps
☆23 · Aug 21, 2021 · Updated 4 years ago
waxnkw / IETrans-SGG.pytorch
This is the code of ECCV 2022 (Oral) paper "Fine-Grained Scene Graph Generation with Data Transfer".
☆103 · Jan 24, 2023 · Updated 3 years ago
ezeli / Transformer_model
A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.
☆12 · Nov 15, 2021 · Updated 4 years ago
sjtuytc / Neurips21-ProTo-Program-guided-Transformers-for-Program-guided-Tasks
Official code repo for "ProTo: program-guided Transformers for Program-guided Tasks"
☆21 · Apr 15, 2022 · Updated 4 years ago
ht014 / SG2HOI
☆12 · Sep 19, 2021 · Updated 4 years ago
yuhangzang / UPT
☆61 · May 2, 2025 · Updated last year
GingL / CMPA
☆16 · May 31, 2023 · Updated 3 years ago
thunlp / CPT
Colorful Prompt Tuning for Pre-trained Vision-Language Models
☆49 · Nov 1, 2022 · Updated 3 years ago
ubc-vision / RefTR
Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021
☆67 · May 26, 2022 · Updated 4 years ago
LanqingL / SCS
"Visual Prompt Selection for In-Context Learning Segmentation Framework"
☆14 · Dec 13, 2024 · Updated last year
Deanplayerljx / tab-vcr
Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671
☆19 · May 6, 2021 · Updated 5 years ago
microsoft / FIBER
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
☆131 · Oct 10, 2023 · Updated 2 years ago
iLearn-Lab / CVPR22-SHA-GCL-for-SGG
Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation"
☆39 · Apr 8, 2026 · Updated 3 months ago
djiajunustc / TransVG
☆198 · Feb 27, 2024 · Updated 2 years ago
bknyaz / sgg
Train Scene Graph Generation for Visual Genome and GQA in PyTorch >= 1.2 with improved zero and few-shot generalization [BMVC 2020, ICCV …
☆143 · Jun 18, 2023 · Updated 3 years ago
maximek3 / e-ViL
☆41 · Nov 23, 2022 · Updated 3 years ago
e-bug / fine-grained-evals
[ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"
☆13 · Jun 11, 2023 · Updated 3 years ago
LandyGuo / Download_HowTo100M
code for downloading videos from HowTo100M dataset
☆18 · May 13, 2021 · Updated 5 years ago
Vision-CAIR / RelTransformer
☆29 · Oct 4, 2023 · Updated 2 years ago
windforfurture / DTCA
for DTCA model
☆10 · Oct 17, 2023 · Updated 2 years ago
meetdavidwan / crg
PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"
☆39 · Mar 4, 2024 · Updated 2 years ago
XLiu443 / Tem-adapter
[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
☆37 · Oct 18, 2023 · Updated 2 years ago
codexxxl / GraphVQA
GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering
☆65 · Sep 4, 2021 · Updated 4 years ago
mengcaopku / DCNet
[ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension
☆15 · Sep 4, 2022 · Updated 3 years ago
seanzhuh / SeqTR
SeqTR: A Simple yet Universal Network for Visual Grounding
☆144 · Oct 30, 2024 · Updated last year
iacercalixto / butd-image-captioning
Bottom-up Top-down image captioning model with PyTorch.
☆14 · Dec 5, 2020 · Updated 5 years ago
IIGROUP / PUM
[CVPR 2021] Pytorch implementation for Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
☆19 · May 7, 2021 · Updated 5 years ago
UMass-Embodied-AGI / CoVLM
[ICLR 2023] CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
☆46 · Jun 9, 2025 · Updated last year
HKUST-LongGroup / CoMM
[CVPR 2025 Highlight] Official repository for CoMM Dataset
☆56 · Dec 31, 2024 · Updated last year
LYX0501 / SPRING
☆13 · Mar 25, 2023 · Updated 3 years ago
dongkwani / UPCSC
The official implementation of "Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization" (CVPR 2025)
☆15 · Nov 20, 2025 · Updated 8 months ago
ChCh1999 / RTPB
Code for our paper `Resistance Training using Prior Bias: toward Unbiased Scene Graph Generation`
☆20 · Feb 18, 2024 · Updated 2 years ago
Hxyou / MSCLIP
Official Code of ECCV 2022 paper MS-CLIP
☆91 · Jul 27, 2022 · Updated 3 years ago
zeeshank95 / GVSR
☆14 · Dec 9, 2023 · Updated 2 years ago
kj3moraes / movieclip
An experiment with movie scenes and contrastive learning
☆11 · Feb 1, 2025 · Updated last year