showlab/DemoVLP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/showlab/DemoVLP)

showlab / DemoVLP

[Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training

☆22

Alternatives and similar repositories for DemoVLP

Users that are interested in DemoVLP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

FingerRec / OA-Transformer
View on GitHub
[CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》
☆61May 25, 2022Updated 4 years ago
tsujuifu / pytorch_violet
View on GitHub
A PyTorch implementation of VIOLET
☆138Dec 17, 2023Updated 2 years ago
showlab / Region_Learner
View on GitHub
The Pytorch implementation for "Video-Text Pre-training with Learned Regions"
☆43Jul 15, 2022Updated 4 years ago
TencentARC / MCQ
View on GitHub
Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).
☆141Jul 20, 2022Updated 4 years ago
showlab / all-in-one
View on GitHub
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
☆281Mar 25, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
princetonvisualai / MQVR
View on GitHub
☆26Jan 12, 2022Updated 4 years ago
xlliu7 / BMN-Boundary-Matching-Network
View on GitHub
A Fast PyTorch implementation for ICCV 19 paper "BMN: Boundary-Matching Network for Temporal Action Proposal Generation"
☆10Jul 29, 2019Updated 6 years ago
jaeseokbyun / GRIT-VLP
View on GitHub
This is an official implementation of GRIT-VLP
☆20Aug 8, 2022Updated 3 years ago
CuthbertCai / Ask-Confirm
View on GitHub
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)
☆20Dec 4, 2021Updated 4 years ago
airsplay / vimpac
View on GitHub
☆73Jun 3, 2022Updated 4 years ago
CSU-JPG / Awesome-VLM-Reasoning
View on GitHub
☆21May 19, 2025Updated last year
StanLei52 / TQVSR
View on GitHub
[Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant
☆24Sep 11, 2023Updated 2 years ago
salesforce / ALPRO
View on GitHub
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
☆188May 1, 2025Updated last year
bellos1203 / STPN
View on GitHub
STPN - Weakly Supervised Action Localization by Sparse Temporal Pooling Network
☆82Dec 6, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sjenni / temporal-ssl
View on GitHub
Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.
☆49Mar 18, 2021Updated 5 years ago
redwang / DTGRM
View on GitHub
Temporal Relational Modeling with Self-Supervision for Action Segmentation
☆20Feb 7, 2021Updated 5 years ago
lingorX / LIIR
View on GitHub
☆17Jun 21, 2022Updated 4 years ago
VALUE-Leaderboard / StarterCode
View on GitHub
Starter Code for VALUE benchmark
☆79Aug 23, 2022Updated 3 years ago
TencentARC / SFDA
View on GitHub
☆21Jul 20, 2022Updated 4 years ago
lixiaotong97 / mc-BEiT
View on GitHub
[ECCV 2022] Official pytorch implementation of "mc-BEiT: Multi-choice Discretization for Image BERT Pre-training" in European Conference …
☆22Sep 13, 2022Updated 3 years ago
guilk / VLC
View on GitHub
Research code for "Training Vision-Language Transformers from Captions Alone"
☆33Jul 15, 2022Updated 4 years ago
microsoft / LAVENDER
View on GitHub
A Unified Framework for Video-Language Understanding
☆62Jun 17, 2023Updated 3 years ago
TengdaHan / TemporalAlignNet
View on GitHub
[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.
☆122Oct 9, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
AndresPMD / Pytorch-yolo-phoc
View on GitHub
Implementation on pytorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval
☆13Dec 15, 2021Updated 4 years ago
TencentYoutuResearch / SelfSupervisedLearning-DSM
View on GitHub
code for AAAI21 paper "Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion“
☆28Jan 7, 2021Updated 5 years ago
sdc17 / CrossGET
View on GitHub
[ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers
☆34Dec 30, 2024Updated last year
zhang-can / UP-TAL
View on GitHub
[CVPR2022] Unsupervised Pre-training for Temporal Action Localization Tasks (UP-TAL)
☆29Mar 9, 2022Updated 4 years ago
camenduru / Video-LLaMA-colab
View on GitHub
☆31Jul 25, 2023Updated 2 years ago
jinxiang-liu / anno-free-AVS
View on GitHub
Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"
☆38Oct 11, 2024Updated last year
jiyt17 / IDA-VLM
View on GitHub
[ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
☆37Nov 27, 2024Updated last year
medhini / clip_it
View on GitHub
CLIP-It! Language-Guided Video Summarization
☆75Jun 21, 2021Updated 5 years ago
CSU-JPG / TextAtlas
View on GitHub
[ICML 2026]A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation
☆93Sep 27, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xiaoneil / LPNet
View on GitHub
☆13Nov 28, 2021Updated 4 years ago
kahnchana / svt
View on GitHub
Official repository for "Self-Supervised Video Transformer" (CVPR'22)
☆109Jun 26, 2024Updated 2 years ago
zhujiagang / DTPP
View on GitHub
Deep networks with Temporal Pyramid Pooling. The official implementation for "End-to-end Video-level Representation Learning for Action R…
☆73May 6, 2019Updated 7 years ago
chenjoya / 2dtan
View on GitHub
An optimized re-implementation for 2D-TAN: Learning 2D Temporal Localization Networks for Moment Localization with Natural Language (AAAI…
☆128Apr 1, 2023Updated 3 years ago
AwalkZY / CPN
View on GitHub
Code for CVPR2021 Paper “Cascaded Prediction Network via Segment Tree for Temporal Video Grounding”
☆10Apr 3, 2022Updated 4 years ago
jayleicn / ClipBERT
View on GitHub
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730Aug 8, 2023Updated 2 years ago
SongYii / awesome-weakly-supervised-object-detection
View on GitHub
A paper list of Weakly Supervised Object Detection (WSOD) resources.
☆13May 6, 2021Updated 5 years ago