hughplay/TVR

Transformation Driven Visual Reasoning - CVPR 2021

☆36

Alternatives and similar repositories for TVR

Users that are interested in TVR are comparing it to the libraries listed below.

Cognition2Action-Lab / VLA-TMEE
Reshaping Action Error Distributions for Reliable Vision-Language-Action Models
☆17 · Feb 5, 2026 · Updated 5 months ago
belindal / LaMPP
Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action
☆37 · Apr 3, 2023 · Updated 3 years ago
hughplay / Visual-Reasoning-Papers
📄 A curated list of visual reasoning papers.
☆31 · Jul 1, 2026 · Updated 3 weeks ago
zfchenUnique / DCL-Release
This repo contains the PyTorch implementation of Dynamic Concept Learner (accepted by ICLR 2021).
☆37 · Jul 8, 2024 · Updated 2 years ago
WellyZhang / ACRE
ACRE: Abstract Causal REasoning Beyond Covariation
☆19 · Dec 7, 2021 · Updated 4 years ago
microsoft / DFOL-VQA
Differentiable First-Order Logic Reasoning for Visual Question Answering
☆45 · Mar 7, 2021 · Updated 5 years ago
bcmi / Causal-VidQA
[CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The…
☆78 · Jun 23, 2025 · Updated last year
wangpengnorman / KB-Ref_dataset
☆16 · Dec 28, 2020 · Updated 5 years ago
flowersteam / playground_env
Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals in Curiosity-Driven Exploration.
☆11 · Mar 5, 2021 · Updated 5 years ago
YYJMJC / Compositional-Temporal-Grounding
☆31 · Mar 24, 2022 · Updated 4 years ago
Alvin-Zeng / GCM
Graph Convolutional Module for Temporal Action Localization in Videos
☆10 · Jul 4, 2020 · Updated 6 years ago
lixiangpengcs / PSAC
Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering
☆27 · Apr 15, 2021 · Updated 5 years ago
mjd3 / deformable_object_grasping
This package provides a framework to automatically perform grasp tests on an arbitrary object model of choice.
☆12 · Sep 28, 2021 · Updated 4 years ago
seukgcode / MELBench
Multimodal entity linking (MEL) aims to utilize multimodal information to map mentions to corresponding entities defined in knowledge bas…
☆86 · Jul 31, 2021 · Updated 4 years ago
xukai92 / bsp
☆19 · Apr 29, 2022 · Updated 4 years ago
moqingyan / dsr-lm
☆13 · Jul 8, 2023 · Updated 3 years ago
zfchenUnique / compositional_physics_learner
☆40 · Jul 19, 2022 · Updated 4 years ago
evelinehong / PTR
Official repository of the NeurIPS 2021 paper: PTR
☆32 · Dec 17, 2021 · Updated 4 years ago
VividLe / ExtractVideoFeature
Extract video features. Currently, the models include I3D; the list will be continuously updated.
☆12 · Jun 4, 2020 · Updated 6 years ago
bpiyush / TestOfTime
Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time
☆46 · Jun 11, 2024 · Updated 2 years ago
Adam1679 / mutan-article-net
Implementation of Mutan+ArticleNet on OKVQA
☆10 · Jan 11, 2021 · Updated 5 years ago
microsoft / nlu-incremental-symbol-learning
Incremental symbol learning for natural language understanding
☆10 · Jun 12, 2023 · Updated 3 years ago
princeton-computational-imaging / MaskToF
Official code repository for the paper: "Mask-ToF: Learning Microlens Masks for Flying Pixel Correction in Time-of-Flight Imaging"
☆16 · Jan 18, 2024 · Updated 2 years ago
salesforce / woad-pytorch
This is the PyTorch implementation of WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos (CVPR 2021).
☆13 · May 1, 2025 · Updated last year
2snoopy88 / GAT-with-batch
Implement GAT with batch
☆10 · Nov 28, 2020 · Updated 5 years ago
cambridge-mlg / LITE
Code for "Memory Efficient Meta-Learning with Large Images"
☆11 · Nov 24, 2021 · Updated 4 years ago
amitakamath / vl_text_encoders_are_bottlenecks
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11 · May 24, 2023 · Updated 3 years ago
phddamuge / UniRPG
This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation".
☆10 · Apr 30, 2023 · Updated 3 years ago
BackupGithub-AI / LAH
☆10 · Mar 28, 2023 · Updated 3 years ago
jayleicn / VideoLanguageFuturePred
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
☆52 · Aug 20, 2022 · Updated 3 years ago
THU-KEG / R-Eval
[KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models
☆11 · Apr 9, 2024 · Updated 2 years ago
roeiherz / AG2Video
Code for "Compositional Video Synthesis with Action Graphs", Bar & Herzig et al., ICML 2021
☆32 · Nov 22, 2022 · Updated 3 years ago
willprice / flowty
The swiss army knife for extracting optical flow
☆16 · May 13, 2020 · Updated 6 years ago
jalayrac / object-states-action
Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017
☆14 · Aug 7, 2018 · Updated 7 years ago
dmoltisanti / air-cvpr23
This repository contains the Adverbs in Recipes (AIR) dataset and the code published with the CVPR 23 paper: "Learning Action Changes by Me…
☆13 · May 25, 2023 · Updated 3 years ago
ict-bigdatalab / VNEL
Dataset and code for EMNLP 2022 "Visual Named Entity Linking: A New Dataset and A Baseline"
☆28 · Apr 16, 2023 · Updated 3 years ago
soCzech / ChangeIt
ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022
☆11 · Mar 23, 2022 · Updated 4 years ago
pl8787 / wsdm2021-beyond-prp-tutorial
WSDM2021 Tutorial: Beyond Probability Ranking Principle: Modeling the Dependencies among Documents
☆23 · Mar 12, 2021 · Updated 5 years ago
MeteSertkan / ranger
Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!
☆12 · Jun 12, 2023 · Updated 3 years ago