Code for Look for the Change paper published at CVPR 2022
☆36Oct 26, 2022Updated 3 years ago
Alternatives and similar repositories for LookForTheChange
Users who are interested in LookForTheChange are comparing it to the repositories listed below.
- Code for the paper "Multi-Task Learning of Object States and State-Modifying Actions from Web Videos" published in TPAMI☆11Mar 3, 2024Updated 2 years ago
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 4 years ago
- Unifying Specialized Visual Encoders for Video Language Models☆25Nov 22, 2025Updated 6 months ago
- ☆17Sep 2, 2023Updated 2 years ago
- Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022☆39Feb 17, 2023Updated 3 years ago
- [EMNLP'22] Weakly-Supervised Temporal Article Grounding☆14Nov 25, 2023Updated 2 years ago
- [NeurIPS 2021] Introspective Distillation for Robust Question Answering☆13Dec 7, 2021Updated 4 years ago
- PyTorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆117Sep 15, 2022Updated 3 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- ☆34Apr 23, 2026Updated last month
- Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)☆23Jun 26, 2020Updated 5 years ago
- [ICLR 2024 Poster] SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos☆20Aug 21, 2025Updated 9 months ago
- Learned User Representations in Online Social Networks (Twitter) using Temporal Dynamics of Information Diffusion.☆10Oct 15, 2018Updated 7 years ago
- Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"☆30Jul 4, 2018Updated 7 years ago
- [ICLR 2024] Towards Robust Multi-Modal Reasoning via Model Selection☆14Mar 7, 2024Updated 2 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆15May 3, 2023Updated 3 years ago
- BottomUpTopDown VQA model with question-type debiasing☆22Oct 6, 2019Updated 6 years ago
- 🔥🔥🔥 Object State Description & Change Detection☆10Apr 6, 2026Updated 2 months ago
- Unpaired Image Captioning☆36Mar 25, 2021Updated 5 years ago
- Official Implementation for CVPR 2022 paper "Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language …☆24Oct 19, 2022Updated 3 years ago
- Official implementation of the paper How to Listen? Rethinking Visual Sound Localization☆18Apr 25, 2022Updated 4 years ago
- [CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"☆55Aug 8, 2023Updated 2 years ago
- A pip installable package for optimal transport inspired loss functions in the spectral domain. Can be used for audio applications such a…☆30Apr 3, 2026Updated 2 months ago
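- A knowledge representation and reasoning project that collects knowledge representation and reasoning algorithms; application examples are provided for some of them.☆13Apr 26, 2022Updated 4 years ago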
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…☆30Jun 29, 2020Updated 5 years ago
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆16Jan 18, 2024Updated 2 years ago
- O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning (CoRL 2021)☆37Feb 22, 2022Updated 4 years ago
- Code for paper titled, "Learning to Predict Task Progress by Self-Supervised Video Alignment" by Gerard Donahue and Ehsan Elhamifar, publ…☆16Jul 26, 2024Updated last year
- Observer Pattern in Python - code examples☆14Jul 6, 2015Updated 10 years ago
- CLIP GUI - XAI app ~ explainable (and guessable) AI with ViT & ResNet models☆22Sep 13, 2024Updated last year
- ☆27Oct 7, 2021Updated 4 years ago
- ☆15Aug 20, 2024Updated last year
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.☆121Oct 9, 2023Updated 2 years ago
- [ICLR2025] Official code implementation of Video-UTR: Unhackable Temporal Rewarding for Scalable Video MLLMs☆60Feb 27, 2025Updated last year
- GQA-OOD is a new dataset and benchmark for the evaluation of VQA models in OOD (out of distribution) settings.☆33Mar 1, 2021Updated 5 years ago
- ☆33Oct 16, 2017Updated 8 years ago
- Re-implementation of the work Livebot☆16Jun 21, 2020Updated 5 years ago
- ☆12Mar 8, 2021Updated 5 years ago
- Official Implementation of SnAG (CVPR 2024)☆60Apr 26, 2025Updated last year