dmoltisanti / air-cvpr23View external linksLinks
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Measuring Verb-Adverb Textual Relationships"
☆13May 25, 2023Updated 2 years ago
Alternatives and similar repositories for air-cvpr23
Users that are interested in air-cvpr23 are comparing it to the libraries listed below
Sorting:
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆13Nov 4, 2023Updated 2 years ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆30Sep 5, 2023Updated 2 years ago
- ☆18Feb 20, 2025Updated 11 months ago
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'☆19Feb 16, 2024Updated 2 years ago
- ☆120Feb 19, 2024Updated last year
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)☆46Jul 26, 2024Updated last year
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024☆53Mar 3, 2024Updated last year
- ☆25May 19, 2022Updated 3 years ago
- Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024☆58Aug 19, 2025Updated 5 months ago
- Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆28Sep 23, 2024Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)☆31Aug 3, 2022Updated 3 years ago
- Generating Image Specific Text☆29Aug 14, 2023Updated 2 years ago
- ☆31Mar 24, 2022Updated 3 years ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆30Apr 19, 2023Updated 2 years ago
- ☆30Aug 14, 2023Updated 2 years ago
- Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts"☆79May 5, 2024Updated last year
- ☆46Apr 30, 2021Updated 4 years ago
- ☆37Nov 8, 2024Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023☆80Nov 2, 2023Updated 2 years ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆37May 23, 2023Updated 2 years ago
- ☆14Nov 26, 2025Updated 2 months ago
- Official repository for ACM Multimedia'23 paper "MATK: The Meme Analytical Tool Kit"☆13May 29, 2024Updated last year
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition☆38Apr 27, 2024Updated last year
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning☆13Apr 14, 2024Updated last year
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 3 months ago
- ☆17Apr 25, 2023Updated 2 years ago
- The code repository for ICML25 paper "Understanding the Limits of Deep Tabular Methods with Temporal Shift"☆13May 1, 2025Updated 9 months ago
- ☆10Jul 16, 2023Updated 2 years ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆48Feb 29, 2024Updated last year
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated last year
- A curated list of papers & resources linked to concept learning☆12Aug 9, 2023Updated 2 years ago
- Basic Vacuum Cleaner World Problem in Artificial Intelligence that describes how any action is perfomed after sensing the environment.☆11Feb 11, 2019Updated 7 years ago
- ☆12Jan 21, 2019Updated 7 years ago
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023☆11Oct 5, 2023Updated 2 years ago
- [ICCV 2025] Repository for A Quality-Guided Mixture of Score-fusion Experts Framework for Human Recognition☆16Sep 29, 2025Updated 4 months ago
- ☆11Nov 5, 2024Updated last year
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated last week