This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Measuring Verb-Adverb Textual Relationships"
☆13May 25, 2023Updated 2 years ago
Alternatives and similar repositories for air-cvpr23
Users that are interested in air-cvpr23 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆13Nov 4, 2023Updated 2 years ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆30Sep 5, 2023Updated 2 years ago
- ☆18Feb 20, 2025Updated last year
- ☆26May 19, 2022Updated 3 years ago
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)☆46Jul 26, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆119Feb 19, 2024Updated 2 years ago
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'☆20Feb 16, 2024Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024☆53Mar 3, 2024Updated 2 years ago
- Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆29Sep 23, 2024Updated last year
- Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024☆58Aug 19, 2025Updated 7 months ago
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)☆31Aug 3, 2022Updated 3 years ago
- ☆30Aug 14, 2023Updated 2 years ago
- ☆31Mar 24, 2022Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Generating Image Specific Text☆29Aug 14, 2023Updated 2 years ago
- ☆14Jan 5, 2022Updated 4 years ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆30Apr 19, 2023Updated 2 years ago
- Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts"☆80May 5, 2024Updated last year
- [CVPR'23 Highlight] Heterogeneous Continual Learning.☆15Dec 5, 2023Updated 2 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆37May 23, 2023Updated 2 years ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆47Feb 29, 2024Updated 2 years ago
- Code and annotation for the paper "Towards Accurate and Interpretable Surgical Skill Assessment: A Video-Based Method Incorporating Recog…☆12Jan 20, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023☆80Nov 2, 2023Updated 2 years ago
- ☆46Apr 30, 2021Updated 4 years ago
- Continual Learning with Gated Incremental Memories for Sequential Data Processing. IJCNN 2020. Continual Learning with Recurrent Neural N…☆15Oct 13, 2021Updated 4 years ago
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023☆11Oct 5, 2023Updated 2 years ago
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning☆13Apr 14, 2024Updated last year
- ☆25Oct 9, 2025Updated 5 months ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Aug 30, 2023Updated 2 years ago
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition☆37Apr 27, 2024Updated last year
- ☆11Nov 5, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆37Nov 8, 2024Updated last year
- ☆13Nov 28, 2021Updated 4 years ago
- Embodied Instruction Following in Unknown Environments☆17Dec 8, 2025Updated 3 months ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆53Jun 12, 2023Updated 2 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆20Nov 3, 2025Updated 4 months ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- Weakly Supervised Object Localization via Class RE-Activation Mapping☆12Sep 19, 2022Updated 3 years ago