google / video-localized-narratives
☆60Aug 10, 2023Updated 2 years ago
Alternatives and similar repositories for video-localized-narratives
Users who are interested in video-localized-narratives are comparing it to the repositories listed below.
- ☆14Jul 15, 2024Updated last year
- ☆16May 10, 2023Updated 2 years ago
- ☆11Mar 1, 2023Updated 2 years ago
- ☆13Jul 20, 2024Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated last year
- ☆53Oct 16, 2023Updated 2 years ago
- ☆58Apr 24, 2024Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆22Nov 8, 2023Updated 2 years ago
- SSL Video Representation Learning project☆14Jul 8, 2025Updated 7 months ago
- ☆12Mar 10, 2023Updated 2 years ago
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆14Apr 25, 2024Updated last year
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"☆11Jun 18, 2024Updated last year
- ☆16Apr 8, 2024Updated last year
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)☆17Dec 20, 2022Updated 3 years ago
- Implementation of Pix2Seq in PyTorch☆10Feb 3, 2022Updated 4 years ago
- This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts…☆290Feb 12, 2024Updated 2 years ago
- [ICCV 2023 Workshop] The Official Implementation of The First Prize Solution for RVOS Competition☆14Jan 1, 2024Updated 2 years ago
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Jun 15, 2023Updated 2 years ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Nov 7, 2023Updated 2 years ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" NeurIPS 23 Spotlight☆37May 23, 2023Updated 2 years ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated last year
- ☆18Jan 30, 2023Updated 3 years ago
- Official This-Is-My Dataset published in CVPR 2023☆16Jul 18, 2024Updated last year
- ☆41Sep 25, 2023Updated 2 years ago
- ☆42Jul 9, 2025Updated 7 months ago
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆70Feb 28, 2024Updated last year
- Official repo for "DynaMITe: Dynamic Query Bootstrapping for Multi-object Interactive Segmentation Transformer"☆19Sep 29, 2023Updated 2 years ago
- "DV360 Duplicator" is a custom DV360 solution that allows to duplicate existing marketing campaigns and adjust their settings in Google S…☆20Jun 11, 2025Updated 8 months ago
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆16Jan 18, 2024Updated 2 years ago
- ☆16May 26, 2023Updated 2 years ago
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition☆38Apr 27, 2024Updated last year
- ☆58Dec 2, 2025Updated 2 months ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆45Nov 29, 2023Updated 2 years ago
- ☆242Jun 4, 2025Updated 8 months ago
- ☆46Jan 11, 2024Updated 2 years ago
- ☆24Feb 5, 2026Updated last week
- ☆17Jun 15, 2022Updated 3 years ago
- Rotation equivariance meets local feature matching☆18Oct 20, 2022Updated 3 years ago