Official This-Is-My Dataset published in CVPR 2023
☆16 · Jul 18, 2024 · Updated last year
Alternatives and similar repositories for this-is-my
Users who are interested in this-is-my are comparing it to the repositories listed below
- SMILE: A Multimodal Dataset for Understanding Laughter ☆13 · Jun 15, 2023 · Updated 2 years ago
- ☆11 · Jul 31, 2022 · Updated 3 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆25 · Nov 23, 2024 · Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 · May 24, 2023 · Updated 2 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally? ☆35 · Apr 27, 2023 · Updated 2 years ago
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation) ☆16 · Jan 18, 2024 · Updated 2 years ago
- showing how to use CLIP-Vip to do video search ☆16 · Nov 16, 2023 · Updated 2 years ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation… ☆22 · Nov 8, 2023 · Updated 2 years ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021) ☆45 · May 3, 2022 · Updated 3 years ago
- ActMAD: Activation Matching to Align Distributions for Test-Time-Training (CVPR 2023) ☆21 · Jun 27, 2023 · Updated 2 years ago
- ☆54 · Jul 31, 2022 · Updated 3 years ago
- collection of pitch (f0, fundamental frequency) detection algorithms with unified interface ☆25 · Nov 25, 2024 · Updated last year
- We build a novel self-supervised segmentation pipeline to segment transparent liquids (clear water) placed inside transparent containers. ☆26 · Nov 22, 2022 · Updated 3 years ago
- Composed Video Retrieval ☆62 · May 2, 2024 · Updated last year
- [CVPR24] Official Implementation of GEM (Grounding Everything Module) ☆137 · Apr 10, 2025 · Updated 10 months ago
- Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022) ☆30 · Dec 1, 2022 · Updated 3 years ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation ☆41 · Dec 23, 2023 · Updated 2 years ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation ☆34 · Mar 7, 2025 · Updated last year
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight ☆37 · May 23, 2023 · Updated 2 years ago
- Data repository for the VALSE benchmark. ☆37 · Feb 15, 2024 · Updated 2 years ago
- 4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022 ☆43 · Jul 24, 2023 · Updated 2 years ago
- NNVisBuilder and some cases including KD-t ☆12 · Nov 18, 2023 · Updated 2 years ago
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition ☆38 · Apr 27, 2024 · Updated last year
- Multi-Agent LLM System for Digital Scam Protection ☆12 · Dec 19, 2024 · Updated last year
- ☆11 · Oct 27, 2019 · Updated 6 years ago
- PyTorch Implementation of the paper "Defining and Quantifying the Emergence of Sparse Concepts in DNNs" (CVPR 2023) ☆12 · Dec 24, 2023 · Updated 2 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm. ☆10 · Jan 12, 2015 · Updated 11 years ago
- Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition ☆10 · Oct 31, 2022 · Updated 3 years ago
- Official implementation of "Attention-aware semantic communications for collaborative inference" (IEEE IoTJ 2024) ☆13 · Jan 22, 2026 · Updated last month
- Solution of Kaggle competition: MAP - Charting Student Math Misunderstandings ☆24 · Oct 25, 2025 · Updated 4 months ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning ☆41 · Sep 25, 2023 · Updated 2 years ago
- Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time ☆46 · Jun 11, 2024 · Updated last year
- ☆11 · May 5, 2022 · Updated 3 years ago
- Probabilistic Finite Volume Method based on Affine Gaussian Process inference ☆11 · Jun 10, 2024 · Updated last year
- IRFL: Image Recognition of Figurative Language ☆11 · Nov 30, 2023 · Updated 2 years ago
- diffusers with search engine ☆12 · Jan 13, 2026 · Updated last month
- GPU implementation of improved dense trajectory ☆10 · Apr 14, 2015 · Updated 10 years ago
- ☆14 · Jan 5, 2022 · Updated 4 years ago
- ☆12 · May 8, 2021 · Updated 4 years ago