danielchyeh / this-is-myView external linksLinks
Official This-Is-My Dataset published in CVPR 2023
☆16Jul 18, 2024Updated last year
Alternatives and similar repositories for this-is-my
Users that are interested in this-is-my are comparing it to the libraries listed below
Sorting:
- ☆11Jul 31, 2022Updated 3 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- ☆18Jan 30, 2023Updated 3 years ago
- ☆13Jul 20, 2024Updated last year
- showing how to use CLIP-Vip to do video search☆16Nov 16, 2023Updated 2 years ago
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆16Jan 18, 2024Updated 2 years ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆45Nov 29, 2023Updated 2 years ago
- ActMAD: Activation Matching to Align Distributions for Test-Time-Training (CVPR 2023)☆21Jun 27, 2023Updated 2 years ago
- ☆54Jul 31, 2022Updated 3 years ago
- We build a novel self-supervised segmentation pipeline to segment transparent liquids (clear water) placed inside transparent containers.☆26Nov 22, 2022Updated 3 years ago
- Official PyTorch implementation of Vision DiffMask, a post-hoc interpretation method for vision models.☆32Mar 5, 2024Updated last year
- Composed Video Retrieval☆62May 2, 2024Updated last year
- [ECCV 2024] Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance☆39Sep 7, 2024Updated last year
- Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022)☆30Dec 1, 2022Updated 3 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity☆25Jul 11, 2023Updated 2 years ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆41Dec 23, 2023Updated 2 years ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆34Mar 7, 2025Updated 11 months ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆37May 23, 2023Updated 2 years ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Nov 7, 2023Updated 2 years ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆88Feb 2, 2025Updated last year
- Data repository for the VALSE benchmark.☆37Feb 15, 2024Updated last year
- Official codebase for "Context Aware Deep Learning for Multi Modal Depression Detection" [ICASSP 2019, Oral]☆11Dec 26, 2024Updated last year
- PyTorch Implementation of the paper "Defining and Quantifying the Emergence of Sparse Concepts in DNNs" (CVPR 2023)☆12Dec 24, 2023Updated 2 years ago
- Multi-Agent LLM System for Digital Scam Protection☆12Dec 19, 2024Updated last year
- Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition☆10Oct 31, 2022Updated 3 years ago
- Solution of Kaggle competition: MAP - Charting Student Math Misunderstandings☆23Oct 25, 2025Updated 3 months ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Sep 25, 2023Updated 2 years ago
- Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time☆46Jun 11, 2024Updated last year
- IRFL: Image Recognition of Figurative Language☆11Nov 30, 2023Updated 2 years ago
- Anchor Assignment and Sampling Heuristics in Deep Object Detection: A Review☆11Aug 2, 2022Updated 3 years ago
- A method to generate counterfactuals☆12Mar 13, 2025Updated 11 months ago
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 3 years ago
- Code for WACV24 work for multiview acoustic-visual detection☆13Mar 22, 2024Updated last year
- ☆12May 8, 2021Updated 4 years ago
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- Efficient SDE samplers including Gaussian-based probabilistic solvers. Written in JAX.☆10Feb 8, 2025Updated last year
- An overview of popular reranking models and architectures for 2 stage RAG pipelines☆20Jun 10, 2025Updated 8 months ago