danielchyeh / this-is-my
Official This-Is-My Dataset published in CVPR 2023
☆16 · Jul 18, 2024 · Updated last year
Alternatives and similar repositories for this-is-my
Users who are interested in this-is-my are comparing it to the repositories listed below
- SMILE: A Multimodal Dataset for Understanding Laughter ☆13 · Jun 15, 2023 · Updated 2 years ago
- ☆11 · Jul 31, 2022 · Updated 3 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆25 · Nov 23, 2024 · Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 · May 24, 2023 · Updated 2 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally? ☆35 · Apr 27, 2023 · Updated 2 years ago
- ☆13 · Jul 20, 2024 · Updated last year
- ☆18 · Jan 30, 2023 · Updated 3 years ago
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation) ☆16 · Jan 18, 2024 · Updated 2 years ago
- Showing how to use CLIP-Vip to do video search ☆16 · Nov 16, 2023 · Updated 2 years ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ☆45 · Nov 29, 2023 · Updated 2 years ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation… ☆22 · Nov 8, 2023 · Updated 2 years ago
- ActMAD: Activation Matching to Align Distributions for Test-Time-Training (CVPR 2023) ☆21 · Jun 27, 2023 · Updated 2 years ago
- ☆54 · Jul 31, 2022 · Updated 3 years ago
- Collection of pitch (f0, fundamental frequency) detection algorithms with unified interface ☆24 · Nov 25, 2024 · Updated last year
- Composed Video Retrieval ☆62 · May 2, 2024 · Updated last year
- ☆31 · Mar 24, 2022 · Updated 3 years ago
- [ECCV 2024] Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance ☆39 · Sep 7, 2024 · Updated last year
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity ☆25 · Jul 11, 2023 · Updated 2 years ago
- Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022) ☆30 · Dec 1, 2022 · Updated 3 years ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation ☆34 · Mar 7, 2025 · Updated 11 months ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation ☆41 · Dec 23, 2023 · Updated 2 years ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model' ☆33 · Nov 7, 2023 · Updated 2 years ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight ☆37 · May 23, 2023 · Updated 2 years ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024) ☆88 · Feb 2, 2025 · Updated last year
- Data repository for the VALSE benchmark. ☆37 · Feb 15, 2024 · Updated last year
- NNVisBuilder and some cases including KD-t ☆12 · Nov 18, 2023 · Updated 2 years ago
- 4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022 ☆43 · Jul 24, 2023 · Updated 2 years ago
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition ☆38 · Apr 27, 2024 · Updated last year
- PyTorch Implementation of the paper "Defining and Quantifying the Emergence of Sparse Concepts in DNNs" (CVPR 2023) ☆12 · Dec 24, 2023 · Updated 2 years ago
- ☆11 · Oct 27, 2019 · Updated 6 years ago
- Multi-Agent LLM System for Digital Scam Protection ☆12 · Dec 19, 2024 · Updated last year
- Solution of Kaggle competition: MAP - Charting Student Math Misunderstandings ☆23 · Oct 25, 2025 · Updated 3 months ago
- Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition ☆10 · Oct 31, 2022 · Updated 3 years ago
- A Python implementation of PROCLUS: PROjected CLUStering algorithm. ☆10 · Jan 12, 2015 · Updated 11 years ago
- Official codebase for "Context Aware Deep Learning for Multi Modal Depression Detection" [ICASSP 2019, Oral] ☆11 · Dec 26, 2024 · Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning ☆41 · Sep 25, 2023 · Updated 2 years ago
- Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time ☆46 · Jun 11, 2024 · Updated last year
- ☆11 · May 5, 2022 · Updated 3 years ago
- SING: SDE Inference via Natural Gradients ☆36 · Dec 9, 2025 · Updated 2 months ago