X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024
☆11Nov 7, 2024Updated last year
Alternatives and similar repositories for xmic
Users that are interested in xmic are comparing it to the libraries listed below
Sorting:
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023☆11Oct 5, 2023Updated 2 years ago
- Dense facial landmarks for thermal imaging☆14Dec 23, 2025Updated 2 months ago
- ☆16Jan 3, 2023Updated 3 years ago
- Temperature Schedules for self-supervised contrastive methods on long-tail data (ICLR'23)☆18Apr 25, 2023Updated 2 years ago
- [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"☆52Jun 16, 2025Updated 8 months ago
- Flow-Registration toolbox for 2P motion compensation☆27Sep 16, 2025Updated 5 months ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆29Apr 16, 2024Updated last year
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training☆38Mar 27, 2025Updated 11 months ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Composed Video Retrieval☆62May 2, 2024Updated last year
- Official PyTorch code of GroundVQA (CVPR'24)☆64Sep 13, 2024Updated last year
- Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models☆27Nov 29, 2023Updated 2 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- Awesome Vision-Language Pretraining Papers☆40Jan 15, 2025Updated last year
- Official Implementation of "Interpretable 3D Neural Object Volumes for Robust Conceptual Reasoning." ICLR 2026.☆30Feb 3, 2026Updated 3 weeks ago
- UCAS 数据挖掘课程项目 Option 1: 2020 CCF 大数据与计算智能大赛 风电机组异常数据识别与清洗☆10Aug 15, 2021Updated 4 years ago
- Official implementation of `Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning`, CVPR 2025☆13Aug 1, 2025Updated 7 months ago
- [CVPR2025] Official code for Lost in Translation Found in Context☆23Jan 14, 2026Updated last month
- Improving Continuous Sign Language Recognition with Adapted Image Models☆14Nov 10, 2025Updated 3 months ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- Multimodal language model benchmark, featuring challenging examples☆184Dec 18, 2024Updated last year
- ☆11May 17, 2024Updated last year
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated 3 weeks ago
- ☆11Sep 1, 2024Updated last year
- [ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization☆12Oct 8, 2024Updated last year
- Searching for a Strategy: Modelling Player Trajectories in Soccer Games using Social LSTM☆16Dec 20, 2017Updated 8 years ago
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Jun 11, 2023Updated 2 years ago
- ☆11Aug 7, 2025Updated 6 months ago
- In this project, facial recognition algorithm is implemented with python using PCA and SVD dimensionality reduction tools.☆10Sep 2, 2019Updated 6 years ago
- ☆10Jul 5, 2024Updated last year
- The Pytorch implementation for "GraphMDN: Leveraging graph structure and deep learning to solve inverse problems" (IJCNN 2021).☆18Jul 26, 2021Updated 4 years ago
- ☆14Jan 5, 2022Updated 4 years ago
- Ranking-Consistent Language-Image Pretraining☆12Oct 24, 2025Updated 4 months ago
- ☆10Mar 30, 2023Updated 2 years ago
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.☆14Jun 7, 2023Updated 2 years ago
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)☆45Apr 9, 2025Updated 10 months ago
- ☆12Oct 10, 2023Updated 2 years ago
- [ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap☆12Jun 18, 2025Updated 8 months ago
- "Roll with the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning" by Yue Duan (AAAI 2024…☆13Nov 20, 2025Updated 3 months ago