LiJiaBei-7 / nrccr
Source code of our MM'22 paper Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning
☆21Jun 20, 2024Updated last year
Alternatives and similar repositories for nrccr
Users who are interested in nrccr are comparing it to the repositories listed below
- Source code of our TCSVT'22 paper Reading-strategy Inspired Visual Representation Learning for Text-to-Video Retrieval☆19Feb 13, 2022Updated 4 years ago
- source code of our MGPN in SIGIR 2022☆18Jun 8, 2022Updated 3 years ago
- Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023)☆92Jun 12, 2023Updated 2 years ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆30Apr 19, 2023Updated 2 years ago
- Code for ECCV 2022 paper "Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding"☆29May 31, 2023Updated 2 years ago
- A reading list of papers about Visual Grounding.☆32Aug 24, 2022Updated 3 years ago
- ☆28Feb 2, 2026Updated 2 weeks ago
- 30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days.☆11Dec 15, 2022Updated 3 years ago
- CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training☆34Nov 9, 2021Updated 4 years ago
- Pytorch reproduction of paper "Hierarchical Object Detection with Deep Reinforcement Learning"☆10Oct 3, 2023Updated 2 years ago
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning☆13Apr 14, 2024Updated last year
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆12Dec 5, 2025Updated 2 months ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆22Jun 23, 2025Updated 7 months ago
- Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrie…☆88Jan 10, 2023Updated 3 years ago
- ☆13Nov 28, 2021Updated 4 years ago
- ☆11Jan 29, 2023Updated 3 years ago
- LSTC: Boosting Atomic Action Detection with Long-Short-Term Context☆10Sep 1, 2022Updated 3 years ago
- ☆10May 16, 2022Updated 3 years ago
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023☆11Oct 5, 2023Updated 2 years ago
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions☆20Jul 29, 2025Updated 6 months ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Aug 30, 2023Updated 2 years ago
- ☆10Feb 28, 2018Updated 7 years ago
- ☆12Mar 30, 2023Updated 2 years ago
- Code for WACV 2023 paper "Out-of-distribution Detection via Frequency-regularized Generative Models" by Mu Cai and Yixuan Li☆11May 1, 2023Updated 2 years ago
- [AAAI2023] Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition (SloshNet)☆13Jan 10, 2024Updated 2 years ago
- Embodied Instruction Following in Unknown Environments☆17Dec 8, 2025Updated 2 months ago
- [ACL 2025] RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios☆23Jul 2, 2025Updated 7 months ago
- [EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank P…☆14Mar 4, 2025Updated 11 months ago
- ☆10Sep 24, 2021Updated 4 years ago
- (Neurocomputing) EmoVerse: Enhancing Multimodal Large Language Models for Affective Computing via Multitask Learning☆16Jul 6, 2025Updated 7 months ago
- ☆12Jan 17, 2024Updated 2 years ago
- ☆14Apr 1, 2023Updated 2 years ago
- ☆33May 29, 2025Updated 8 months ago
- [AAAI25] Implementation of paper "WiFi Temporal Activity Detection via Dual Pyramid Network"☆14Aug 26, 2025Updated 5 months ago
- ☆12Oct 12, 2024Updated last year
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos☆12Sep 24, 2024Updated last year
- Official PyTorch implementation for the paper "Interpretable Image Classification via Non-parametric Part Prototype Learning" CVPR 2025.☆25Jun 10, 2025Updated 8 months ago
- mainly aimed at scalable subspace clustering☆11Jun 2, 2017Updated 8 years ago