CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
☆62Feb 7, 2022Updated 4 years ago
Alternatives and similar repositories for crossmodal-contrastive-learning
Users that are interested in crossmodal-contrastive-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Apr 6, 2021Updated 5 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021☆162May 30, 2022Updated 4 years ago
- PyTorch Implementation of Attention Prompt Tuning: Parameter-Efficient Adaptation of Pre-Trained Models for Action Recognition☆16Mar 12, 2024Updated 2 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆24Jul 9, 2019Updated 6 years ago
- Repo of NeurIPS23☆17Oct 25, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Oct 4, 2023Updated 2 years ago
- NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory. CVPR 2023.☆17Jan 26, 2024Updated 2 years ago
- [ICML 2025] I2MoE: Interpretable Multimodal Interaction-aware Mixture-of-Experts.☆69May 31, 2025Updated last year
- ☆18Jan 13, 2021Updated 5 years ago
- Learning Spatiotemporal Features via Video and Text Pair Discrimination☆60Jan 20, 2021Updated 5 years ago
- Vision Transformers are Parameter-Efficient Audio-Visual Learners☆106Aug 11, 2023Updated 2 years ago
- pre-trained vision and language model summary☆12Apr 20, 2021Updated 5 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- ☆31Jun 18, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…☆154Aug 21, 2024Updated last year
- Official Pytorch implementation for AAAI2021 paper (RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning)☆38Nov 5, 2021Updated 4 years ago
- ☆21Jul 3, 2019Updated 6 years ago
- (WACV 2021) Temporal Context Aggregation for Video Retrieval with Contrastive Learning☆29Aug 4, 2021Updated 4 years ago
- The following scripts are available in order to be able to reproduce the experiments carried out in...☆11Oct 2, 2024Updated last year
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆55Mar 28, 2024Updated 2 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)☆91Oct 24, 2022Updated 3 years ago
- Decoupling common and unique representations for multimodal self-supervised learning☆75Aug 14, 2024Updated last year
- [ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆41Nov 29, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [arXiv22] Disentangled Representation Learning for Text-Video Retrieval☆97Apr 7, 2022Updated 4 years ago
- [WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"☆16Feb 24, 2025Updated last year
- Code and benchmarks for the Semantic Video Retrieval Task☆53Oct 18, 2022Updated 3 years ago
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"☆95Mar 9, 2025Updated last year
- A simple and effective feature extractor for untrimmed videos☆13Sep 1, 2022Updated 3 years ago
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…☆18Oct 19, 2022Updated 3 years ago
- Video content description model for generating descriptions for unconstrained videos☆15Jul 5, 2019Updated 6 years ago
- ☆87Mar 4, 2024Updated 2 years ago
- ☆29Nov 23, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Dec 4, 2021Updated 4 years ago
- Audio Visual Instance Discrimination with Cross-Modal Agreement☆132Aug 13, 2021Updated 4 years ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- Implementation of FGANomaly☆17Sep 22, 2021Updated 4 years ago
- Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)☆38Dec 19, 2021Updated 4 years ago
- A PyTorch implementation of ACNet based on TCSVT 2023 paper "ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image…☆10Dec 8, 2023Updated 2 years ago
- ☆28Oct 19, 2021Updated 4 years ago