CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
☆63Feb 7, 2022Updated 4 years ago
Alternatives and similar repositories for crossmodal-contrastive-learning
Users that are interested in crossmodal-contrastive-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Video Contrastive Learning with Global Context, ICCVW 2021☆162May 30, 2022Updated 3 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆24Jul 9, 2019Updated 6 years ago
- ☆12Oct 4, 2023Updated 2 years ago
- Source code of the TextLap model, a LLM for text-2-layout generation.☆18Oct 21, 2024Updated last year
- NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory. CVPR 2023.☆17Jan 26, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICML 2025] I2MoE: Interpretable Multimodal Interaction-aware Mixture-of-Experts.☆67May 31, 2025Updated 11 months ago
- Learning Spatiotemporal Features via Video and Text Pair Discrimination☆60Jan 20, 2021Updated 5 years ago
- Vision Transformers are Parameter-Efficient Audio-Visual Learners☆106Aug 11, 2023Updated 2 years ago
- ☆31Jun 18, 2021Updated 4 years ago
- ☆21Jul 3, 2019Updated 6 years ago
- (WACV 2021) Temporal Context Aggregation for Video Retrieval with Contrastive Learning☆29Aug 4, 2021Updated 4 years ago
- A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Oct 18, 2021Updated 4 years ago
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆55Mar 28, 2024Updated 2 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)☆91Oct 24, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Decoupling common and unique representations for multimodal self-supervised learning☆74Aug 14, 2024Updated last year
- [ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆41Nov 29, 2023Updated 2 years ago
- ☆26Jan 12, 2022Updated 4 years ago
- [arXiv22] Disentangled Representation Learning for Text-Video Retrieval☆97Apr 7, 2022Updated 4 years ago
- [WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"☆16Feb 24, 2025Updated last year
- PyTorch code for "Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes" (CVPR, 2022…☆32Jul 8, 2024Updated last year
- ☆45May 20, 2025Updated last year
- Multimodal datasets.☆34Jan 26, 2024Updated 2 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Oct 18, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A simple and effective feature extractor for untrimmed videos☆13Sep 1, 2022Updated 3 years ago
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"☆93Mar 9, 2025Updated last year
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…☆18Oct 19, 2022Updated 3 years ago
- ☆259Dec 10, 2022Updated 3 years ago
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency☆17Dec 2, 2021Updated 4 years ago
- ☆87Mar 4, 2024Updated 2 years ago
- Convolutional Neural Network trained for age prediction using a large (n=11,729) set of MRI scans from a highly diversified cohort spanni…☆60Sep 4, 2020Updated 5 years ago
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Dec 4, 2021Updated 4 years ago
- Audio Visual Instance Discrimination with Cross-Modal Agreement☆131Aug 13, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆58Apr 24, 2024Updated 2 years ago
- ☆28Oct 19, 2021Updated 4 years ago
- Code for COLING 2022 paper: Modeling Intra- and Inter-Modal Relations: Hierarchical Graph Contrastive Learning for Multimodal Sentiment A…☆10May 28, 2023Updated 2 years ago
- This code provides the implementation of Semantic Human Matting paper (by Alibaba)☆12Sep 18, 2019Updated 6 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- The backup repository for FairytaleQA dataset and paper "Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset f…☆10May 30, 2023Updated 2 years ago
- ☆11Nov 27, 2019Updated 6 years ago