alibaba / EssentialMC2
EssentialMC2 Video Understanding.
☆113Updated 2 years ago
Alternatives and similar repositories for EssentialMC2:
Users who are interested in EssentialMC2 are comparing it to the libraries listed below.
- This repository contains the dataset, codebase, and benchmarks for our paper: <CNVid-3.5M: Build, Filter, and Pre-train the Large-scale P…☆25Updated last year
- A real-time GNN-based method. Understanding Image Retrieval Re-Ranking: A Graph Neural Network Perspective☆77Updated 4 years ago
- Product1M☆87Updated 2 years ago
- 🏆 The 1st Place Submission to AICity Challenge 2021 Natural Language-Based Vehicle Retrieval Track (Alibaba-UTS submission)☆94Updated 3 years ago
- [ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources☆44Updated 2 years ago
- Learning Spatiotemporal Features via Video and Text Pair Discrimination☆59Updated 4 years ago
- [NeurIPS 2020] Code for the paper "Balanced Meta-Softmax for Long-Tailed Visual Recognition"☆74Updated 4 years ago
- Self-distillation with Batch Knowledge Ensembling Improves ImageNet Classification☆81Updated 3 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Updated 2 years ago
- Official PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆41Updated 2 years ago
- A memory balanced and communication efficient FullyConnected layer with CrossEntropyLoss model parallel implementation in PyTorch☆85Updated 4 years ago
- Code release for "Self-Tuning for Data-Efficient Deep Learning" (ICML 2021)☆110Updated 3 years ago
- WuDaoMM: a data project☆73Updated 2 years ago
- PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022☆165Updated 2 years ago
- An optimized re-implementation for 2D-TAN: Learning 2D Temporal Localization Networks for Moment Localization with Natural Language (AAAI…☆126Updated 2 years ago
- Ranking-based-Instance-Selection☆32Updated 3 years ago
- OpenCompatible provides a standard compatible training benchmark, covering practical training scenarios.☆25Updated 2 years ago
- ☆34Updated 2 years ago
- An open-source project for long-tail classification☆39Updated 3 years ago
- [VisDA2020 1st Place] Our solution to Domain Adaptive Pedestrian Re-identification in VisDA2020☆57Updated 4 years ago
- Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).☆138Updated 2 years ago
- Large-Scale Pre-training for Person Re-identification with Noisy Labels (LUPerson-NL)☆75Updated 2 years ago
- Replication of Pix2Seq with Pretrained Model☆60Updated 3 years ago
- A full-fledged version of Pix2Seq☆237Updated 3 years ago
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆180Updated last year
- ☆108Updated 2 years ago
- A PyTorch implementation of VIOLET☆137Updated last year
- Official PyTorch implementation of the paper "VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples" (CVP…☆146Updated 3 years ago
- PyTorch code for Towards Backward-Compatible Representation Learning [CVPR 2020 Oral]☆54Updated 3 years ago
- General Vision Benchmark, GV-B, a project from OpenGVLab☆189Updated 3 years ago