zjykzj / MPDatasetLinks
Custom Iterable Dataset Class for Large-Scale Data Loading
☆13Updated 3 years ago
Alternatives and similar repositories for MPDataset
Users that are interested in MPDataset are comparing it to the libraries listed below
Sorting:
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Updated last year
- Code for Cross-dataset Training☆15Updated 4 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 2 years ago
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆14Updated 2 years ago
- ☆13Updated 2 years ago
- ICCV DeeperAction Challenge - Kinetics-TPS Challenge on Part-level Action Parsing and Action Recognition.☆15Updated 4 years ago
- A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Updated 3 years ago
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261☆13Updated 3 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated 2 years ago
- A practice for million-scale multi-domain universal object detection☆27Updated 11 months ago
- CV701 Assignment on Pose Estimation☆17Updated 6 months ago
- [IROS 2021] ADD: A Fine-grained Dynamic Inference Architecture for Semantic Image Segmentation☆10Updated 3 years ago
- ☆13Updated 3 years ago
- ☆17Updated last year
- ☆31Updated 2 years ago
- Market-1501 dataset with super-resolution quality☆19Updated 3 years ago
- Pytorch 1.0 codes(including cuda codes) for Deformable Convolution Version 2☆18Updated 6 years ago
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Updated 3 years ago
- ☆11Updated 7 months ago
- ☆12Updated 3 years ago
- ☆28Updated 2 years ago
- Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces☆35Updated this week
- Vision Longformer For Object Detection☆34Updated 4 years ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆11Updated 3 years ago
- ☆10Updated 3 years ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆37Updated 11 months ago
- Facebook Image Similarity Challenge 2021☆19Updated 3 years ago
- Official Pytorch implementation for Distilling Image Classifiers in Object detection (NeurIPS2021)☆31Updated 3 years ago
- Code for recreating the HoS benchmark of VISOR☆22Updated last year